Erfan Shayegani 😈

Erfan-Shayegani

https://erfanshayegani.github.io/

AI & ML interests

AI Safety - Responsible AI - Multi-Modal Alignment

Recent Activity

upvoted a paper 17 days ago

The Collaboration Gap

upvoted a paper about 1 month ago

Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots

authored a paper about 2 months ago

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

authored a paper about 2 months ago

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness

Paper • 2510.01670 • Published Oct 2 • 6

authored 2 papers 7 months ago

Misaligned Roles, Misplaced Images: Structural Input Perturbations Expose Multimodal Alignment Blind Spots

Paper • 2504.03735 • Published Apr 1 • 1

Unfair Alignment: Examining Safety Alignment Across Vision Encoder Layers in Vision-Language Models

Paper • 2411.04291 • Published Nov 6, 2024

authored 3 papers over 1 year ago

Cross-Modal Safety Alignment: Is textual unlearning all you need?

Paper • 2406.02575 • Published May 27, 2024 • 1

Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks

Paper • 2310.10844 • Published Oct 16, 2023

Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

Paper • 2307.14539 • Published Jul 26, 2023 • 2