arxiv:2510.08759
Siyuan Ma
HuggingFriends
AI & ML interests
None yet
Recent Activity
authored a paper about 1 month ago
JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks
authored a paper about 1 month ago
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
authored a paper about 1 month ago
Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent
Organizations
None yet