Safety classifiers fine-tuned on a bilingual dataset composed of the English QA pairs from BeaverTails and the Italian QA pairs from BeaverTails-IT.
Giuseppe Magazzù
saiteki-kai
AI & ML interests
My research focuses on the development of safety mitigation strategies and benchmarks for large language models.
Recent Activity
liked a model about 20 hours ago: Qwen/Qwen3Guard-Gen-8B
liked a model 15 days ago: prem-research/MiniGuard-v0.1
upvoted an article 15 days ago: MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier