2 10 2

Alexander Panfilov

kotekjedi

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

upvoted a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

commented on a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

View all activity

Organizations

authored a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10 • 5

upvoted a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10 • 5

commented a paper about 1 month ago

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Paper • 2510.09462 • Published Oct 10 • 5 •

upvoted 2 papers about 1 month ago

DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

Paper • 2510.07959 • Published Oct 9 • 14

D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models

Paper • 2509.17938 • Published Sep 22 • 4

upvoted 2 papers about 2 months ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Paper • 2509.18058 • Published Sep 22 • 12

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24 • 96

authored a paper about 2 months ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM

Paper • 2509.18058 • Published Sep 22 • 12

updated 2 models about 2 months ago

kotekjedi/qwq3-32b-lora-jailbreak-detection-merged

Updated Sep 18

kotekjedi/qwq3-32b-lora-jailbreak-detection

Updated Sep 17

published 2 models about 2 months ago

kotekjedi/qwq3-32b-lora-jailbreak-detection

Updated Sep 17

kotekjedi/qwq3-32b-lora-jailbreak-detection-merged

Updated Sep 18

updated a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection_v2

Updated Sep 15

published a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection_v2

Updated Sep 15

updated a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection-merged_v2

Text Generation • 33B • Updated Sep 15 • 15

published a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection-merged_v2

Text Generation • 33B • Updated Sep 15 • 15

updated a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection-merged

Text Generation • 33B • Updated Sep 13 • 11

published a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection-merged

Text Generation • 33B • Updated Sep 13 • 11

updated a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection

Updated Sep 13

published a model about 2 months ago

kotekjedi/qwen3-32b-lora-jailbreak-detection

Updated Sep 13

Alexander Panfilov

AI & ML interests

Recent Activity

Organizations

kotekjedi's activity