YANG SHU
babytreecc
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced
Misalignment
upvoted
a
paper
about 2 months ago
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced
Misalignment
updated
a dataset
about 2 months ago
babytreecc/DeliberationBank