FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes Paper • 2506.03278 • Published Jun 3 • 6
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey Paper • 2409.11564 • Published Sep 17, 2024 • 20
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey Paper • 2409.11564 • Published Sep 17, 2024 • 20
Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers Paper • 2403.01061 • Published Mar 2, 2024