Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators Paper • 2403.16950 • Published Mar 25, 2024 • 4
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners Paper • 2406.02537 • Published Jun 4, 2024
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments Paper • 2406.11370 • Published Jun 17, 2024
From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation Paper • 2502.00330 • Published Feb 1
Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Paper • 2502.02533 • Published Feb 4 • 3
Retrofitting (Large) Language Models with Dynamic Tokenization Paper • 2411.18553 • Published Nov 27, 2024 • 2
Cross-Tokenizer Distillation via Approximate Likelihood Matching Paper • 2503.20083 • Published Mar 25 • 1
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10 • 101
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 10
Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Paper • 2404.03098 • Published Apr 3, 2024
LegalVis: Exploring and Inferring Precedent Citations in Legal Documents Paper • 2203.02001 • Published Mar 3, 2022
Distill n' Explain: explaining graph neural networks using simple surrogates Paper • 2303.10139 • Published Mar 17, 2023 • 1
Empirical analysis of Binding Precedent efficiency in the Brazilian Supreme Court via Similar Case Retrieval Paper • 2407.07004 • Published Jul 9, 2024
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 66