F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data • Paper • 2510.02294 • Published Oct 2
CodeFuse-CR-Bench: A Comprehensiveness-aware Benchmark for End-to-End Code Review Evaluation in Python Projects • Paper • 2509.14856 • Published Sep 18
CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China • Paper • 2509.09990 • Published Sep 12
From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms • Paper • 2508.10860 • Published Aug 14
Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks • Paper • 2505.16901 • Published May 22
Multilingual Encoder Knows more than You Realize: Shared Weights Pretraining for Extremely Low-Resource Languages • Paper • 2502.10852 • Published Feb 15
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding • Paper • 2411.18462 • Published Nov 27, 2024