Language Technology Lab @University of Cambridge

university

http://ltl.mml.cam.ac.uk/

cambridgeltl

Activity Feed Request to join this org

AI & ML interests

Representation Learning, Multilingual NLP, Multimodal NLP, BioNLP, Self-Supervised Learning, Explainable AI

Recent Activity

ljvmiranda921 authored a paper 8 days ago

FilBench: Can LLMs Understand and Generate Filipino?

ljvmiranda921 authored a paper 5 months ago

R3: Robust Rubric-Agnostic Reward Models

hzhouml authored a paper 6 months ago

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

View all activity

ljvmiranda921

authored a paper 8 days ago

FilBench: Can LLMs Understand and Generate Filipino?

Paper • 2508.03523 • Published Aug 5

ljvmiranda921

authored a paper 5 months ago

R3: Robust Rubric-Agnostic Reward Models

Paper • 2505.13388 • Published May 19 • 11

hzhouml

authored 3 papers 6 months ago

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Paper • 2403.16950 • Published Mar 25, 2024 • 4

TopViewRS: Vision-Language Models as Top-View Spatial Reasoners

Paper • 2406.02537 • Published Jun 4, 2024

Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments

Paper • 2406.11370 • Published Jun 17, 2024

ivulic

authored a paper 6 months ago

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 56

hzhouml

authored 3 papers 6 months ago

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 56

From Few to Many: Self-Improving Many-Shot Reasoners Through Iterative Optimization and Generation

Paper • 2502.00330 • Published Feb 1

Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies

Paper • 2502.02533 • Published Feb 4 • 3

benjamin

authored 2 papers 7 months ago

Retrofitting (Large) Language Models with Dynamic Tokenization

Paper • 2411.18553 • Published Nov 27, 2024 • 2

Cross-Tokenizer Distillation via Approximate Likelihood Matching

Paper • 2503.20083 • Published Mar 25 • 1

ljvmiranda921

authored 2 papers 8 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 41

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 101

ljvmiranda921

authored 2 papers 10 months ago

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 10

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

lucasresck

authored 4 papers 11 months ago

Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales

Paper • 2404.03098 • Published Apr 3, 2024

LegalVis: Exploring and Inferring Precedent Citations in Legal Documents

Paper • 2203.02001 • Published Mar 3, 2022

Distill n' Explain: explaining graph neural networks using simple surrogates

Paper • 2303.10139 • Published Mar 17, 2023 • 1

Empirical analysis of Binding Precedent efficiency in the Brazilian Supreme Court via Similar Case Retrieval

Paper • 2407.07004 • Published Jul 9, 2024

ljvmiranda921

authored a paper 12 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 66