Daniel Khashabi

danyaljj

danyaljj

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains

authored a paper 24 days ago

World-in-World: World Models in a Closed-Loop World

upvoted a paper 24 days ago

World-in-World: World Models in a Closed-Loop World

View all activity

Organizations

upvoted a paper 11 days ago

SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains

Paper • 2507.07229 • Published Jul 9 • 11

upvoted a paper 24 days ago

World-in-World: World Models in a Closed-Loop World

Paper • 2510.18135 • Published 28 days ago • 88

upvoted a paper 25 days ago

MedScore: Generalizable Factuality Evaluation of Free-Form Medical Answers by Domain-adapted Claim Decomposition and Verification

Paper • 2505.18452 • Published May 24 • 4

upvoted a paper about 1 month ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published Oct 9 • 40

upvoted 2 papers about 2 months ago

IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning

Paper • 2509.22621 • Published Sep 26 • 8

The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks

Paper • 2509.25671 • Published Sep 30 • 6

upvoted a paper 2 months ago

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Paper • 2509.06888 • Published Sep 8 • 12

upvoted 3 papers 3 months ago

upvoted a paper 4 months ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15 • 28

upvoted 3 papers 5 months ago

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Paper • 2506.22724 • Published Jun 28 • 10

Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

Paper • 2506.02327 • Published Jun 2 • 20

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Paper • 2506.11930 • Published Jun 13 • 53

upvoted 2 papers 6 months ago

BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases

Paper • 2505.20321 • Published May 23 • 5

Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find

Paper • 2505.18148 • Published May 23 • 5

upvoted 2 papers 7 months ago

Certified Mitigation of Worst-Case LLM Copyright Infringement

Paper • 2504.16046 • Published Apr 22 • 13

ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers

Paper • 2504.19395 • Published Apr 28 • 5

upvoted a paper 9 months ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 28

upvoted a paper 11 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 157

Daniel Khashabi

AI & ML interests

Recent Activity

Organizations

danyaljj's activity