
Abdulhakeem Adefioye

kokolamba

AI & ML interests

None yet

Recent Activity

upvoted a paper 31 minutes ago

Estimating Knowledge in Large Language Models Without Generating a Single Token

updated a dataset 12 days ago

kokolamba/keen_popqa_gpt2xl_generations

published a dataset 12 days ago

kokolamba/keen_popqa_gpt2xl_generations



upvoted a paper 31 minutes ago

Estimating Knowledge in Large Language Models Without Generating a Single Token

Paper • 2406.12673 • Published Jun 18, 2024 • 9

upvoted a collection about 1 month ago

LMEnt

Collection

14 items • Updated Sep 14 • 6

upvoted 2 papers 2 months ago

ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations

Paper • 2505.02819 • Published May 5 • 26

Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

Paper • 2508.04581 • Published Aug 6 • 5

upvoted an article 3 months ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

Mar 18, 2024


upvoted a paper 5 months ago

DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities

Paper • 2410.07722 • Published Oct 10, 2024 • 15

upvoted 2 articles 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8 • 740

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Jul 1 • 132

upvoted an article 7 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15 • 222

upvoted a paper 7 months ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 74

upvoted an article 7 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26 • 177

upvoted 3 papers 7 months ago

Rank-K: Test-Time Reasoning for Listwise Reranking

Paper • 2505.14432 • Published May 20 • 4

Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

Paper • 2505.16967 • Published May 22 • 24

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Paper • 1902.04094 • Published Feb 11, 2019 • 1

upvoted an article 9 months ago

Article

Unlocking Longer Generation with Key-Value Cache Quantization

May 16, 2024


upvoted an article about 1 year ago

Article

Deriving DPO's Loss

Dec 24, 2024


upvoted a paper about 1 year ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38
