Djellal Mohamed Aniss

dmaniss

djellalmohamedaniss

AI & ML interests

SLM, distillation, synthetic data and reasoning.

Recent Activity

liked a Space 5 days ago

HuggingFaceTB/smol-training-playbook

liked a dataset 8 days ago

LangAGI-Lab/magpie-reasoning-v1-100k-math-verifiable

upvoted a paper 9 days ago

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published Sep 29 • 9

upvoted a paper 21 days ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published Aug 13 • 14

upvoted 3 papers about 1 month ago

upvoted a paper 3 months ago

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 20

upvoted an article 3 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

•

174

upvoted 4 papers 5 months ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23 • 56

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18 • 39

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 49

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81

upvoted an article 7 months ago

Article

An Introduction to AI Model Optimization Techniques

Apr 18

•

upvoted a collection 8 months ago

Orpheus TTS

Collection

TTS Towards Human-Sounding Speech • 2 items • Updated Mar 18 • 71

upvoted a paper 12 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 66

upvoted a paper about 1 year ago

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 42

upvoted a collection about 1 year ago

Synthetic (text) Dataset Generation

Collection

Papers about synthetic dataset generation • 9 items • Updated Jun 21, 2024 • 9

upvoted a paper about 1 year ago

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Paper • 2410.02089 • Published Oct 2, 2024 • 13

upvoted an article about 1 year ago

Article

Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning

Feb 20, 2024

•

upvoted a collection about 1 year ago

MIT Talk 31/10 Papers

Collection

14 items • Updated Oct 28, 2024 • 32

upvoted a paper about 1 year ago

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9, 2024 • 46

Djellal Mohamed Aniss

AI & ML interests

Recent Activity

Organizations

dmaniss's activity

Training and Finetuning Reranker Models with Sentence Transformers v4

An Introduction to AI Model Optimization Techniques

Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning