AI & ML interests
None defined yet.
Recent Activity
View all activity
models
70
science-of-finetuning/SAE-chat-gemma-2-2b-L13-k100-x32-lr1e-04-local-shuffling-ft-chat
Updated
•
5
science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-mu5.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Updated
•
11
science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-k256-lr1e-04-local-shuffling-Crosscoder
Updated
•
7
science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-k55-lr1e-04-local-shuffling-Crosscoder
Updated
•
4
science-of-finetuning/gemma-2-2b-gemma-2-2b-it-L13-mu2.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Updated
•
6
science-of-finetuning/SAE-base-Llama-3.2-1B-L8-k100-x32-lr1e-04-local-shuffling
Updated
•
3
science-of-finetuning/R1dist-Qwen-1.5B-Nemotron-L16-k100-lr1e-04-local-shuffling-CCLoss
Updated
science-of-finetuning/R1dist-Qwen-1.5B-Nemotron-L16-mu3.6e-02-lr1e-04-local-shuffling-CCLoss
Updated
science-of-finetuning/gemma-2-2b-it-Meditron3-L16-k100-lr1e-04-local-shuffling-CCLoss
Updated
•
1
science-of-finetuning/gemma-2-2b-it-Meditron3-L16-mu3.8e-02-lr1e-04-local-shuffling-CCLoss
Updated
•
1
datasets
98
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-mu5.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Viewer
•
Updated
•
73.7k
•
10
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-k55-lr1e-04-local-shuffling-Crosscoder
Viewer
•
Updated
•
73.7k
•
8
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-k256-lr1e-04-local-shuffling-Crosscoder
Viewer
•
Updated
•
73.7k
•
11
science-of-finetuning/diffing-stats-gemma-2-2b-gemma-2-2b-it-L13-mu2.5e-02-lr1e-04-local-shuffling-CrosscoderLoss
Viewer
•
Updated
•
73.7k
•
5
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-k200-lr1e-04-local-shuffling-Crosscoder-ni0.3-ka1k5k
Viewer
•
Updated
•
131k
•
6
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-mu2.0e-02-lr1e-04-local-shuffling-CCLoss
Viewer
•
Updated
•
131k
•
8
science-of-finetuning/diffing-stats-Meta-Llama-3.1-8B-L16-k222-lr1e-04-local-shuffling-Crosscoder
Viewer
•
Updated
•
131k
•
7
science-of-finetuning/diffing-stats-Llama-3.2-1B-L8-mu3.6e-02-lr1e-04-local-shuffling-CrosscoderLoss
Viewer
•
Updated
•
65.5k
•
5
science-of-finetuning/ultrachat_200k_generated_llama3.1-8b-Instruct-mini
Viewer
•
Updated
•
3.97k
•
4
science-of-finetuning/diffing-stats-gemma-2-2b-it-Meditron3-L16-mu3.8e-02-lr1e-04-local-shuffling-CCLoss
Viewer
•
Updated
•
73.7k
•
4