albertge/ni-unique-100-tasks-modernbert-split-kmeans-dim768-20250923 Viewer • Updated Sep 23 • 285k • 65
albertge/ni-unique-100-tasks-modernbert-split-kmeans-dim768-20250923 Viewer • Updated Sep 23 • 285k • 65
albertge/databricks-dolly-15k-modernbert-split-kmeans-dim768-20250917 Viewer • Updated Sep 17 • 15k • 33
albertge/databricks-dolly-15k-modernbert-split-kmeans-dim768-20250917 Viewer • Updated Sep 17 • 15k • 33
albertge/databricks-dolly-15k-tfidf-sweep-kmeans-dim10000-20250914 Viewer • Updated Sep 14 • 15k • 14
albertge/databricks-dolly-15k-tfidf-sweep-kmeans-dim10000-20250914 Viewer • Updated Sep 14 • 15k • 14
albertge/databricks-dolly-15k-tfidf-train-kmeans-dim10000-20250914 Viewer • Updated Sep 14 • 15k • 13
albertge/databricks-dolly-15k-tfidf-train-kmeans-dim10000-20250914 Viewer • Updated Sep 14 • 15k • 13
albertge/databricks-dolly-15k-modernbert-train-kmeans-dim768-20250723 Viewer • Updated Jul 23 • 15k • 28
albertge/databricks-dolly-15k-modernbert-train-kmeans-dim768-20250723 Viewer • Updated Jul 23 • 15k • 28
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training Paper • 2505.00358 • Published May 1 • 26