File size: 1,081 Bytes
b48396b
 
 
3e0ee9b
b48396b
 
 
3e0ee9b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
---
datasets:
- bbunzeck/babylm-german
- bbunzeck/german-babylm-5m-subsets
language:
- de
pipeline_tag: text-generation
---

This is a German BabyLM model trained on a [5M token subset](https://huggingface.co/datasets/bbunzeck/german-babylm-5m-subsets) of the [German BabyLM corpus](https://huggingface.co/datasets/bbunzeck/babylm-german).

If you use this model, please cite the following publication:
```
@inproceedings{bunzeck-etal-2025-construction,
    title = "Do Construction Distributions Shape Formal Language Learning In {G}erman {B}aby{LM}s?",
    author = "Bunzeck, Bastian  and
      Duran, Daniel  and
      Zarrie{\ss}, Sina",
    editor = "Boleda, Gemma  and
      Roth, Michael",
    booktitle = "Proceedings of the 29th Conference on Computational Natural Language Learning",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.conll-1.12/",
    doi = "10.18653/v1/2025.conll-1.12",
    pages = "169--186",
    ISBN = "979-8-89176-271-8",
}
```