This technique is not bitter-lesson-pilled at all. Waste of time when the model will just learn to do this anyway.
Kian Kyars PRO
kyars
AI & ML interests
None yet
Recent Activity
commented on an article about 1 month ago
Efficient LLM Pretraining: Packed Sequences and Masked Attention
commented on a paper 3 months ago
Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers
