2 8 4

Raphaël Merx

raphaelmerx

AI & ML interests

None yet

Recent Activity

updated a dataset 9 days ago

raphaelmerx/openwho

upvoted an article 28 days ago

Supercharge your OCR Pipelines with Open Models

commented on an article about 2 months ago

There is no such thing as a tokenizer-free lunch

View all activity

Organizations

updated a dataset 9 days ago

raphaelmerx/openwho

Viewer • Updated 9 days ago • 30.8k • 10 • 1

upvoted an article 28 days ago

Article

Supercharge your OCR Pipelines with Open Models

29 days ago

•

256

commented on There is no such thing as a tokenizer-free lunch about 2 months ago

Really cool post! In particular this was eye-opening to me:

However, I would consider both Unicode and UTF-8 to be tokenizers.

upvoted an article about 2 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25

•

New activity in google/gemma-3-1b-it about 2 months ago

Remove processor class from tokenizer_config.json

#7 opened 8 months ago by

Xenova

Add chat_template.json

#30 opened about 2 months ago by

raphaelmerx

upvoted 2 articles 2 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

•

397

Article

An Analysis of Multilingual Models on Hugging Face

Sep 18

•

updated 2 datasets 5 months ago

raphaelmerx/openwho

Viewer • Updated 9 days ago • 30.8k • 10 • 1

raphaelmerx/openwho

Viewer • Updated 9 days ago • 30.8k • 10 • 1

Raphaël Merx

AI & ML interests

Recent Activity

Organizations

raphaelmerx's activity

Supercharge your OCR Pipelines with Open Models

There is no such thing as a tokenizer-free lunch

Remove processor class from tokenizer_config.json

Add chat_template.json

You could have designed state of the art positional encoding

An Analysis of Multilingual Models on Hugging Face