Research

Publications

Posted on
11 Jun 202620:45
Research preview

Cadmus: A tokenizer that reads Europe in its own words

A multilingual tokenizer that leads every major open source tokenizer we tested on European language compression.