A multilingual tokenizer that leads every major open source tokenizer we tested on European language compression.