GitHub - BERT, Tokenizer, Python, WordPiece, pybind11,C++,Flash,Trie
byโข
๐ **FlashTokenizer: World's Fastest CPU Tokenizer!**
โก 8~15x faster than `BertTokenizerFast`
๐ ๏ธ High-performance C++
๐ Parallel with OpenMP
๐ฆ Easy pip install
๐ป Cross-platform (Win/Mac/Linux)
โถ๏ธ Demo: https://youtu.be/a_sTiAXeSE0

Replies