GitHub
BERT, Tokenizer, Python, WordPiece, pybind11, C++, Flash, Trie
**FlashTokenizer: World's Fastest CPU Tokenizer!** 8~15x faster than `BertTokenizerFast` · High-performance C++ · Parallel with OpenMP · Easy pip install · Cross-platform (Win/Mac/Linux) · Demo: https://youtu.be/a_sTiAXeSE0
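For readers unfamiliar with the WordPiece and Trie topics listed above, here is a minimal Python sketch of greedy longest-match-first WordPiece splitting driven by a trie. It is an illustration of the general technique only, not FlashTokenizer's actual code or API; the toy vocabulary and every function name (`build_trie`, `longest_match`, `wordpiece`, `build_tries`) are assumptions made up for this example.

```python
# Illustrative sketch only: greedy longest-match-first WordPiece over a trie.
# This is NOT FlashTokenizer's implementation; names and vocabulary are hypothetical.

class TrieNode:
    __slots__ = ("children", "is_token")

    def __init__(self):
        self.children = {}     # maps a character to the next TrieNode
        self.is_token = False  # True if the path from the root spells a vocab entry


def build_trie(tokens):
    """Insert every token into a character trie and return the root."""
    root = TrieNode()
    for token in tokens:
        node = root
        for ch in token:
            node = node.children.setdefault(ch, TrieNode())
        node.is_token = True
    return root


def longest_match(root, text, start):
    """Return the end index of the longest vocab entry starting at `start`, or -1."""
    node, best = root, -1
    for i in range(start, len(text)):
        node = node.children.get(text[i])
        if node is None:
            break
        if node.is_token:
            best = i + 1
    return best


def build_tries(vocab):
    """Split the vocab into word-initial pieces and '##' continuation pieces."""
    initial = build_trie(t for t in vocab if not t.startswith("##"))
    suffix = build_trie(t[2:] for t in vocab if t.startswith("##"))
    return initial, suffix


def wordpiece(word, initial_trie, suffix_trie, unk="[UNK]"):
    """Greedily split one word; fall back to `unk` if any position has no match."""
    pieces, pos = [], 0
    while pos < len(word):
        trie = initial_trie if pos == 0 else suffix_trie
        end = longest_match(trie, word, pos)
        if end == -1:
            return [unk]
        piece = word[pos:end]
        pieces.append(piece if pos == 0 else "##" + piece)
        pos = end
    return pieces


# Toy vocabulary (an assumption for this example, not a real BERT vocab).
VOCAB = ["un", "##aff", "##able", "##a", "play", "##ing", "[UNK]"]
INITIAL, SUFFIX = build_tries(VOCAB)

print(wordpiece("unaffable", INITIAL, SUFFIX))  # ['un', '##aff', '##able']
print(wordpiece("playing", INITIAL, SUFFIX))    # ['play', '##ing']
print(wordpiece("xyz", INITIAL, SUFFIX))        # ['[UNK]']
```

The trie lets each position find its longest matching piece in a single left-to-right walk instead of probing progressively shorter substrings against a hash set. A production tokenizer would also handle text normalization, punctuation splitting, and a maximum word length, all of which this sketch omits.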





