Thomas Werkmeister

Thomas Werkmeister

Software & ML engineer
TokenfloodSlackGitHub
All activity
Tokenflood allows you to 1) figure out how to slash LLM latency by adjusting prompt parameters 2) assess the load curve of LLM providers before going to production with them
Tokenflood
TokenfloodFigure out who or what is stealing your LLM latency