Byung-Gon Chun

PeriFlow - Supercharge generative AI serving

by
PeriFlow is an innovative serving engine for generative AI models including LLMs. PeriFlow achieves speed at low costs, giving 70~90% GPU savings. PeriFlow has two deployment options: PeriFlow Container and PeriFlow Cloud.

Add a comment

Replies

Be the first to comment