SCAO — Optimizer
p/scao-optimizer
I built a 2nd-order optimizer for LLMs.
0 reviews2 followers
Start new thread
trending
Danilo Souza

5d ago

I was tired of OOM errors while fine-tuning, so I built my own optimizer

Hey Product Hunt!

If you ve ever tried to fine-tune an LLM locally, you know the "Cuda Out of Memory" heartbreak.

I wanted the convergence speed of 2nd-order optimizers (like Shampoo), but those methods usually destroy consumer GPUs because they require massive matrix inversions.

Danilo Souza

4d ago

SCAO — Optimizer - I built a 2nd-order optimizer for LLMs.

54% faster LLM training. SCAO is a sparse, second-order PyTorch optimizer designed as a high-throughput, drop-in replacement for AdamW. - whispering3/scao