Aleks left a comment
I gave OLLM a real try for coding with Zed, routing requests to DeepSeek-V3.1 on NEAR, and wanted to share honest feedback. First: this is not just a landing-page idea; it actually works. I ran fairly large coding contexts (20-25k input tokens per request), and inference was stable. Latency varied (a few seconds for short completions, ~20s+ for heavier ones), which is expected with confidential...

OLLM.COM: The Confidential AI Gateway
