We shipped a new feature in oneinfer-edge (fully open source) to connect your locally deployed model to coding copilots like codex, OpenClaw, OpenCode and kilo code etc....
No plugin. No config file. No IDE restart. You click ONEINFER, a local proxy intercepts your copilot's requests, translates the format, routes to your self hosted model, and returns the response.
Your IDE doesn't know anything changed.
The proxy handles the ugly parts, model name rewriting, response format translation, streaming, so you don't have to spend an afternoon debugging why Codex expects an OpenAI messages format and your local model returns something else.
Switch back to original models in one click, mid-session, no restart. For when you actually need it.
This is just the start. Support for more agentic harnesses and copilots is already in the works, we're expanding the list based on what the community actually uses. So please voice out what you need in the github issues.
oneinfer-edge is the proxy, the hardware compatibility scanner, the inference routing, it's all in the repo. We'd rather you read the code than take our word for it.
Swytchcode
Really awesome. Questions
Do you allow private models?
Is there a cost difference between original model and yours?
Thanks@chilarai for checking our product out, it meant a lot for our team. Feel free to ask followup questions. Now, Answering your questions -
Yes, we support your private models. You can create an endpoint for your private model with available cloud provider and choose a fallback cloud provider, and its done.
No, There is no cost difference for the models we display and the original model costs.