Long Yi

IncidentFox - AI on-call engineer for enterprise

by
IncidentFox is an AI assistant that investigates production incidents for software teams. It automatically gathers context across logs, metrics, traces, Kubernetes, cloud tools, Slack, runbooks, and past incidents to produce root cause analysis and mitigations. Unlike existing tools, it analyzes your system on setup and auto-builds integrations, eliminating weeks of manual setup while boosting accuracy. Slack-first by design, it continuously learns from every incident.

Add a comment

Replies

Best
Long Yi
Maker
📌
Hey Product Hunt! I’m Long Yi, founder of IncidentFox. We built IncidentFox after too many incidents where the hardest part wasn’t fixing the issue, but figuring out what was actually going on. Most on-call tools are great at alerting and escalation, but they stop short of investigation. Engineers are left to manually dig through metrics, logs, dashboards, and runbooks under pressure. IncidentFox acts like an AI SRE. When an incident happens, it forms hypotheses, queries your infrastructure and observability tools, correlates signals, and reasons its way toward likely root causes, all inside Slack. It learns from every investigation so future incidents get easier. The core is fully open source and can be self-hosted. We’d love feedback from SREs, DevOps, and platform engineers, especially on where this helps and where it falls short. Happy to answer questions and go deep on the technical details.