Why "top" missed a cron job that was killing our API latency(parth21shah.substack.com) |
Why "top" missed a cron job that was killing our API latency(parth21shah.substack.com) |
Anything that appears + disappears between polls is effectively invisible unless you’re streaming syscalls/process events. It’s surprising how often “short-lived, high-impact” processes cause the worst production spikes.
Curious whether you’re planning to surface this at the scheduler level (run queue latency/involuntary context switches) or stick to process-lifecycle tracing?