"One of the biggest things is that the UI is so simple to use. We will empower engineers to write their own metrics, to monitor their own software better."
How do you use SignalFx?
- Monitoring our production Riak database that handles 250,000 ops/sec at peak
- System level monitoring for all prod systems with CollectD for things like CPU, disk, RAM, etc.
Soon we’ll be sending business metrics to correlate those KPIs with service and infrastructure performance; for example: total impressions at any moment, total clicks at any moment, total conversions, or if any of those went up or down in a statistically significant way tied to a particular code change or roll out.
We frequently use aggregations like 95th percentile and min/max’s, as well SignalFx‘s “timeshift” capability which allows us to stream those aggregations and show them alongside the same exact aggregations from a day, week, or more ago — side by side within seconds of the raw data streaming in. This is very valuable when you’re trying to diagnose a problem because it becomes very evident what has changed.
And we love looking at the number of raw ops/sec flowing through our service.