Just as conditions can change rapidly in response to the unpredictable nature of flight, streaming analytics on infrastructure and application telemetry help ensure that the operations team is focused on the right things at any given moment. During the flight, like in a distributed cloud environment in production, alerting on the atmospheric and computer data that indicate performance relative to historical and expected patterns is the best way to know when a pilot (or infrastructure engineer) needs to take action. A modern infrastructure monitoring solution aggregates metrics from every element manager, including APM, to become the system of record for real-time operations.
After the plane has landed, and you once again have the luxury of time, log management helps engineers evaluate any systems errors in-depth to ensure that specific action can happen prior to the next scheduled flight to avoid similar events. Like the black box recorder, logs are incredibly useful for root cause analysis a er a problem has occurred. Once the outcome is known, log management provides many of the details on chains of events, dependencies, and the decisions that contributed along the way.
SignalFx is the best way to aggregate and alert on streaming metrics, helping today’s dev and ops teams fill the gap between APM’s pre-flight performance engineering and log management’s post-mortem event analysis. SignalFx’s real-time visibility into and analytics on the live production environment also help rationalize your existing investments with better overall results.