Introduction
What is Graphsignal
Graphsignal is an inference observability platform that helps developers accelerate and troubleshoot AI systems. It provides essential visibility across the inference stack, including:
- Continuous, high-resolution profiling timelines exposing operation durations and resource utilization across inference workloads.
- LLM generation tracing with per-step timing, token throughput, and latency breakdowns for major inference frameworks.
- System-level metrics for inference engines and hardware (CPU, GPU, accelerators).
- Error monitoring for device-level failures, runtime exceptions, and inference errors.
- AI debugging to explain performance data and errors, identify bottlenecks, and recommend optimizations across the inference stack.
How it works
The Graphsignal SDK is added to application code, where it automatically measures and records operations in both single-run scripts and long-running server applications.
After recording, the performance data is sent to Graphsignal servers, where it is post-processed and made available for analysis at app.graphsignal.com. This allows Graphsignal to run in any environment without installing additional software.
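The record-then-upload flow described above can be illustrated with a minimal, standard-library-only sketch. It is a conceptual illustration, not the Graphsignal SDK: `OperationRecorder`, `measure`, and `flush` are hypothetical names invented here, and the upload step is replaced by a plain callback. See the Quick Start guide for the actual SDK calls.

```python
import json
import time
from contextlib import contextmanager


class OperationRecorder:
    """Hypothetical in-process recorder; NOT part of the Graphsignal SDK."""

    def __init__(self):
        self.records = []

    @contextmanager
    def measure(self, name):
        # Time the wrapped block and note any exception it raises.
        start = time.perf_counter()
        error = None
        try:
            yield
        except Exception as exc:
            error = repr(exc)
            raise
        finally:
            self.records.append({
                "operation": name,
                "duration_s": time.perf_counter() - start,
                "error": error,
            })

    def flush(self, send):
        # Serialize buffered records, hand them to `send`, then clear the buffer.
        send(json.dumps(self.records))
        self.records = []


recorder = OperationRecorder()

with recorder.measure("generate"):
    sum(range(100_000))  # stand-in for an inference call

uploaded = []
recorder.flush(uploaded.append)  # a real agent would send this to a backend
```

Keeping measurement in-process and deferring analysis to a remote backend is what lets this style of agent run without extra software on the host; the sketch only buffers and serializes, leaving all post-processing to whatever receives the payload.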
Getting started
- Sign up for an account.
- See the Quick Start guide on how to add Graphsignal to your application.