Introduction

What is Graphsignal

Graphsignal is an inference observability platform that helps developers accelerate and troubleshoot AI systems. It provides essential visibility across the inference stack, including:

  • Continuous, high-resolution profiling timelines exposing operation durations and resource utilization across inference workloads.
  • LLM generation tracing with per-step timing, token throughput, and latency breakdowns for major inference frameworks.
  • System-level metrics for inference engines and hardware (CPU, GPU, accelerators).
  • Error monitoring for device-level failures, runtime exceptions, and inference errors.

How it works

Graphsignal SDK is added to application code. It automatically measures and records operations in single-run scripts as well as long-running server applications.

After recording, the performance data is sent to Graphsignal servers, post-processed and ready to be analyzed at app.graphsignal.com. This allows Graphsignal to run in any environment without the need to install any additional software.

Getting started

  • Sign up for an account.
  • See the Quick Start guide on how to add Graphsignal to your application.