Distributed Inference Monitoring and Profiling

Integration

Graphsignal has built-in support for distributed inference, such as multi-node and multi-GPU inference. When a run involves multiple workers, the dashboards aggregate, structure, and visualize data from all workers.

To identify each run or job, pass tags to the start_trace method.

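import graphsignal

for sample in dataset:
    # Use the same job_id on every worker so the traces are grouped into one job.
    with graphsignal.start_trace(endpoint='predict', tags=dict(job_id='job1')):
        pass  # inference code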
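
When several workers are launched for the same job, it can help to build the tags in one place rather than hard-coding them. The following is a minimal sketch, not part of the Graphsignal API: build_trace_tags is a hypothetical helper, JOB_ID is an assumed environment variable name, and the extra rank tag is an assumption about what is useful to record. RANK is the variable set by launchers such as torchrun.

```python
import os


def build_trace_tags(env=None):
    """Hypothetical helper: collect tags that identify the job and the worker."""
    env = os.environ if env is None else env
    # JOB_ID is an assumed variable name. Every worker of a job must see the
    # same value, otherwise the traces will not share a job_id.
    tags = {'job_id': env.get('JOB_ID', 'unknown')}
    # RANK is set by launchers such as torchrun; skip the tag when it is absent.
    if 'RANK' in env:
        tags['rank'] = env['RANK']
    return tags


# Example with an explicit environment mapping instead of os.environ.
tags = build_trace_tags({'JOB_ID': 'job1', 'RANK': '0'})
print(tags)  # {'job_id': 'job1', 'rank': '0'}
```

The resulting dictionary can then be passed as the tags argument of start_trace in place of the literal dict(job_id='job1').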