Distributed Inference Monitoring and Profiling
Graphsignal has a built-in support for distributed inference, e.g. multi-node and multi-GPU inference. When runs involve multiple workers, the dashboards seamlessly aggregate, structure and visualize data from all workers.
To identify each run or job, provide
for sample in dataset: with graphsignal.start_trace(endpoint='predict', tags=dict(job_id='job1')): # inference code