Key hardware specifications of the server executing this step.
Network discovery indicates the following cloud environment was utilized for this step.
Current and historical (including up-to the last five successful runs) averages, peaks and other summaries on resource usage.
Based on recent average CPU usage, historical peak memory and GPU utilization.
CPU usage for both the system and specific tasks is calculated by summing user+nice and system CPU times (in clock ticks), normalized by dividing by the total elapsed time and ticks per second. Task CPU usage encompasses all child processes.
On Linux, the used server memory is calculated by total - free - buffers - cached
, while it depends on psutil
for other systems. Task memory usage is measured by summing PSS (on Linux), USS (on MacOS and Windows), or RSS rollups of all subprocesses.
Task-specific disk usage tracking is unreliable; therefore, it is recommended to monitor disk usage at the system level, encompassing all mounted disks.
System-level disk space usage on all mounted disks.
Network usage is monitored solely at the system level across all interfaces.
nvidia-smi
reported ratios standardized between 0 and GPU count, proxying how many GPUs have been 100% utilized. Note that task-specific GPU usage is not as reliable as system-level GPU usage and limited up to 4 GPUs.
nvidia-smi
reported number of GPUs with a utilization greater than 0. Note that task-specific GPU usage is not as reliable as system-level GPU usage and limited up to 4 GPUs.
nvidia-smi
reported, summed up VRAM usage for all GPUs. Note that task-specific GPU usage is not as reliable as system-level GPU usage and limited up to 4 GPUs.