Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Retrieve the temperature trend per minute for each GPU.
Cluster Id
Start date for the query
End date for the query
OK
Target Metrics
Cluster Id
"gpu-cluster"
GPU Index
"0"
""
Node Name
"cocktail-gpu-node-1"
GPU temperature by timestamp
Timestamps per time interval
Retrieve the GR Engine usage trend per minute for each GPU.
Cluster Id
Start date for the query
End date for the query
OK
Target Metrics
Cluster Id
"gpu-cluster"
GPU Index
"0"
""
Node Name
"cocktail-gpu-node-1"
GPU GR Engine usage by timestamp
Timestamps per time interval
Retrieve the Tensor Core usage trend per minute for each GPU.
Cluster Id
Start date for the query
End date for the query
OK
Target Metrics
Cluster Id
"gpu-cluster"
GPU Index
"0"
""
Node Name
"cocktail-gpu-node-1"
GPU Tensor Core usage by timestamp
Timestamps per time interval
Retrieve the power consumption trend per minute for each GPU.
Cluster Id
Start date for the query
End date for the query
OK
Target Metrics
Cluster Id
"gpu-cluster"
GPU Index
"0"
""
Node Name
"cocktail-gpu-node-1"
GPU power consumption by timestamp
Timestamps per time interval
Retrieve the framebuffer memory usage trend per minute for each GPU.
Cluster Id
Start date for the query
End date for the query
OK
Target Metrics
Cluster Id
"gpu-cluster"
GPU Index
"0"
""
Node Name
"cocktail-gpu-node-1"
GPU Framebuffer memory usage by timestamp
Timestamps per time interval
Retrieve the Capacity and Used status when the GPU is used in Time-Slicing mode.
Cluster Id
OK
Target Metrics
Universally Unique Identifier
"GPU-79e36614..."
Cluster Id
"gpu-cluster"
GPU Index
"0"
Status
"allocate"
Resource Name
""
Node Name
"cocktail-gpu-node-1"
Timestamp
1727159760
The number of resources in the status
1
Retrieve the average activation ratio of the GR Engine for each GPU.
Cluster Id
OK
Target Metrics
Cluster Id
"gpu-cluster"
Node Name
"cocktail-gpu-node-1u"
Timestamp
1727158980
GPU GR Engine activation rate
0
Retrieve GPU instance information (GPU, MIG).
Cluster Id
OK
대상 메트릭
Universally Unique Identifier
"GPU-79e36614..."
Cluster Id
"gpu-cluster"
GPU Index
"0"
Status
"allocate"
GPU Model Name
"NVIDIA A30"
Node Name
"cocktail-gpu-node-1"
Timestamps per time interval
1727159760
The number of GPUs in the status
1
Retrieve the framebuffer memory usage rate for each GPU.
Cluster Id
OK
Target Metrics
Cluster Id
"gpu-cluster"
Node Name
"cocktail-gpu-node-1u"
Timestamp
1727158980
GPU Framebuffer memory usage rate
0