3.3. Testing the AMD GPU Operator
Use the following procedure to test the ROCmInfo installation and view the logs for the AMD MI210 GPU.
Procedure
Create a YAML file that tests ROCmInfo:
$ cat << EOF > rocminfo.yaml
apiVersion: v1
kind: Pod
metadata:
  name: rocminfo
spec:
  containers:
  - image: docker.io/rocm/pytorch:latest
    name: rocminfo
    command: ["/bin/sh","-c"]
    args: ["rocminfo"]
    resources:
      limits:
        amd.com/gpu: 1
      requests:
        amd.com/gpu: 1
  restartPolicy: Never
EOF
Create the rocminfo pod:
$ oc create -f rocminfo.yaml
Example output
pod/rocminfo created
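The pod runs rocminfo once and then exits, so its log is only complete after the pod reaches a terminal phase. The following is a minimal sketch of a polling helper, not part of the documented procedure. The function name wait_for_pod, its return codes, and the default timeout and interval are assumptions made for illustration. It relies only on `oc get pod -o jsonpath` and the standard Kubernetes pod phases.

```shell
#!/bin/sh
# Hypothetical helper (not part of the AMD GPU Operator or oc):
# poll a pod until it reaches a terminal phase.
# Returns 0 on Succeeded, 1 on Failed, 2 if the timeout expires first.
wait_for_pod() {
  pod="$1"
  timeout="${2:-300}"   # seconds; default value is an assumption
  interval="${3:-5}"    # seconds between polls; must be greater than 0
  elapsed=0
  while [ "$elapsed" -lt "$timeout" ]; do
    # Read the current pod phase (Pending, Running, Succeeded, Failed, Unknown).
    phase="$(oc get pod "$pod" -o jsonpath='{.status.phase}' 2>/dev/null)"
    case "$phase" in
      Succeeded) return 0 ;;
      Failed)    return 1 ;;
    esac
    sleep "$interval"
    elapsed=$((elapsed + interval))
  done
  return 2
}

# Example usage (commented out so that sourcing this file has no side effects):
# wait_for_pod rocminfo 300 5 && oc logs rocminfo
```

Because restartPolicy is Never, a pod whose container exits with an error ends in the Failed phase, which the helper reports separately from a timeout.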
Check the rocminfo log with one MI210 GPU:
$ oc logs rocminfo | grep -A5 "Agent"
Example output
HSA Agents
==========
*******
Agent 1
*******
  Name:                    Intel(R) Xeon(R) Gold 6330 CPU @ 2.00GHz
  Uuid:                    CPU-XX
  Marketing Name:          Intel(R) Xeon(R) Gold 6330 CPU @ 2.00GHz
  Vendor Name:             CPU
--
Agent 2
*******
  Name:                    Intel(R) Xeon(R) Gold 6330 CPU @ 2.00GHz
  Uuid:                    CPU-XX
  Marketing Name:          Intel(R) Xeon(R) Gold 6330 CPU @ 2.00GHz
  Vendor Name:             CPU
--
Agent 3
*******
  Name:                    gfx90a
  Uuid:                    GPU-024b776f768a638b
  Marketing Name:          AMD Instinct MI210
  Vendor Name:             AMD
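In the example output, the two CPU agents report a Uuid of CPU-XX and the MI210 reports a Uuid that starts with GPU-. As a rough sketch, that pattern can be used to check that the pod saw the expected number of GPUs. The function name count_gpu_agents is hypothetical, and the assumption that every GPU agent has a Uuid beginning with GPU- comes only from the example output shown here.

```shell
#!/bin/sh
# Hypothetical helper: count GPU agents in rocminfo output read from stdin.
# Assumes each GPU agent prints a "Uuid:" line whose value starts with "GPU-",
# as in the example output; CPU agents (Uuid: CPU-XX) are not counted.
count_gpu_agents() {
  # grep -c prints the number of matching lines; "|| true" keeps the exit
  # status at 0 when the count is 0.
  grep -E -c '^[[:space:]]*Uuid:[[:space:]]+GPU-' || true
}

# Example usage (commented out so that sourcing this file has no side effects):
# oc logs rocminfo | count_gpu_agents
```

For the pod defined in this procedure, which requests a single amd.com/gpu resource, the count would be expected to match that request.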
Delete the pod:
$ oc delete -f rocminfo.yaml
Example output
pod "rocminfo" deleted