NPU Management Interface#
Note
This feature is currently in the Early Access stage. Early Access features are features which are still undergoing some optimization and fine-tuning. These features are not in their final form and may change as we continue to work in order to mature them into full-fledged features.
The xrt-smi
utility is a command-line interface to monitor and manage the NPU integrated AMD CPUs.
It is installed in C:\Windows\System32\AMD
and it can be directly invoked from within the conda environment created by the Ryzen AI Software installer.
The xrt-smi
utility currently supports three primary commands:
examine: Examines the state of the AI PC and the NPU.
validate: Executes sanity tests on the NPU.
configure: Manages the performance level of the NPU.
You can use --help
with any command, such as xrt-smi examine --help
, to view all supported subcommands and their details.
Both examine and validate support an additional option –format which takes JSON (-f JSON
) and can be used to get JSON output (-o <example.json>
) for automation purposes.
Examining the AI PC and the NPU#
To provide OS/system information of the AI PC and confirm the presence of the AMD NPU:
xrt-smi examine
Sample Command Line Output:
To get output in JSON format:
xrt-smi examine -f JSON -o <specify path to create json file examine.json>
To provide more detailed information about the NPU, such as its performance mode and clocks:
xrt-smi examine --report platform
Sample Command Line Output:
To show details about the NPU partition and column occupancy on the NPU:
xrt-smi examine --report aie-partitions
Sample Command Line Output:
To show details about the columns to NPU HW context binding:
xrt-smi examine --report aie-partitions --verbose
Sample Command Line Output:
To show details about the ctrlcode opcode trace (on Strix NPU) or stream buffer tokens (on Phonix NPU):
xrt-smi examine -r telemetry
Sample Command Line Output on Strix NPU:
Sample Command Line Output on Phoenix NPU:
Note: To view aie-partition
and telemetry
reports, the model must be run concurrently on the NPU. For models running for a shorter timespan, you can run the model or xrt-smi commands in a loop to see the output of these commands.
Executing a Sanity Check on the NPU#
To validate AMD NPU, run a set of built-in sanity tests which includes verify, df-bw, tct and gemm:
xrt-smi validate --run <all>
Sample Command Line Output:
Managing the Performance Level of the NPU#
To set the performance level of the NPU. You can choose powersaver mode, balanced mode, performance mode, or use the default:
xrt-smi configure --pmode <powersaver | balanced | performance | default>
Sample Command Line Output: