Besides performance analysis for VTL0 VM and host OS, we'd like to capture trace
inside VTL2 as well. This allows us to analyze the Linux kernel and
openvmm_hcl worker process inside VTL2.
The release Linux image doesn't have user mode perf program by default. To capture perf data inside VTL2, it is necessary to build OpenHCL with the below instructions, which will include the necessary programs.
cargo xflowey build-igvm [RECIPE] --release --with-perf-toolsTo capture and save perf data file inside VTL2, we need to increase VTL2's memory page count by increasing the memory_page_count value in the IGVM configuration file (openhcl-x64-dev.json). E.g. increase it to 524288 like:
{
"image": {
"openhcl": {
....................
"memory_page_count": 524288
}
}
}While workloads run inside VTL0 VM, use ohcldiag-dev.exe to run perf to capture perf_events profiling data inside VTL2. E.g. the following command captures 15s of perf data and saves it in a file. You can find more info about perf on external page perf Examples
.\ohcldiag-dev.exe <VM Name> run -- perf record -F 999 -a --call-graph=dwarf -o ./openhcl.fio.perf -- sleep 15Use perf to convert perf data to plain text and dump it to a file on host
.\ohcldiag-dev.exe <VM Name> run -- perf script -i ./openhcl.fio.perf > .\traces\openhcl.fio.perf.scriptPlease follow up instructions on the external page Flame Graphs to create flame graph SVG file. It requires scripts from FlameGraph GitHub repo, so it is better to do it on WSL2.
Here is an example command on WSL2. It converts the perf script file to SVG file. Both of files are located under d:\tmp\vtl2 folder on Windows.
perf_file=/mnt/d/tmp/vtl2/openhcl.fio.perf.script; cat $perf_file | ./stackcollapse-perf.pl > $perf_file.folded; cat $perf_file.folded | ./flamegraph.pl > $perf_file.svgIf the perf script file doesn't have rust functions demangled correctly, please add rustfilt in the pipe to assist demangling.
perf_file=/mnt/d/tmp/vtl2/openhcl.fio.perf.script; cat $perf_file | rustfilt | ./stackcollapse-perf.pl > $perf_file.folded; cat $perf_file.folded | ./flamegraph.pl > $perf_file.svgOpenHCL supports a mem-profile-tracing feature (disabled by default) for collecting
heap memory profiling traces in a format based on DHAT.
Use the steps below to enable the feature, capture a trace, and view it.
First, follow the earlier steps in this guide to increase the VTL2 memory page count. Including debug info in OpenHCL requires additional VTL2 memory.
Then build OpenHCL with the mem-profile-tracing feature. Include debug info so the
trace contains useful backtraces:
cargo xflowey build-igvm x64 --override-openvmm-hcl-feature mem-profile-tracing --with-debuginfo --releaseohcldiag-dev provides a memory-profile-trace command that collects a trace for a
specific process by name or PID.
Examples:
Collect a trace for the VM process:
ohcldiag-dev.exe <vmname> memory-profile-trace -n vm > vm.jsonCollect a trace for the init process:
ohcldiag-dev.exe <vmname> memory-profile-trace -n underhill > underhill.jsonOpen the generated trace in the online DHAT viewer, or follow the local viewing instructions in the dhat docs here.