Version 3.1.0 is a feature and maintenance release with the following additions/fixes.
New Features & Enhancements
- Gaudi device stats now included in the
telemetry
command report. Metrics
command event data can now be re-processed so that a previously unknown transaction rate (--txnrate) can be applied.- The
telemetry
command now accepts a duration value of zero (--duration 0) to run until interrupted by SIGINT (ctrl-c). - The
telemetry
command HTML report now includes time stamps on the x-axis of charts. - The
config
command now allows setting the compute and I/O die frequencies independently (SRF and GNR) - The branch misprediction metric was added to the
metrics
report. - The
report
command now includes the Speed Select Technology frequency table when it is enabled. - Added insight entry to
report
command to warn when ELC is configured in latency-optimized mode and EPB is non-zero. - The
report
andconfig
commands now determine which EPB configuration value (OS or BIOS) is active and report and/or change the appropriate entry. Report
command tables that are not relevant to a given CPU architecture are now not include in the output.
Fixes
- L3 per core reported by the
report
command was inaccurate on some CPU architectures - On multi-socket systems where a socket has been disabled via BIOS, the microarchitecture may be reported incorrectly.
What's Changed
- enable post-processing of pre-collected metric events by @harp-intel in #192
- enable indefinite duration for telemetry collection by @harp-intel in #203
- show timestamps in metrics summary and telemetry charts by @harp-intel in #205
- refactor html report generation to reduce duplication by @harp-intel in #206
- add branch mispredict ratio metric by @harp-intel in #207
- use remote target's perf for metrics collection if it is installed and new enough by @harp-intel in #208
- Highlight notes, tips, and warnings in README by @harp-intel in #209
- Bump github.com/spf13/cobra from 1.8.1 to 1.9.1 by @dependabot in #210
- add speed select turbo frequency tables by @harp-intel in #211
- fix report for l3 size per core when l3 instances are used by multipl… by @harp-intel in #213
- Get and set compute and I/O die max/min frequencies independently by @harp-intel in #216
- add example output images to README by @harp-intel in #218
- refactor scripts to use templating by @harp-intel in #222
- use alternate EPB value when configured to do so by @harp-intel in #223
- report tables associated with CPU models by @harp-intel in #225
- fix GNR_X* microarchitecture detection by @harp-intel in #227
- collect and report Gaudi device telemetry by @harp-intel in #217
Full Changelog: v3.3.1...v3.4.0