ASV Benchmarks Integration#209

Open

vchamarthi wants to merge 5 commits into

IntelPython:mainfrom

vchamarthi:asv-benchmarks

vchamarthi commented May 13, 2026

Adds an ASV benchmark suite to track mkl_umath performance over time.

Benchmarks

micro/ - Single-ufunc timing benchmarks across
dtype - {float32, float64} × size - {10K, 100K, 1M}.
Arrays are pre-allocated in setup() and reused across timing calls.

File	Ufuncs
`bench_trig.py`	`sin`, `cos`, `tan`, `arcsin`, `arccos`, `arctan`, `arctan2`, `sinh`, `cosh`, `tanh`
`bench_exp_log.py`	`exp`, `exp2`, `expm1`, `log`, `log2`, `log10`, `log1p`
`bench_sqrt_misc.py`	`sqrt`, `cbrt`, `square`, `fabs`, `absolute`, `reciprocal`

npbench/ - 14 application-level workloads adapted from the npbench benchmark suite
(kernels inlined, no external dependency). Each runs at preset - {M, L}. All use
setup_cache() so expensive array initialization runs once per commit, not once per
timing repeat.

Patch script

_patch_setup.py - Runs once per ASV worker process at package import. Applies
mkl_fft, mkl_random, and mkl_umath patches via their public APIs and hard-fails
with a descriptive RuntimeError if any patch does not take effect. Benchmarks can
never silently fall back to stock NumPy.

vchamarthi added 2 commits

May 4, 2026 08:08


          initial commit

b6b4489


          update benchmarks with M preset, configurations

428e440

vchamarthi requested review from antonwolfy, jharlow-intel, ndgrigorian and xaleryb as code owners

May 13, 2026 22:03


          Merge branch 'main' into asv-benchmarks

9277f14

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_cholesky2.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/micro/bench_exp_log.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/_patch_setup.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/_patch_setup.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_k3mm.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_k2mm.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_gesummv.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_gemver.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_gemm.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_doitgen.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_correlation.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_covariance.py Outdated

ndgrigorian reviewed

View reviewed changes

benchmarks/benchmarks/npbench/bench_deriche.py Outdated

vchamarthi added 2 commits

May 18, 2026 08:43


          pre-commit-fixes

b27dc03


          Fix PR comments

5da98d4

vchamarthi commented

View reviewed changes

benchmarks/benchmarks/micro/bench_micro.py

+                  params = (
+                      sorted(_UFUNC_CONFIGS.keys()),
+                      ["float32", "float64"],
+                      [10_000, 100_000, 1_000_000],

Author

vchamarthi May 19, 2026

@ndgrigorian Do you think these sizes are good enough?
on pvc machine Intel Xeon Platinum 8480+, 1M looks solid L3-resident (L3 cache size on this machine is 210 MiB (2 instances))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

ndgrigorian ndgrigorian left review comments

antonwolfy Awaiting requested review from antonwolfy antonwolfy is a code owner

xaleryb Awaiting requested review from xaleryb xaleryb is a code owner

jharlow-intel Awaiting requested review from jharlow-intel jharlow-intel is a code owner

At least 1 approving review is required to merge this pull request.

Labels

None yet