History

Sebastian Graf 4cd7a85334 test: speed up Sym mvcgen by doing fewer redundant program matches (#12712 ) This PR changes the spec lookup procedure in Sym-based mvcgen so that 1. Spec candidates are sorted first before being filtered 2. Instead of filtering the whole set of candidates using `spec.pattern.match?`, we take the first match with the highest priority. The second point means we will do a lot fewer matches when the highest priority spec matches immediately. In this case, the one match is still partially redundant with the final application of the backward rule application. It would be great if could somehow specialize the backward rule after it has been created. Still, this yields some welcome speedups. Before and after for each. ``` vcgen_add_sub_cancel: goal_1000: 865 ms, 1 VCs by grind: 228 ms, kernel: 435 ms goal_1000: 540 ms, 1 VCs by grind: 229 ms, kernel: 426 ms vcgen_ping_pong: goal_1000: 458 ms, 0 VCs, kernel: 431 ms goal_1000: 454 ms, 0 VCs, kernel: 443 ms (unchanged, because there is only ever one candidate spec) vcgen_deep_add_sub_cancel: goal_1000: 986 ms, 1 VCs by grind: 234 ms, kernel: 735 ms goal_1000: 728 ms, 1 VCs by grind: 231 ms, kernel: 708 ms vcgen_reader_state: goal_1000: 746 ms, 1 VCs by sorry: 1 ms, kernel: 803 ms goal_1000: 525 ms, 1 VCs by sorry: 1 ms, kernel: 840 ms ```		2026-02-27 03:24:34 +00:00
..
build	chore: switch to new test/bench suite (#12590 )	2026-02-25 13:51:53 +00:00
mergeSort	chore: relative lean-toolchains (#12652 )	2026-02-25 10:23:35 +00:00
mvcgen	test: speed up Sym mvcgen by doing fewer redundant program matches (#12712 )	2026-02-27 03:24:34 +00:00
qsort	chore: relative lean-toolchains (#12652 )	2026-02-25 10:23:35 +00:00
size	chore: switch to new test/bench suite (#12590 )	2026-02-25 13:51:53 +00:00
sym	feat: improves `simpArrowTelescope` simproc (#12153 )	2026-01-25 22:29:38 +00:00
.gitignore	chore: switch to new test/bench suite (#12590 )	2026-02-25 13:51:53 +00:00
accumulate_profile.py	chore: add `lakeprof` benchmarks (#9709 )	2025-08-06 11:25:45 +00:00
arith_eval.ml
binarytrees.ghc-6.hs	doc: fix typos	2021-03-07 15:06:02 +01:00
binarytrees.ocaml-2.ml
binarytrees.st.hs	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.st.mlton-2.sml	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.st.sml	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.st.swift	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.swift	feat(tests/bench): add safe binarytrees.swift from https://benchmarksgame-team.pages.debian.net/benchmarksgame/program/binarytrees-swift-1.html	2019-05-30 19:33:38 +02:00
binarytrees5.ml	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees5_multicore.ml	chore: more benchmarking setup	2023-01-17 13:28:05 +01:00
compile.sh	feat: LLVM backend (#1837 )	2022-12-30 12:45:30 +01:00
const_fold.hs	chore(tests/bench): rename benchmarks	2019-05-30 16:25:41 +02:00
const_fold.ml	chore(tests/bench): rename benchmarks	2019-05-30 16:25:41 +02:00
const_fold.sml	chore(tests/bench): rename benchmarks	2019-05-30 16:25:41 +02:00
const_fold.swift	chore(tests/bench): rename benchmarks	2019-05-30 16:25:41 +02:00
cross.yaml	chore: fix more typos in comments	2023-10-08 14:37:34 -07:00
dag_hassorry_issue.lean	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
dag_hassorry_issue.lean.args	chore: reduce stack space usage at `instantiate_mvars_fn` (#4931 )	2024-08-06 17:38:59 +00:00
dag_hassorry_issue.lean.expected.out	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
delayed_assign.lean	test: delayed assignment performance issue (#12201 )	2026-01-28 02:08:39 +00:00
deriv.hs
deriv.ml
deriv.sml
deriv.swift	test(tests/bench): add `deriv.swift`	2019-05-30 11:34:58 -07:00
flake.lock	chore: update cross-bench setup	2024-04-15 10:59:07 +02:00
flake.nix	chore: robustify Nix shell (#8141 )	2025-04-28 15:08:32 +00:00
full-stdlib.exec.yaml	feat: separate benchmark for profiling the stdlib per-file	2020-10-29 11:53:03 +01:00
ghc-gc.py
lean-gc.py
Makefile	chore: update cross-bench setup	2024-04-15 10:59:07 +02:00
mlkit-gc.py
ocaml-gc.py	chore: more benchmarking setup	2023-01-17 13:28:05 +01:00
perf.py	chore: update benchmark suite	2022-05-25 18:26:36 +02:00
qsort.hs	chore: update benchmark suite	2022-05-25 18:26:36 +02:00
qsort.ml	test: more fair qsort.ml benchmark	2022-10-12 20:22:55 +02:00
qsort.sml
qsort.swift	test: more fair qsort.ml benchmark	2022-10-12 20:22:55 +02:00
rbmap.hs	chore: make rbmap.hs more similar to other implementations	2022-09-24 14:16:48 +02:00
rbmap.ml
rbmap.sml
rbmap.swift	tests(tests/bench): add `rbmap.swift`	2019-05-30 14:47:06 -07:00
rbmap2.lean	chore: remove command `universes`	2021-06-29 17:01:07 -07:00
rbmap3.lean	chore: remove command `universes`	2021-06-29 17:01:07 -07:00
rbmap500k.lean	chore: remove command `universes`	2021-06-29 17:01:07 -07:00
rbmap_checkpoint.hs	chore: make rbmap.hs more similar to other implementations	2022-09-24 14:16:48 +02:00
rbmap_checkpoint.ml	test(tests/bench/rbmap_checkpoint): OCaml version using myLen	2019-05-30 07:40:53 -07:00
rbmap_checkpoint.sml	chore(tests/bench/rbmap_checkpoint): use `myLean`	2019-05-30 07:30:07 -07:00
rbmap_checkpoint.swift	test(tests/bench/rbmap_checkpoint): add swift version	2019-05-30 14:35:58 -07:00
rbmap_checkpoint2.lean	chore: remove command `universes`	2021-06-29 17:01:07 -07:00
rbmap_checkpoint2.sml
rbmap_checkpoint_cpp_lean3.cpp	test(tests/bench): add C++ versions of rbmap benchmarks	2019-06-22 06:58:27 -07:00
rbmap_checkpoint_cpp_std.cpp	test(tests/bench): add C++ versions of rbmap benchmarks	2019-06-22 06:58:27 -07:00
rbmap_cpp_lean3.cpp	test(tests/bench): add C++ versions of rbmap benchmarks	2019-06-22 06:58:27 -07:00
rbmap_cpp_std.cpp	test(tests/bench): add C++ versions of rbmap benchmarks	2019-06-22 06:58:27 -07:00
README.md	chore: update cross-bench setup	2024-04-15 10:59:07 +02:00
report.py	chore: safer bench script	2023-07-19 08:31:39 +02:00
run.sh
speedcenter.yaml	chore: try refining some benchmark settings (#8377 )	2025-05-16 11:24:11 +00:00
states35.lean	chore: move `states35` to bench directory	2022-04-09 15:46:28 -07:00
test_single.sh	feat: LLVM backend (#1837 )	2022-12-30 12:45:30 +01:00
unionfind_clean.lean	chore(frontends/lean): use `=>` instead of `:=` in match-expressions	2019-07-04 11:38:38 -07:00

README.md

Lean Benchmark Suites

This folder contains multiple small Lean programs for benchmarking used by two separate benchmark suites based on the temci benchmarking tool:

The light-weight "Speedcenter" suite benchmarks the current build of Lean. It can be used for quick comparisons on the cmdline and powers the Lean Speedcenter website.
The heavy-weight "Cross" suite benchmarks multiple Lean configurations and other functional compilers against each other and generates CSV and HTML reports from that. It was created for the paper "Counting Immutable Beans - Reference Counting Optimized for Purely Functional Programming" (IFL19).

Speedcenter Suite

Requirements:

A local Lean build in ../../build/release. Build at least the bin target.
temci. Using Nix, open a nix-shell in the project root directory to add a compatible version to your PATH. Alternatively, try pip3 install git+https://github.com/parttimenerd/temci.git.

To execute the suite and save the results in base.yaml, run (in this folder)

temci exec --config speedcenter.yaml --out base.yaml

Other interesting exec flags:

use --runs N to modify the default number of 10 runs per benchmark
use --included_blocks fast to excluded slow benchmarks like the stdlib benchmark. You can replace fast with any benchmark name or label in speedcenter.exec.yaml.

If you have multiple saved result files, you can compare them with

temci report --config speedcenter.yaml report1.yaml report2.yaml ...

Cross Suite

We recommend using Nix for building/obtaining all Lean variants and used compilers in a reproducible way. After installing Nix, running the benchmarks is as easy as

nix develop
make

This will record 50 runs for each benchmark configuration (this can be changed with runs in cross.yaml), generate results in report_lean.csv and report_cross.csv, and print them to stdout in a tabulated format. It will also generate HTML reports in report/ comparing the time-based benchmarks.

In order to reduce noise in the benchmarking data, you may instead want to try calling make inside a temci shell:

temci short shell --sudo --preset usable --cpuset_active make

Using root powers, this will temporarily configure your machine similarly to the LLVM benchmarking recommendations and move all your other processes to a single CPU core.