History

Sebastian Graf c0b9ff1148 test: measure VC discharging separately in Sym mvcgen benchmarks (#12551 ) This PR refactors the benchmark driver of the Sym mvcgen benchmarks such that time spent for discharging VCs and instantiation of MVars is measured separately from VC generation. Example output: ``` baseline_add_sub_cancel goal_100: 57 ms, 0 VCs, kernel: 22 ms goal_500: 353 ms, 0 VCs, kernel: 160 ms goal_1000: 755 ms, 0 VCs, kernel: 437 ms vcgen_add_sub_cancel goal_100: 36 ms, 1 VCs by grind: 21 ms, kernel: 35 ms goal_500: 149 ms, 1 VCs by grind: 115 ms, kernel: 214 ms goal_1000: 314 ms, 1 VCs by grind: 249 ms, kernel: 478 ms vcgen_deep_add_sub_cancel goal_100: 65 ms, 1 VCs by grind: 23 ms, kernel: 82 ms goal_500: 262 ms, 1 VCs by grind: 123 ms, kernel: 539 ms goal_1000: 611 ms, 1 VCs by grind: 292 ms, kernel: 1075 ms vcgen_get_throw_set goal_100: 87 ms, 101 VCs by sorry: 16 ms, kernel: 93 ms goal_500: 332 ms, 501 VCs by sorry: 289 ms, instantiate > 1000ms: 23363 ms, kernel: 770 ms goal_1000: 794 ms, 1001 VCs by sorry: 1332 ms, instantiate > 1000ms: 334614 ms, kernel: 1882 ms ```		2026-02-18 12:03:47 +00:00
..
cbv	test: add `cbv` benchmark for evaluating `Decidable.decide` (#12467 )	2026-02-13 13:25:35 +00:00
inundation	refactor: migrate to new ranges (#8841 )	2025-07-07 12:41:53 +00:00
mergeSort	chore: deprecate List.iota (#6708 )	2025-01-21 02:32:35 +00:00
mvcgen	test: measure VC discharging separately in Sym mvcgen benchmarks (#12551 )	2026-02-18 12:03:47 +00:00
qsort	feat: remove runtime bounds checks and partial from qsort (#6241 )	2024-12-01 06:26:00 +00:00
sym	feat: improves `simpArrowTelescope` simproc (#12153 )	2026-01-25 22:29:38 +00:00
.gitignore	chore: add .dSYM files (Mac debug symbols) to tests .gitignore files (#8771 )	2025-06-13 15:27:46 +00:00
accumulate_profile.py	chore: add `lakeprof` benchmarks (#9709 )	2025-08-06 11:25:45 +00:00
arith_eval.ml
big_beq.lean	perf: mkNoConfusionCtors: cheaper `inferType` (#10455 )	2025-09-19 10:51:17 +00:00
big_beq_rec.lean	chore: benchmark for deriving BEq on large inductive (#10028 )	2025-08-21 15:50:12 +00:00
big_deceq.lean	chore: benchmarks for deriving DecidableEq on large inductives (#10149 )	2025-08-27 12:05:04 +00:00
big_deceq_rec.lean	chore: benchmarks for deriving DecidableEq on large inductives (#10149 )	2025-08-27 12:05:04 +00:00
big_do.lean	test: add a benchmark that is slow to elaborate (#5656 )	2024-10-23 08:20:15 +00:00
big_match.lean	chore: large match statement benchmark (#9665 )	2025-08-01 15:25:07 +00:00
big_match_nat.lean	test: add big match on nat lit benchmarks (#11502 )	2025-12-04 08:21:56 +00:00
big_match_nat_split.lean	test: add big match on nat lit benchmarks (#11502 )	2025-12-04 08:21:56 +00:00
big_match_partial.lean	test: benchmark for large partial match (#11199 )	2025-11-16 11:20:31 +00:00
big_omega.lean	test: `big_omega` benchmark (#5817 )	2024-10-24 07:26:29 +00:00
big_struct.lean	perf: add introSubstEq shortcut (#12190 )	2026-01-28 12:33:14 +00:00
big_struct_dep.lean	test: add a big dependent struct test (#12061 )	2026-01-20 12:00:25 +00:00
big_struct_dep1.lean	test: add big_struct_dep1 benchmark (#12191 )	2026-01-27 14:36:09 +00:00
binarytrees.ghc-6.hs
binarytrees.lean	test: clean up binarytrees.lean	2023-01-19 14:44:20 +01:00
binarytrees.lean.args
binarytrees.lean.expected.out
binarytrees.ocaml-2.ml
binarytrees.st.hs	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.st.lean	refactor: migrate to new ranges (#8841 )	2025-07-07 12:41:53 +00:00
binarytrees.st.mlton-2.sml	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.st.sml	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.st.swift	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees.swift
binarytrees5.ml	test: add binarytrees.st benchmark	2023-01-19 14:44:20 +01:00
binarytrees5_multicore.ml	chore: more benchmarking setup	2023-01-17 13:28:05 +01:00
bv_decide_inequality.lean	fix: bv_decide benchmarks (#6017 )	2024-11-09 11:18:33 +00:00
bv_decide_large_aig.lean	perf: add a large AIG benchmark for bv_decide (#7721 )	2025-03-29 16:04:25 +00:00
bv_decide_mod.lean	fix: bv_decide benchmarks (#6017 )	2024-11-09 11:18:33 +00:00
bv_decide_mul.lean	feat: add bv_decide benchmarks (#5203 )	2024-08-29 12:45:58 +00:00
bv_decide_realworld.lean	chore: notation ^^ for Bool.xor (#5332 )	2024-09-18 08:59:11 +00:00
bv_decide_rewriter.lean	perf: bv_decide rewriting benchmark (#9231 )	2025-07-07 10:24:08 +00:00
channel.lean	refactor: migrate to new ranges (#8841 )	2025-07-07 12:41:53 +00:00
charactersIn.lean	chore: benchmark for charactersIn (#11643 )	2025-12-12 22:23:51 +00:00
compile.sh	feat: LLVM backend (#1837 )	2022-12-30 12:45:30 +01:00
const_fold.hs
const_fold.lean
const_fold.lean.args
const_fold.lean.expected.out
const_fold.ml
const_fold.sml
const_fold.swift
cross.yaml	chore: fix more typos in comments	2023-10-08 14:37:34 -07:00
dag_hassorry_issue.lean	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
dag_hassorry_issue.lean.args	chore: reduce stack space usage at `instantiate_mvars_fn` (#4931 )	2024-08-06 17:38:59 +00:00
dag_hassorry_issue.lean.expected.out	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
delayed_assign.lean	test: delayed assignment performance issue (#12201 )	2026-01-28 02:08:39 +00:00
deriv.hs
deriv.lean
deriv.lean.args
deriv.lean.expected.out
deriv.ml
deriv.sml
deriv.swift
ex-50-50-1.leq
flake.lock	chore: update cross-bench setup	2024-04-15 10:59:07 +02:00
flake.nix	chore: robustify Nix shell (#8141 )	2025-04-28 15:08:32 +00:00
full-stdlib.exec.yaml	feat: separate benchmark for profiling the stdlib per-file	2020-10-29 11:53:03 +01:00
ghc-gc.py
hashmap.lean	refactor: remove last appearances of `allowNontermination` (#12211 )	2026-01-29 07:22:19 +00:00
identifier_completion.lean	perf: do not export `opaque` bodies (#10119 )	2025-08-27 20:59:59 +00:00
identifier_completion_didOpen.log	perf: do not export `opaque` bodies (#10119 )	2025-08-27 20:59:59 +00:00
identifier_completion_initialization.log	test: identifier completion benchmark (#6796 )	2025-01-27 19:31:32 +00:00
identifier_completion_runner.lean	test: improve language server test coverage (#10574 )	2025-09-30 11:15:03 +00:00
ilean_roundtrip.lean	feat: reduce server memory consumption (#11162 )	2025-12-01 10:53:23 +00:00
iterators.lean	refactor: move `Iter` and others from `Std.Iterators` to `Std` (#11446 )	2025-12-15 08:24:12 +00:00
lean-gc.py
liasolver.lean	feat: verify `all` and `any` for hash maps (#10765 )	2025-11-15 16:59:37 +00:00
liasolver.lean.args
liasolver.lean.expected.out	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
Makefile	chore: update cross-bench setup	2024-04-15 10:59:07 +02:00
mlkit-gc.py
mut_rec_wf.lean	chore: add #9598 as benchmark (#9642 )	2025-07-31 15:32:54 +00:00
nat_repr.lean	refactor: migrate to new ranges (#8841 )	2025-07-07 12:41:53 +00:00
nat_repr.lean.args	chore: Nat.repr microbenchmark (#3888 )	2024-04-17 18:10:32 +00:00
nat_repr.lean.expected.out	chore: Nat.repr microbenchmark (#3888 )	2024-04-17 18:10:32 +00:00
ocaml-gc.py	chore: more benchmarking setup	2023-01-17 13:28:05 +01:00
omega_stress.lean	perf: optimize `sorry` detection in unused variables linter (#7129 )	2025-02-22 16:43:39 +00:00
parser.lean	refactor: migrate to new ranges (#8841 )	2025-07-07 12:41:53 +00:00
perf.py	chore: update benchmark suite	2022-05-25 18:26:36 +02:00
phashmap.lean	refactor: remove last appearances of `allowNontermination` (#12211 )	2026-01-29 07:22:19 +00:00
qsort.hs	chore: update benchmark suite	2022-05-25 18:26:36 +02:00
qsort.lean	chore: remove >6 month old deprecations (#10446 )	2025-09-22 12:47:11 +00:00
qsort.lean.args
qsort.lean.expected.out
qsort.ml	test: more fair qsort.ml benchmark	2022-10-12 20:22:55 +02:00
qsort.sml
qsort.swift	test: more fair qsort.ml benchmark	2022-10-12 20:22:55 +02:00
rbmap.hs	chore: make rbmap.hs more similar to other implementations	2022-09-24 14:16:48 +02:00
rbmap.lean	chore: modernize rbmap benchmarks a bit	2022-09-24 14:16:48 +02:00
rbmap.lean.args
rbmap.lean.expected.out
rbmap.ml
rbmap.sml
rbmap.swift
rbmap2.lean
rbmap3.lean
rbmap500k.lean
rbmap_checkpoint.hs	chore: make rbmap.hs more similar to other implementations	2022-09-24 14:16:48 +02:00
rbmap_checkpoint.lean	chore: modernize rbmap benchmarks a bit	2022-09-24 14:16:48 +02:00
rbmap_checkpoint.lean.args
rbmap_checkpoint.lean.expected.out
rbmap_checkpoint.ml
rbmap_checkpoint.sml
rbmap_checkpoint.swift
rbmap_checkpoint2.lean
rbmap_checkpoint2.sml
rbmap_checkpoint_cpp_lean3.cpp
rbmap_checkpoint_cpp_std.cpp
rbmap_cpp_lean3.cpp
rbmap_cpp_std.cpp
rbmap_fbip.lean	feat: add rbmap_fbip benchmark	2022-10-06 17:26:43 -07:00
rbmap_library.lean	chore: more RBMap cleanup	2022-10-06 17:26:43 -07:00
README.md	chore: update cross-bench setup	2024-04-15 10:59:07 +02:00
reduceMatch.lean	perf: sparse case splitting in match compilation (#10823 )	2025-11-06 13:46:35 +00:00
report.py	chore: safer bench script	2023-07-19 08:31:39 +02:00
riscv-ast.lean	chore: update bench/riskv-ast.lean (#10505 )	2025-09-24 11:46:26 +00:00
run.sh
server_startup.lean	feat: reduce server memory consumption (#11162 )	2025-12-01 10:53:23 +00:00
server_startup.log	test: add language server startup benchmark (#3558 )	2024-03-04 09:01:51 +00:00
sigmaIterator.lean	fix: sigmaIterator benchmark (#12364 )	2026-02-06 19:45:42 +00:00
simp_arith1.lean	chore: minimize benchmark imports so we don't spend a majority in importing (#9513 )	2025-07-24 12:14:12 +00:00
simp_bubblesort_256.lean	doc: correct typos in documentation and comments (#11465 )	2025-12-02 06:38:05 +00:00
simp_congr.lean	perf: add benchmark for congruence reasoning in simp (#9511 )	2025-07-24 10:47:37 +00:00
simp_local.lean	refactor: `module`-ize `Lean` (#9330 )	2025-07-25 12:02:51 +00:00
simp_subexpr.lean	perf: simp subexpr benchmark (#9404 )	2025-07-17 11:53:48 +00:00
speedcenter.exec.velcom.yaml	test: add `cbv` benchmark for evaluating `Decidable.decide` (#12467 )	2026-02-13 13:25:35 +00:00
speedcenter.yaml	chore: try refining some benchmark settings (#8377 )	2025-05-16 11:24:11 +00:00
states35.lean	chore: move `states35` to bench directory	2022-04-09 15:46:28 -07:00
test_single.sh	feat: LLVM backend (#1837 )	2022-12-30 12:45:30 +01:00
treemap.lean	refactor: remove last appearances of `allowNontermination` (#12211 )	2026-01-29 07:22:19 +00:00
unionfind.lean	feat: change `Array.get` to take a `Nat` and a proof (#6032 )	2024-11-12 03:30:46 +00:00
unionfind.lean.args
unionfind.lean.expected.out
unionfind_clean.lean
watchdogRss.lean	feat: reduce server memory consumption (#11162 )	2025-12-01 10:53:23 +00:00
workspaceSymbols.lean	chore: make workspaceSymbols benchmark independent of sorry search (#11642 )	2025-12-12 20:10:27 +00:00
workspaceSymbolsNewRanges.lean	chore: make workspaceSymbol benchmarks modules (#11094 )	2025-11-05 18:40:39 +00:00

README.md

Lean Benchmark Suites

This folder contains multiple small Lean programs for benchmarking used by two separate benchmark suites based on the temci benchmarking tool:

The light-weight "Speedcenter" suite benchmarks the current build of Lean. It can be used for quick comparisons on the cmdline and powers the Lean Speedcenter website.
The heavy-weight "Cross" suite benchmarks multiple Lean configurations and other functional compilers against each other and generates CSV and HTML reports from that. It was created for the paper "Counting Immutable Beans - Reference Counting Optimized for Purely Functional Programming" (IFL19).

Speedcenter Suite

Requirements:

A local Lean build in ../../build/release. Build at least the bin target.
temci. Using Nix, open a nix-shell in the project root directory to add a compatible version to your PATH. Alternatively, try pip3 install git+https://github.com/parttimenerd/temci.git.

To execute the suite and save the results in base.yaml, run (in this folder)

temci exec --config speedcenter.yaml --out base.yaml

Other interesting exec flags:

use --runs N to modify the default number of 10 runs per benchmark
use --included_blocks fast to excluded slow benchmarks like the stdlib benchmark. You can replace fast with any benchmark name or label in speedcenter.exec.yaml.

If you have multiple saved result files, you can compare them with

temci report --config speedcenter.yaml report1.yaml report2.yaml ...

Cross Suite

We recommend using Nix for building/obtaining all Lean variants and used compilers in a reproducible way. After installing Nix, running the benchmarks is as easy as

nix develop
make

This will record 50 runs for each benchmark configuration (this can be changed with runs in cross.yaml), generate results in report_lean.csv and report_cross.csv, and print them to stdout in a tabulated format. It will also generate HTML reports in report/ comparing the time-based benchmarks.

In order to reduce noise in the benchmarking data, you may instead want to try calling make inside a temci shell:

temci short shell --sudo --preset usable --cpuset_active make

Using root powers, this will temporarily configure your machine similarly to the LLVM benchmarking recommendations and move all your other processes to a single CPU core.