lean4-htt

Author	SHA1	Message	Date
Garmelon	debafca7e1	chore: add radar-based bench suite for stdlib (#11264 ) This PR adds a new [radar]-based [temci]-less bench suite that replaces the `stdlib` benchmarks from the old suite and also measures per-module instruction counts. All other benchmarks from the old suite are unaffected. The readme at `tests/bench-radar/README.md` explains in more detail how the bench suite is structured and how it works. The readmes in the benchmark subdirectories explain what each benchmark does and which metrics it collects. All metrics except `stdlib//max dynamic symbols` were ported to the new suite, though most have been renamed. [radar]: https://github.com/leanprover/radar [temci]: https://github.com/parttimenerd/temci	2025-11-25 12:59:30 +00:00
Paul Reichert	6da35eeccb	refactor: increase runtime of "sigma iterator" benchmark (#11336 ) This PR makes the "sigma iterator" benchmark more compute-intensive because it was too fast and therefore flaky.	2025-11-24 12:21:27 +00:00
Sebastian Ullrich	bfbad53540	fix: avoid storing reference to environment in realization result to prevent promise cycle (#11328 ) This PR fixes freeing memory accidentally retained for each document version in the language server on certain elaboration workloads. The issue must have existed since 4.18.0.	2025-11-24 10:16:56 +00:00
Paul Reichert	2980155f5c	refactor: simplify `ToIterator` (#11242 ) This PR significantly changes the signature of the `ToIterator` type class. The obtained iterators' state is no longer dependently typed and is an `outParam` instead of being bundled inside the class. Among other benefits, `simp` can now rewrite inside of `Slice.toList` and `Slice.toArray`. The downside is that we lose flexibility. For example, the former combinator-based implementation of `Subarray`'s iterators is no longer feasible because the states are dependently typed. Therefore, this PR provides a hand-written iterator for `Subarray`, which does not require a dependently typed state and is faster than the previous one. Converting a family of dependently typed iterators into a simply typed one using a `Sigma`-state iterator generates forbiddingly bad code, so that we do provide such a combinator. This PR adds a benchmark for this problem.	2025-11-22 12:37:18 +00:00
Marc Huisinga	2f1e258a5e	test: re-enable re-elab benchmarks and add watchdog re-elab benchmark (#11284 )	2025-11-20 22:53:08 +00:00
Joachim Breitner	8ef742647e	test: benchmark for large partial match (#11199 ) Creates an inductive data type with 100 constructors, and a function that does matches on half of its constructors, with a catch-all for the other half, and generates the splitter. Related to #11183.	2025-11-16 11:20:31 +00:00
Johannes Tantow	100006fdd0	feat: verify `all` and `any` for hash maps (#10765 ) This PR extends the `all`/`any` functions from hash sets to hash maps and dependent hash maps and verifies them.	2025-11-15 16:59:37 +00:00
Joachim Breitner	d41f39fb10	perf: sparse case splitting in match compilation (#10823 ) This PR lets the match compilation procedure use sparse case analysis when the patterns only match on some but not all constructors of an inductive type. This way, less code is produce. Before, code handling each of the other cases was then optimized and commoned-up by later compilation pipeline, but that is wasteful to do. In some cases this will prevent Lean from noticing that a match statement is complete because it performs less case-splitting for the unreachable case. In this case, give explicit patterns to perform the deeper split with `by contradiction` as the right-hand side. At least temporarily, there is also the option to disable this behaviour with ``` set_option backwards.match.sparseCases false ```	2025-11-06 13:46:35 +00:00
Joachim Breitner	d8a67095d6	chore: make workspaceSymbol benchmarks modules (#11094 ) This PR makes workspaceSymbol benchmarks `module`s, so that they are less sensitive to additions of private symbols in the standard library.	2025-11-05 18:40:39 +00:00
Kim Morrison	4887eeb77c	chore: remove >6 month old deprecations (#10968 )	2025-10-26 10:01:30 +00:00
Henrik Böving	881a131ad3	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
Rob23oba	fad0e69cc7	fix: make name mangling unambiguous (#10727 ) This PR fixes name mangling to be unambiguous / injective by adding `00` for disambiguation where necessary. Additionally, the inverse function, `Lean.Name.unmangle` has been added which can be used to unmangle a mangled identifier. This unmangler has been added to demonstrate the injectivity but also to allow unmangling identifiers e.g. for debugging purposes. Closes #10724	2025-10-23 07:18:07 +00:00
Markus Himmel	b28daa6d60	chore: rename `String.endPos` -> `String.rawEndPos` (#10853 ) This PR renames `String.endPos` to `String.rawEndPos`, as in a future release the name `String.endPos` will be taken by the function that is currently called `String.endValidPos`.	2025-10-21 11:25:30 +00:00
Paul Reichert	f58999a7a6	refactor: use `Shrink` stub in the iterator framework (#10725 ) This PR introduces a no-op version of `Shrink`, a type that should allow shrinking small types into smaller universes given a proof that the type is small enough, and uses it in the iterator library. Because this type would require special compiler support, the current version is just a wrapper around the inner type so that the wrapper is equivalent, but not definitionally equivalent. While `Shrink` is unable to shrink universes right now, but introducing it now will allow us to generalize the universes in the iterator library with fewer breaking changes as soon as an actual `Shrink` is possible.	2025-10-14 10:22:14 +00:00
Markus Himmel	5c707d936c	chore: rename `Stream` to `Std.Stream` (#10645 ) This PR renames `Stream` to `Std.Stream` so that the name becomes available to mathlib after a deprecation cycle.	2025-10-02 15:25:56 +00:00
Paul Reichert	89686fcd02	refactor: replace `PRange shape α` with `Rcc α` and eight other types (#10319 ) This PR "monomorphizes" the structure `Std.PRange shape α`, replacing it with nine distinct structures `Std.Rcc`, `Std.Rco`, `Std.Rci` etc., one for each possible shape of a range's bounds. This change was necessary because the shape polymorphism is detrimental to attempts of automation. BREAKING CHANGE: While range/slice notation itself is unchanged, this essentially breaks the entire remaining (polymorphic) range and slice API except for the dot-notation(`toList`, `iter`, ...). It is not possible to deprecate old declarations that were formulated in a shape-polymorphic way that is not available anymore.	2025-10-02 06:45:11 +00:00
Marc Huisinga	dfd3d18530	test: improve language server test coverage (#10574 ) This PR significantly improves the test coverage of the language server, providing at least a single basic test for every request that is used by the client. It also implements infrastructure for testing all of these requests, e.g. the ability to run interactive tests in a project context and refactors the interactive test runner to be more maintainable. Finally, it also fixes a small bug with the recently implemented unknown identifier code actions for auto-implicits (#10442) that was discovered in testing, where the "import all unambiguous unknown identifiers" code action didn't work correctly on auto-implicit identifiers.	2025-09-30 11:15:03 +00:00
Joachim Breitner	1374445081	chore: update bench/riskv-ast.lean (#10505 ) This PR disables `trace.profiler` in `bench/riskv-ast.lean`. We don't want to optimize the trace profiler, but normal code. While at it, I removed the `#exit` to cover more of the file. While at it, also import the latest from from upstream.	2025-09-24 11:46:26 +00:00
Garmelon	8b64425033	chore: set temci tags for the radar bench script (#10527 ) The radar bench scripts at https://github.com/leanprover/radar-bench-lean4/ split up the benchmarks between the two runners based on the tags: One runner filters by the tag `stdlib` while the other filters by the tag `other`. Only benchmarks using one of these tags will be run, and any benchmark tagged with both will waste electricity. As far as I know, the tags are unused otherwise, so I just replaced all the old tags.	2025-09-23 19:51:10 +00:00
Kim Morrison	2b23afdfab	chore: remove >6 month old deprecations (#10446 )	2025-09-22 12:47:11 +00:00
Joachim Breitner	0e122870be	perf: mkNoConfusionCtors: cheaper `inferType` (#10455 ) This PR changes `mkNoConfusionCtors` so that its use of `inferType` does not have to reduce `noConfusionType`, to make #10315 really effective.	2025-09-19 10:51:17 +00:00
Paul Reichert	fef390df08	perf: improve iterator/range benchmarks, use shortcut instances for `Int` ranges (#10197 ) This PR is the result of analyzing the elaborator performance regression introduced by #10005. It makes the `workspaceSymboldNewRanges` and `iterators` benchmarks less noisy. It also replaces some range-related instances for `Nat` with shortcuts to the general-purpose instances. This is a trade-off between the ergonomics and the synthesis cost of having general-purpose instances.	2025-09-03 15:47:52 +00:00
Sebastian Ullrich	d63d1188cc	chore: fix stdlib size benchmarks	2025-08-28 12:07:27 +02:00
Sebastian Ullrich	9757a7be53	perf: do not export `opaque` bodies (#10119 ) In particular, do not export `partial` bodies	2025-08-27 20:59:59 +00:00
Joachim Breitner	72e8970848	chore: benchmarks for deriving DecidableEq on large inductives (#10149 ) This PR adds benchmarks for deriving `DecidableEq` on inductives with many constructors. (Although at the moment, many is “many” as we timeout for more than 30 or 40 constructors.)	2025-08-27 12:05:04 +00:00
Cameron Zwarich	f7a251b75f	chore: set `experimental.module=true` when running `grind` benchmarks (#10041 )	2025-08-22 03:15:36 +00:00
Joachim Breitner	e9f6033467	chore: benchmark for deriving BEq on large inductive (#10028 )	2025-08-21 15:50:12 +00:00
Henrik Böving	2d4bcf202f	chore: even more independent benchmarks (#9970 )	2025-08-18 18:36:33 +00:00
Henrik Böving	e4be2b2cad	chore: make perf tests more independent of external factors (#9960 )	2025-08-18 08:45:23 +00:00
Sebastian Ullrich	506d16a603	chore: complete `riscv_ast` benchmark (#9928 )	2025-08-15 14:39:25 +00:00
Henrik Böving	44d3cfb3dc	chore: stabilize benchmark output (#9820 )	2025-08-10 10:53:38 +00:00
Sebastian Ullrich	09600f2ca4	chore: add `lakeprof` benchmarks (#9709 )	2025-08-06 11:25:45 +00:00
Joachim Breitner	417031fc17	chore: large match statement benchmark (#9665 ) This PR adds a benchmark with a large, two-level, not-overlapping match statement, including the splitter generation.	2025-08-01 15:25:07 +00:00
Henrik Böving	6eaf406305	chore: bump stack limit in benchmark (#9660 )	2025-08-01 09:33:39 +00:00
Joachim Breitner	c8ef2fae1a	chore: add #9598 as benchmark (#9642 ) This PR adds the example from #9598 as a benchmark.	2025-07-31 15:32:54 +00:00
Sebastian Ullrich	81fe5243d3	chore: add grind tests as benchmarks (#9537 )	2025-07-25 14:21:38 +00:00
Sebastian Ullrich	ff1d3138bf	refactor: `module`-ize `Lean` (#9330 )	2025-07-25 12:02:51 +00:00
Henrik Böving	75b5c8b0aa	perf: phashmap benchmark (#9517 ) This PR adds a benchmark for the persistent hashmap, in particular also covering the non linear insert case which is often hit in practical uses. Furthermore the same test case is also added to the treemap benchmark.	2025-07-24 14:57:07 +00:00
Sebastian Ullrich	db292b4c82	chore: minimize benchmark imports so we don't spend a majority in importing (#9513 )	2025-07-24 12:14:12 +00:00
Henrik Böving	9669c6d5f1	perf: add benchmark for congruence reasoning in simp (#9511 ) This PR adds a benchmark for putting pressure on simp's congruence abilities.	2025-07-24 10:47:37 +00:00
Sebastian Ullrich	9fc31abb1f	chore: benchmark using `USE_LAKE` (#9361 )	2025-07-17 18:44:29 +00:00
Henrik Böving	097952c48f	perf: simp subexpr benchmark (#9404 ) This PR adds a simp benchmark to our suite, specifically targeting caching of subexpression rewriting results.	2025-07-17 11:53:48 +00:00
Henrik Böving	e9ccdeecd0	perf: add a benchmark for simp on local hypotheses (#9403 ) This PR adds a benchmark to our suite, specifically targeting the fact that local hypotheses are currently not indexed in simp and can thus cause significant slowdowns compared to having them as external declarations.	2025-07-16 12:16:29 +00:00
Joachim Breitner	0926d27100	chore: fix benchmark added in #9380 (#9384 )	2025-07-15 18:24:34 +00:00
Joachim Breitner	6adeab2160	chore: add simple simp benchmark (#9380 ) A micro-benchmark for plain, mostly first-order rewriting of simp: This uses axiom to make it independent of specific optimization (e.g. for `Nat`). It generates a “list” of 128 `b`s followed by 128 `a` and uses bubble-sort to to sort it and compares it against the expected output.	2025-07-15 15:04:49 +00:00
Henrik Böving	7958e01b1c	perf: basic micro benchmarks for Std.Data.TreeMap (#9250 ) This PR adds micro-benchmarks for `Std.Data.TreeMap` in the same style as for the hashmap.	2025-07-08 13:55:13 +00:00
Henrik Böving	46c43c3ecb	perf: first set of HashMap benchmarks (#9233 ) This PR adds basic microbenchmarks for `Std.Data.HashMap`	2025-07-08 08:11:52 +00:00
Paul Reichert	98e4b2882f	refactor: migrate to new ranges (#8841 ) This PR migrates usages of `Std.Range` to the new polymorphic ranges. This PR unfortunately increases the transitive imports for frequently-used parts of `Init` because the ranges now rely on iterators in order to provide their functionality for types other than `Nat`. However, iteration over ranges in compiled code is as efficient as before in the examples I checked. This is because of a special `IteratorLoop` implementation provided in the PR for this purpose. There were two issues that were uncovered during migration: * In `IndPredBelow.lean`, migrating the last remaining range causes `compilerTest1.lean` to break. I have minimized the issue and came to the conclusion it's a compiler bug. Therefore, I have not replaced said old range usage yet (see #9186). * In `BRecOn.lean`, we are publicly importing the ranges. Making this import private should theoretically work, but there seems to be a problem with the module system, causing the build to panic later in `Init.Data.Grind.Poly` (see #9185). * In `FuzzyMatching.lean`, inlining fails with the new ranges, which would have led to significant slowdown. Therefore, I have not migrated this file either.	2025-07-07 12:41:53 +00:00
Henrik Böving	6e98dfbc64	perf: bv_decide rewriting benchmark (#9231 ) This PR adds a benchmark for the rewriting engine of bv_decide, based on a problem extracted from SMT-LIB. Note that this problem has significant elaboration time itself due to its sheer size though the overall execution time is split approximately 50:50 between elaboration and rewriting.	2025-07-07 10:24:08 +00:00
Paul Reichert	c9dea51f7a	chore: create iterator benchmark (#9094 ) This PR adds a benchmark file that exemplifies some iterator usages	2025-07-01 11:47:36 +00:00

1 2 3 4 5 ...

292 commits