lean4-htt

Author	SHA1	Message	Date
Joachim Breitner	c2918b2701	test: add benchmark for #11992 (#11997 )	2026-01-13 21:15:32 +00:00
Leonardo de Moura	58e599f2f9	perf: optimize congruence proof construction in `Sym.simp` (#11974 ) This PR optimizes congruence proof construction in `Sym.simp` by avoiding `inferType` calls on expressions that are less likely to be cached. Instead of inferring types of expressions like `@HAdd.hAdd Nat Nat Nat instAdd 5`, we infer the type of the function prefix `@HAdd.hAdd Nat Nat Nat instAdd` and traverse the forall telescope. The key insight is that function prefixes are more likely shared across many call sites (e.g., all `Nat` additions use the same `@HAdd.hAdd Nat Nat Nat instAdd`), so they benefit from `inferType` caching. Benchmark results show improvements on workloads with shared function prefixes: - `many_rewrites_5000`: 48.8ms → 43.1ms (-12%) - `term_tree_5000`: 53.4ms → 30.5ms (-43%)	2026-01-11 23:00:19 +00:00
Leonardo de Moura	d7cbdebf0b	chore: cleanup `simp` benchmark (#11971 )	2026-01-11 19:55:39 +00:00
Leonardo de Moura	d57f71c1c0	perf: optimize kernel type-checking for `have`-telescope simplification in `Sym.simp` (#11967 ) This PR implements a new strategy for simplifying `have`-telescopes in `Sym.simp` that achieves linear kernel type-checking time instead of quadratic. ## Problem When simplifying deep `have`-telescopes, the previous approach using `have_congr'` produced proofs that type-checked in quadratic time. The simplifier itself was fast, but the kernel became the bottleneck for large telescopes. For example, at n=100: - Before: simp = 2.4ms, kernel = 225ms - After: simp = 3.5ms, kernel = 10ms The quadratic behavior occurred because the kernel creates fresh free variables for each binder when type-checking, destroying sharing and producing O(n²) intermediate terms. ## Solution We transform sequential `have`-telescopes into a parallel beta-application form: ``` have x₁ := v₁; have x₂ := v₂[x₁]; b[x₁, x₂] ↓ (definitionally equal) (fun x₁ x₂' => b[x₁, x₂' x₁]) v₁ (fun x₁ => v₂[x₁]) ``` This parallel form leverages the efficient simplifier for lambdas in `Sym.simp`. This form enables: 1. Independent simplification of each argument 2. Proof construction using standard congruence lemmas 3. Linear kernel type-checking time The algorithm has three phases: 1. `toBetaApp`: Transform telescope → parallel beta-application 2. `simpBetaApp`: Simplify using `congr`/`congrArg`/`congrFun'` and `simpLambda` 3. `toHave`: Convert back to `have` form ## Benchmark Results ### Benchmark 1: Chain with all variables used in body \| n \| Before (simp) \| Before (kernel) \| After (simp) \| After (kernel) \| \|---\|---------------\|-----------------\|--------------\|----------------\| \| 50 \| 1.2ms \| 32ms \| 1.6ms \| 4.4ms \| \| 100 \| 2.4ms \| 225ms \| 3.5ms \| 10ms \| \| 200 \| 4.5ms \| — \| 8.4ms \| 27ms \| \| 500 \| 11.7ms \| — \| 33.6ms \| 128ms \| ### Benchmark 3: Parallel declarations (simplified values) \| n \| Before (simp) \| Before (kernel) \| After (simp) \| After (kernel) \| \|---\|---------------\|-----------------\|--------------\|----------------\| \| 50 \| 0.5ms \| 24ms \| 0.8ms \| 1.8ms \| \| 100 \| 1.2ms \| 169ms \| 1.8ms \| 5.3ms \| \| 200 \| 2.2ms \| — \| 3.9ms \| 17ms \| \| 500 \| 5.9ms \| — \| 12.3ms \| 93ms \| ### Benchmark 5: Chain with single dependency \| n \| Before (simp) \| Before (kernel) \| After (simp) \| After (kernel) \| \|---\|---------------\|-----------------\|--------------\|----------------\| \| 100 \| 1.6ms \| 6.2ms \| 1.8ms \| 6.2ms \| \| 200 \| 2.8ms \| 21.6ms \| 4.4ms \| 16.5ms \| \| 500 \| 7.3ms \| 125ms \| 12.8ms \| 72ms \| Key observations: - Kernel time is now linear in telescope depth (previously quadratic) - Simp time increases slightly due to the transformation overhead - Total time (simp + kernel) is dramatically reduced for large telescopes - The improvement is most pronounced when the body depends on many variables ## Trade-offs - Proof sizes are larger (more congruence lemma applications) - Simp time has ~1.5x overhead from the transformation - For very small telescopes (n < 10), the overhead may not pay off The optimization targets the critical path: kernel type-checking was the bottleneck preventing scaling to realistic symbolic simulation workloads.	2026-01-11 02:20:47 +00:00
Sebastian Ullrich	eaf8cf15ff	test: add `leanchecker` benchmark (#11959 )	2026-01-10 20:52:11 +00:00
Leonardo de Moura	cae739c27c	test: `implies` vs `Arrow` `Sym.simp` benchmark (#11966 )	2026-01-10 18:51:54 +00:00
Leonardo de Moura	d92cdae8e9	feat: `simpForall` and `simpArrow` in `Sym.simp` (#11950 ) This PR implements `simpForall` and `simpArrow` in `Sym.simp`.	2026-01-09 06:20:04 +00:00
Leonardo de Moura	0e4794a1a9	test: benchmarks for `lambda`-telescopes (#11929 )	2026-01-08 00:20:03 +00:00
Leonardo de Moura	8484dbad5d	test: benchmarks for `have`-telescopes (#11927 )	2026-01-07 23:24:46 +00:00
Leonardo de Moura	ff87bcb8e5	feat: add option for simplifying `have` decls in two passes (#11923 ) This PR adds a new option to the function `simpHaveTelescope` in which the `have` telescope is simplified in two passes: * In the first pass, only the values and the body are simplified. * In the second pass, unused declarations are eliminated. This new mode eliminates superlinear behavior in the benchmark `simp_3.lean`. Note that the kernel type checker still exhibits quadratic behavior in this example, because it does not have support for expanding a `have`/`let` telescope in a single step.	2026-01-07 01:58:36 +00:00
Leonardo de Moura	8154453bb5	feat: simplify `have` blocks in `Sym.simp` (#11920 ) This PR implements support for simplifying `have` telescopes in `Sym.simp`.	2026-01-07 00:10:47 +00:00
Leonardo de Moura	175661b6c3	refactor: reorganize `SymM` and `GrindM` monad hierarchy (#11909 ) This PR reorganizes the monad hierarchy for symbolic computation in Lean. ## Motivation We want a clean layering where: 1. A foundational monad (`SymM`) provides maximally shared terms and structural/syntactic `isDefEq` 2. `GrindM` builds on this foundation, adding E-graphs, congruence closure, and decision procedures 3. Symbolic execution / VCGen uses `GrindM` directly without introducing a third monad ## Changes The core symbolic computation layer still lives in `Lean.Meta.Sym`. This monad (`SymM`) provides: - Maximally shared terms with pointer-based equality - Structural/syntactic `isDefEq` and matching (no reduction, predictable cost) - Monotonic local contexts (no `revert` or `clear`), enabling O(1) metavariable validation - Efficient `intro`, `apply`, and `simp` implementations The name "Sym" reflects that this is infrastructure for symbolic computation: symbolic simulation, verification condition generation, and decision procedures. ### Updated hierarchy ``` Lean.Meta.Sym -- SymM: shared terms, syntactic isDefEq, intro, apply, simp Lean.Meta.Grind -- GrindM: E-graphs, congruence closure (extends SymM) ``` Symbolic execution is a usage pattern of `GrindM` operating on `Grind.Goal`, not a separate monad. This keeps the API surface minimal: users learn two monads, and VCGen is "how you use `GrindM`" (for users that want to use `grind`) rather than a third abstraction to understand.	2026-01-06 01:12:07 +00:00
Leonardo de Moura	82f60a7ff3	feat: `pre` and `post` may return "done" in `Sym.simp` (#11900 ) This PR adds a `done` flag to the result returned by `Simproc`s in `Sym.simp`. The `done` flag controls whether simplification should continue after the result: - `done = false` (default): Continue with subsequent simplification steps - `done = true`: Stop processing, return this result as final ## Use cases for `done = true` ### In `pre` simprocs Skip simplification of certain subterms entirely: ``` def skipLambdas : Simproc := fun e => if e.isLambda then return .rfl (done := true) else return .rfl ``` ### In `post` simprocs Perform single-pass normalization without recursive simplification: ``` def singlePassNormalize : Simproc := fun e => if let some (e', h) ← tryNormalize e then return .step e' h (done := true) else return .rfl ``` With `done = true`, the result `e'` won't be recursively simplified.	2026-01-05 02:10:06 +00:00
Leonardo de Moura	f1c903ca65	feat: simplify lambdas in `Sym.simp` (#11898 ) This PR adds support for simplifying lambda expressions in `Sym.simp`. It is much more efficient than standard simp for very large lambda expressions with many binders. The key idea is to generate a custom function extensionality theorem for the type of the lambda being simplified. This technique is compatible with the standard `simp` tactic, and will be ported in a separate PR. <img width="581" height="455" alt="image" src="https://github.com/user-attachments/assets/5911dc6c-03f0-48ed-843b-b8cb4f67ee61" /> ### `lambda` benchmark summary \| Lambda size \| MetaM (ms) \| SymM (ms) \| Speedup \| \|-------------\|------------\|-----------\|---------\| \| 50 \| 22.7 \| 0.74 \| ~31× \| \| 100 \| 120.5 \| 1.75 \| ~69× \| \| 150 \| 359.6 \| 2.90 \| ~124× \| \| 200 \| 809.5 \| 4.51 \| ~180× \|	2026-01-05 01:00:30 +00:00
Leonardo de Moura	609d99e860	chore: include free variables (#11894 ) This PR includes free variable in a `simp` benchmark to stress the default `simp` matching procedure.	2026-01-04 18:51:18 +00:00
Leonardo de Moura	78c9a01bb2	feat: check `Sym.simp` thresholds (#11890 ) This PR ensures that `Sym.simp` checks thresholds for maximum recursion depth and maximum number of steps. It also invokes `checkSystem`. Additionally, this PR simplifies the main loop. Assigned metavariables and `zetaDelta` reduction are now handled by installing `pre`/`post` methods.	2026-01-04 04:27:46 +00:00
Leonardo de Moura	bc72487aed	refactor: `Sym.simp` (#11888 ) This PR refactors `Sym.simp` to make it more general and customizable. It also moves the code to its own subdirectory `Meta/Sym/Simp`.	2026-01-04 02:17:23 +00:00
Leonardo de Moura	b40dabdecd	feat: add discrimination tree retrieval for `Sym` (#11886 ) This PR adds `getMatch` and `getMatchWithExtra` for retrieving patterns from discrimination trees in the symbolic simulation framework. The PR also adds uses `DiscrTree` to implement indexing in `Sym.simp`.	2026-01-03 20:28:07 +00:00
Leonardo de Moura	4e8b5cfc46	test: benchmark `Sym` and `Meta` simplifiers (#11870 ) This PR adds simple benchmarks for comparing the `MetaM` and `SymM` simplifiers. The `SymM` simplifier is still working in progress. ### Big picture across benchmarks \| Benchmark \| MetaM scaling \| SymM scaling \| Speedup (approx.) \| \|-------------------------\|-------------------\|--------------\|-------------------\| \| `trans_chain` \| Linear \| Linear \| ~8–9× \| \| `congr_arg_explosion` \| Super-linear \| Linear \| ~100× \| \| `many_rewrites` \| Super-linear \| Linear \| ~10–16× \| <img width="598" height="455" alt="image" src="https://github.com/user-attachments/assets/8bd9021b-b9cf-4fc0-aab4-3118d87f7c22" /> <img width="644" height="455" alt="image" src="https://github.com/user-attachments/assets/0234dc11-0be7-441a-83b6-c309d20a2663" /> <img width="611" height="455" alt="image" src="https://github.com/user-attachments/assets/df79d057-25ed-49d9-a8f3-5285e5fc7013" />	2026-01-02 03:59:54 +00:00
Paul Reichert	05664b15a3	fix: update naming of `FinitenessRelation` fields in the `sigmaIterator.lean` benchmark (#11836 ) This PR fixes a broken benchmark that uses an outdated naming of `FinitenessRelation` and `ProductivenessRelation`'s fields.	2025-12-29 23:13:13 +00:00
Paul Reichert	5ef0207a85	refactor: remove IteratorCollect (#11706 ) This PR removes the `IteratorCollect` type class and hereby simplifies the iterator API. Its limited advantages did not justify the complexity cost.	2025-12-17 23:02:33 +00:00
Paul Reichert	eb20c07b4a	fix: fix broken benchmarks from #11446 (#11681 ) This PR fixes benchmarks that were broken by #11446.	2025-12-15 09:35:42 +00:00
Paul Reichert	c79d74d9a1	refactor: move `Iter` and others from `Std.Iterators` to `Std` (#11446 ) This PR moves many constants of the iterator API from `Std.Iterators` to the `Std` namespace in order to make them more convenient to use. These constants include, but are not limited to, `Iter`, `IterM` and `IteratorLoop`. This is a breaking change. If something breaks, try adding `open Std` in order to make these constants available again. If some constants in the `Std.Iterators` namespace cannot be found, they can be found directly in `Std` now.	2025-12-15 08:24:12 +00:00
Henrik Böving	5339c47555	chore: benchmark for charactersIn (#11643 )	2025-12-12 22:23:51 +00:00
Sebastian Ullrich	1f80b3ffbe	feat: module system is no longer experimental (#11637 ) This PR declares the module system as no longer experimental and makes the `experimental.module` option a no-op, to be removed.	2025-12-12 21:20:26 +00:00
Henrik Böving	59045c6227	chore: make workspaceSymbols benchmark independent of sorry search (#11642 )	2025-12-12 20:10:27 +00:00
Paul Reichert	383c0caa91	feat: remove `Finite` conditions from iterator consumers relying on a new fixpoint combinator (#11038 ) This PR introduces a new fixpoint combinator, `WellFounded.extrinsicFix`. A termination proof, if provided at all, can be given extrinsically, i.e., looking at the term from the outside, and is only required if one intends to formally verify the behavior of the fixpoint. The new combinator is then applied to the iterator API. Consumers such as `toList` or `ForIn` no longer require a proof that the underlying iterator is finite. If one wants to ensure the termination of them intrinsically, there are strictly terminating variants available as, for example, `it.ensureTermination.toList` instead of `it.toList`.	2025-12-08 16:03:22 +00:00
Joachim Breitner	b94cf2c9be	test: add big match on nat lit benchmarks (#11502 ) This PR adds two benchmarks for elaborating match statements of many `Nat` literals, one without and one with splitter generation.	2025-12-04 08:21:56 +00:00
Alok Singh	1e1ed16a05	doc: correct typos in documentation and comments (#11465 ) This PR fixes various typos across the codebase in documentation and comments. - `infered` → `inferred` (ParserCompiler.lean) - `declartation` → `declaration` (Cleanup.lean) - `certian` → `certain` (CasesInfo.lean) - `wil` → `will` (Cache.lean) - `the the` → `the` (multiple files - PrefixTree.lean, Sum/Basic.lean, List/Nat/Perm.lean, Time.lean, Bounded.lean, Lake files) - `to to` → `to` (MutualInductive.lean, simp_bubblesort_256.lean) - Grammar improvements in Bounded.lean and Time.lean All changes are to comments and documentation only - no functional changes. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <noreply@anthropic.com>	2025-12-02 06:38:05 +00:00
Garmelon	ca23ed0c17	chore: fix "tests/compiler//sum binary sizes" benchmark (#11444 ) The bench script expected no output on stdout from `compile.sh`, which was not always the case. Now, it separates the compilation and size measurement steps.	2025-12-01 16:20:34 +00:00
Marc Huisinga	af5b47295f	feat: reduce server memory consumption (#11162 ) This PR reduces the memory consumption of the language server (the watchdog process in particular). In Mathlib, it reduces memory consumption by about 1GB. It also fixes two bugs in the call hierarchy: - When an open file had import errors (e.g. from a transitive build failure), the call hierarchy would not display any usages in that file. Now we use the reference information from the .ilean instead. - When a command would not set a parent declaration (e.g. `#check`), the result was filtered from the call hierarchy. Now we display it as `[anonymous]` instead.	2025-12-01 10:53:23 +00:00
Garmelon	debafca7e1	chore: add radar-based bench suite for stdlib (#11264 ) This PR adds a new [radar]-based [temci]-less bench suite that replaces the `stdlib` benchmarks from the old suite and also measures per-module instruction counts. All other benchmarks from the old suite are unaffected. The readme at `tests/bench-radar/README.md` explains in more detail how the bench suite is structured and how it works. The readmes in the benchmark subdirectories explain what each benchmark does and which metrics it collects. All metrics except `stdlib//max dynamic symbols` were ported to the new suite, though most have been renamed. [radar]: https://github.com/leanprover/radar [temci]: https://github.com/parttimenerd/temci	2025-11-25 12:59:30 +00:00
Paul Reichert	6da35eeccb	refactor: increase runtime of "sigma iterator" benchmark (#11336 ) This PR makes the "sigma iterator" benchmark more compute-intensive because it was too fast and therefore flaky.	2025-11-24 12:21:27 +00:00
Sebastian Ullrich	bfbad53540	fix: avoid storing reference to environment in realization result to prevent promise cycle (#11328 ) This PR fixes freeing memory accidentally retained for each document version in the language server on certain elaboration workloads. The issue must have existed since 4.18.0.	2025-11-24 10:16:56 +00:00
Paul Reichert	2980155f5c	refactor: simplify `ToIterator` (#11242 ) This PR significantly changes the signature of the `ToIterator` type class. The obtained iterators' state is no longer dependently typed and is an `outParam` instead of being bundled inside the class. Among other benefits, `simp` can now rewrite inside of `Slice.toList` and `Slice.toArray`. The downside is that we lose flexibility. For example, the former combinator-based implementation of `Subarray`'s iterators is no longer feasible because the states are dependently typed. Therefore, this PR provides a hand-written iterator for `Subarray`, which does not require a dependently typed state and is faster than the previous one. Converting a family of dependently typed iterators into a simply typed one using a `Sigma`-state iterator generates forbiddingly bad code, so that we do provide such a combinator. This PR adds a benchmark for this problem.	2025-11-22 12:37:18 +00:00
Marc Huisinga	2f1e258a5e	test: re-enable re-elab benchmarks and add watchdog re-elab benchmark (#11284 )	2025-11-20 22:53:08 +00:00
Joachim Breitner	8ef742647e	test: benchmark for large partial match (#11199 ) Creates an inductive data type with 100 constructors, and a function that does matches on half of its constructors, with a catch-all for the other half, and generates the splitter. Related to #11183.	2025-11-16 11:20:31 +00:00
Johannes Tantow	100006fdd0	feat: verify `all` and `any` for hash maps (#10765 ) This PR extends the `all`/`any` functions from hash sets to hash maps and dependent hash maps and verifies them.	2025-11-15 16:59:37 +00:00
Joachim Breitner	d41f39fb10	perf: sparse case splitting in match compilation (#10823 ) This PR lets the match compilation procedure use sparse case analysis when the patterns only match on some but not all constructors of an inductive type. This way, less code is produce. Before, code handling each of the other cases was then optimized and commoned-up by later compilation pipeline, but that is wasteful to do. In some cases this will prevent Lean from noticing that a match statement is complete because it performs less case-splitting for the unreachable case. In this case, give explicit patterns to perform the deeper split with `by contradiction` as the right-hand side. At least temporarily, there is also the option to disable this behaviour with ``` set_option backwards.match.sparseCases false ```	2025-11-06 13:46:35 +00:00
Joachim Breitner	d8a67095d6	chore: make workspaceSymbol benchmarks modules (#11094 ) This PR makes workspaceSymbol benchmarks `module`s, so that they are less sensitive to additions of private symbols in the standard library.	2025-11-05 18:40:39 +00:00
Kim Morrison	4887eeb77c	chore: remove >6 month old deprecations (#10968 )	2025-10-26 10:01:30 +00:00
Henrik Böving	881a131ad3	chore: re-enable tests (#10923 )	2025-10-23 08:38:57 +00:00
Rob23oba	fad0e69cc7	fix: make name mangling unambiguous (#10727 ) This PR fixes name mangling to be unambiguous / injective by adding `00` for disambiguation where necessary. Additionally, the inverse function, `Lean.Name.unmangle` has been added which can be used to unmangle a mangled identifier. This unmangler has been added to demonstrate the injectivity but also to allow unmangling identifiers e.g. for debugging purposes. Closes #10724	2025-10-23 07:18:07 +00:00
Markus Himmel	b28daa6d60	chore: rename `String.endPos` -> `String.rawEndPos` (#10853 ) This PR renames `String.endPos` to `String.rawEndPos`, as in a future release the name `String.endPos` will be taken by the function that is currently called `String.endValidPos`.	2025-10-21 11:25:30 +00:00
Paul Reichert	f58999a7a6	refactor: use `Shrink` stub in the iterator framework (#10725 ) This PR introduces a no-op version of `Shrink`, a type that should allow shrinking small types into smaller universes given a proof that the type is small enough, and uses it in the iterator library. Because this type would require special compiler support, the current version is just a wrapper around the inner type so that the wrapper is equivalent, but not definitionally equivalent. While `Shrink` is unable to shrink universes right now, but introducing it now will allow us to generalize the universes in the iterator library with fewer breaking changes as soon as an actual `Shrink` is possible.	2025-10-14 10:22:14 +00:00
Markus Himmel	5c707d936c	chore: rename `Stream` to `Std.Stream` (#10645 ) This PR renames `Stream` to `Std.Stream` so that the name becomes available to mathlib after a deprecation cycle.	2025-10-02 15:25:56 +00:00
Paul Reichert	89686fcd02	refactor: replace `PRange shape α` with `Rcc α` and eight other types (#10319 ) This PR "monomorphizes" the structure `Std.PRange shape α`, replacing it with nine distinct structures `Std.Rcc`, `Std.Rco`, `Std.Rci` etc., one for each possible shape of a range's bounds. This change was necessary because the shape polymorphism is detrimental to attempts of automation. BREAKING CHANGE: While range/slice notation itself is unchanged, this essentially breaks the entire remaining (polymorphic) range and slice API except for the dot-notation(`toList`, `iter`, ...). It is not possible to deprecate old declarations that were formulated in a shape-polymorphic way that is not available anymore.	2025-10-02 06:45:11 +00:00
Marc Huisinga	dfd3d18530	test: improve language server test coverage (#10574 ) This PR significantly improves the test coverage of the language server, providing at least a single basic test for every request that is used by the client. It also implements infrastructure for testing all of these requests, e.g. the ability to run interactive tests in a project context and refactors the interactive test runner to be more maintainable. Finally, it also fixes a small bug with the recently implemented unknown identifier code actions for auto-implicits (#10442) that was discovered in testing, where the "import all unambiguous unknown identifiers" code action didn't work correctly on auto-implicit identifiers.	2025-09-30 11:15:03 +00:00
Joachim Breitner	1374445081	chore: update bench/riskv-ast.lean (#10505 ) This PR disables `trace.profiler` in `bench/riskv-ast.lean`. We don't want to optimize the trace profiler, but normal code. While at it, I removed the `#exit` to cover more of the file. While at it, also import the latest from from upstream.	2025-09-24 11:46:26 +00:00
Garmelon	8b64425033	chore: set temci tags for the radar bench script (#10527 ) The radar bench scripts at https://github.com/leanprover/radar-bench-lean4/ split up the benchmarks between the two runners based on the tags: One runner filters by the tag `stdlib` while the other filters by the tag `other`. Only benchmarks using one of these tags will be run, and any benchmark tagged with both will waste electricity. As far as I know, the tags are unused otherwise, so I just replaced all the old tags.	2025-09-23 19:51:10 +00:00

1 2 3 4 5 ...

323 commits