lean4-htt

Author	SHA1	Message	Date
Leonardo de Moura	58e599f2f9	perf: optimize congruence proof construction in `Sym.simp` (#11974 ) This PR optimizes congruence proof construction in `Sym.simp` by avoiding `inferType` calls on expressions that are less likely to be cached. Instead of inferring types of expressions like `@HAdd.hAdd Nat Nat Nat instAdd 5`, we infer the type of the function prefix `@HAdd.hAdd Nat Nat Nat instAdd` and traverse the forall telescope. The key insight is that function prefixes are more likely shared across many call sites (e.g., all `Nat` additions use the same `@HAdd.hAdd Nat Nat Nat instAdd`), so they benefit from `inferType` caching. Benchmark results show improvements on workloads with shared function prefixes: - `many_rewrites_5000`: 48.8ms → 43.1ms (-12%) - `term_tree_5000`: 53.4ms → 30.5ms (-43%)	2026-01-11 23:00:19 +00:00
Leonardo de Moura	d7cbdebf0b	chore: cleanup `simp` benchmark (#11971 )	2026-01-11 19:55:39 +00:00
Leonardo de Moura	d57f71c1c0	perf: optimize kernel type-checking for `have`-telescope simplification in `Sym.simp` (#11967 ) This PR implements a new strategy for simplifying `have`-telescopes in `Sym.simp` that achieves linear kernel type-checking time instead of quadratic. ## Problem When simplifying deep `have`-telescopes, the previous approach using `have_congr'` produced proofs that type-checked in quadratic time. The simplifier itself was fast, but the kernel became the bottleneck for large telescopes. For example, at n=100: - Before: simp = 2.4ms, kernel = 225ms - After: simp = 3.5ms, kernel = 10ms The quadratic behavior occurred because the kernel creates fresh free variables for each binder when type-checking, destroying sharing and producing O(n²) intermediate terms. ## Solution We transform sequential `have`-telescopes into a parallel beta-application form: ``` have x₁ := v₁; have x₂ := v₂[x₁]; b[x₁, x₂] ↓ (definitionally equal) (fun x₁ x₂' => b[x₁, x₂' x₁]) v₁ (fun x₁ => v₂[x₁]) ``` This parallel form leverages the efficient simplifier for lambdas in `Sym.simp`. This form enables: 1. Independent simplification of each argument 2. Proof construction using standard congruence lemmas 3. Linear kernel type-checking time The algorithm has three phases: 1. `toBetaApp`: Transform telescope → parallel beta-application 2. `simpBetaApp`: Simplify using `congr`/`congrArg`/`congrFun'` and `simpLambda` 3. `toHave`: Convert back to `have` form ## Benchmark Results ### Benchmark 1: Chain with all variables used in body \| n \| Before (simp) \| Before (kernel) \| After (simp) \| After (kernel) \| \|---\|---------------\|-----------------\|--------------\|----------------\| \| 50 \| 1.2ms \| 32ms \| 1.6ms \| 4.4ms \| \| 100 \| 2.4ms \| 225ms \| 3.5ms \| 10ms \| \| 200 \| 4.5ms \| — \| 8.4ms \| 27ms \| \| 500 \| 11.7ms \| — \| 33.6ms \| 128ms \| ### Benchmark 3: Parallel declarations (simplified values) \| n \| Before (simp) \| Before (kernel) \| After (simp) \| After (kernel) \| \|---\|---------------\|-----------------\|--------------\|----------------\| \| 50 \| 0.5ms \| 24ms \| 0.8ms \| 1.8ms \| \| 100 \| 1.2ms \| 169ms \| 1.8ms \| 5.3ms \| \| 200 \| 2.2ms \| — \| 3.9ms \| 17ms \| \| 500 \| 5.9ms \| — \| 12.3ms \| 93ms \| ### Benchmark 5: Chain with single dependency \| n \| Before (simp) \| Before (kernel) \| After (simp) \| After (kernel) \| \|---\|---------------\|-----------------\|--------------\|----------------\| \| 100 \| 1.6ms \| 6.2ms \| 1.8ms \| 6.2ms \| \| 200 \| 2.8ms \| 21.6ms \| 4.4ms \| 16.5ms \| \| 500 \| 7.3ms \| 125ms \| 12.8ms \| 72ms \| Key observations: - Kernel time is now linear in telescope depth (previously quadratic) - Simp time increases slightly due to the transformation overhead - Total time (simp + kernel) is dramatically reduced for large telescopes - The improvement is most pronounced when the body depends on many variables ## Trade-offs - Proof sizes are larger (more congruence lemma applications) - Simp time has ~1.5x overhead from the transformation - For very small telescopes (n < 10), the overhead may not pay off The optimization targets the critical path: kernel type-checking was the bottleneck preventing scaling to realistic symbolic simulation workloads.	2026-01-11 02:20:47 +00:00
Leonardo de Moura	cae739c27c	test: `implies` vs `Arrow` `Sym.simp` benchmark (#11966 )	2026-01-10 18:51:54 +00:00
Leonardo de Moura	d92cdae8e9	feat: `simpForall` and `simpArrow` in `Sym.simp` (#11950 ) This PR implements `simpForall` and `simpArrow` in `Sym.simp`.	2026-01-09 06:20:04 +00:00
Leonardo de Moura	0e4794a1a9	test: benchmarks for `lambda`-telescopes (#11929 )	2026-01-08 00:20:03 +00:00
Leonardo de Moura	8484dbad5d	test: benchmarks for `have`-telescopes (#11927 )	2026-01-07 23:24:46 +00:00
Leonardo de Moura	ff87bcb8e5	feat: add option for simplifying `have` decls in two passes (#11923 ) This PR adds a new option to the function `simpHaveTelescope` in which the `have` telescope is simplified in two passes: * In the first pass, only the values and the body are simplified. * In the second pass, unused declarations are eliminated. This new mode eliminates superlinear behavior in the benchmark `simp_3.lean`. Note that the kernel type checker still exhibits quadratic behavior in this example, because it does not have support for expanding a `have`/`let` telescope in a single step.	2026-01-07 01:58:36 +00:00
Leonardo de Moura	8154453bb5	feat: simplify `have` blocks in `Sym.simp` (#11920 ) This PR implements support for simplifying `have` telescopes in `Sym.simp`.	2026-01-07 00:10:47 +00:00
Leonardo de Moura	175661b6c3	refactor: reorganize `SymM` and `GrindM` monad hierarchy (#11909 ) This PR reorganizes the monad hierarchy for symbolic computation in Lean. ## Motivation We want a clean layering where: 1. A foundational monad (`SymM`) provides maximally shared terms and structural/syntactic `isDefEq` 2. `GrindM` builds on this foundation, adding E-graphs, congruence closure, and decision procedures 3. Symbolic execution / VCGen uses `GrindM` directly without introducing a third monad ## Changes The core symbolic computation layer still lives in `Lean.Meta.Sym`. This monad (`SymM`) provides: - Maximally shared terms with pointer-based equality - Structural/syntactic `isDefEq` and matching (no reduction, predictable cost) - Monotonic local contexts (no `revert` or `clear`), enabling O(1) metavariable validation - Efficient `intro`, `apply`, and `simp` implementations The name "Sym" reflects that this is infrastructure for symbolic computation: symbolic simulation, verification condition generation, and decision procedures. ### Updated hierarchy ``` Lean.Meta.Sym -- SymM: shared terms, syntactic isDefEq, intro, apply, simp Lean.Meta.Grind -- GrindM: E-graphs, congruence closure (extends SymM) ``` Symbolic execution is a usage pattern of `GrindM` operating on `Grind.Goal`, not a separate monad. This keeps the API surface minimal: users learn two monads, and VCGen is "how you use `GrindM`" (for users that want to use `grind`) rather than a third abstraction to understand.	2026-01-06 01:12:07 +00:00
Leonardo de Moura	82f60a7ff3	feat: `pre` and `post` may return "done" in `Sym.simp` (#11900 ) This PR adds a `done` flag to the result returned by `Simproc`s in `Sym.simp`. The `done` flag controls whether simplification should continue after the result: - `done = false` (default): Continue with subsequent simplification steps - `done = true`: Stop processing, return this result as final ## Use cases for `done = true` ### In `pre` simprocs Skip simplification of certain subterms entirely: ``` def skipLambdas : Simproc := fun e => if e.isLambda then return .rfl (done := true) else return .rfl ``` ### In `post` simprocs Perform single-pass normalization without recursive simplification: ``` def singlePassNormalize : Simproc := fun e => if let some (e', h) ← tryNormalize e then return .step e' h (done := true) else return .rfl ``` With `done = true`, the result `e'` won't be recursively simplified.	2026-01-05 02:10:06 +00:00
Leonardo de Moura	f1c903ca65	feat: simplify lambdas in `Sym.simp` (#11898 ) This PR adds support for simplifying lambda expressions in `Sym.simp`. It is much more efficient than standard simp for very large lambda expressions with many binders. The key idea is to generate a custom function extensionality theorem for the type of the lambda being simplified. This technique is compatible with the standard `simp` tactic, and will be ported in a separate PR. <img width="581" height="455" alt="image" src="https://github.com/user-attachments/assets/5911dc6c-03f0-48ed-843b-b8cb4f67ee61" /> ### `lambda` benchmark summary \| Lambda size \| MetaM (ms) \| SymM (ms) \| Speedup \| \|-------------\|------------\|-----------\|---------\| \| 50 \| 22.7 \| 0.74 \| ~31× \| \| 100 \| 120.5 \| 1.75 \| ~69× \| \| 150 \| 359.6 \| 2.90 \| ~124× \| \| 200 \| 809.5 \| 4.51 \| ~180× \|	2026-01-05 01:00:30 +00:00
Leonardo de Moura	609d99e860	chore: include free variables (#11894 ) This PR includes free variable in a `simp` benchmark to stress the default `simp` matching procedure.	2026-01-04 18:51:18 +00:00
Leonardo de Moura	78c9a01bb2	feat: check `Sym.simp` thresholds (#11890 ) This PR ensures that `Sym.simp` checks thresholds for maximum recursion depth and maximum number of steps. It also invokes `checkSystem`. Additionally, this PR simplifies the main loop. Assigned metavariables and `zetaDelta` reduction are now handled by installing `pre`/`post` methods.	2026-01-04 04:27:46 +00:00
Leonardo de Moura	bc72487aed	refactor: `Sym.simp` (#11888 ) This PR refactors `Sym.simp` to make it more general and customizable. It also moves the code to its own subdirectory `Meta/Sym/Simp`.	2026-01-04 02:17:23 +00:00
Leonardo de Moura	b40dabdecd	feat: add discrimination tree retrieval for `Sym` (#11886 ) This PR adds `getMatch` and `getMatchWithExtra` for retrieving patterns from discrimination trees in the symbolic simulation framework. The PR also adds uses `DiscrTree` to implement indexing in `Sym.simp`.	2026-01-03 20:28:07 +00:00
Leonardo de Moura	4e8b5cfc46	test: benchmark `Sym` and `Meta` simplifiers (#11870 ) This PR adds simple benchmarks for comparing the `MetaM` and `SymM` simplifiers. The `SymM` simplifier is still working in progress. ### Big picture across benchmarks \| Benchmark \| MetaM scaling \| SymM scaling \| Speedup (approx.) \| \|-------------------------\|-------------------\|--------------\|-------------------\| \| `trans_chain` \| Linear \| Linear \| ~8–9× \| \| `congr_arg_explosion` \| Super-linear \| Linear \| ~100× \| \| `many_rewrites` \| Super-linear \| Linear \| ~10–16× \| <img width="598" height="455" alt="image" src="https://github.com/user-attachments/assets/8bd9021b-b9cf-4fc0-aab4-3118d87f7c22" /> <img width="644" height="455" alt="image" src="https://github.com/user-attachments/assets/0234dc11-0be7-441a-83b6-c309d20a2663" /> <img width="611" height="455" alt="image" src="https://github.com/user-attachments/assets/df79d057-25ed-49d9-a8f3-5285e5fc7013" />	2026-01-02 03:59:54 +00:00

17 commits