lean4-htt

Author	SHA1	Message	Date
Kim Morrison	ea221f3283	feat: `Nat.(fold|foldRev|any|all)M?` take a function which sees the upper bound (#6139 ) This PR modifies the signature of the functions `Nat.fold`, `Nat.foldRev`, `Nat.any`, `Nat.all`, so that the function is passed the upper bound. This allows us to change runtime array bounds checks to compile time checks in many places.	2024-11-22 03:05:51 +00:00
Kim Morrison	72e952eadc	chore: avoid runtime array bounds checks (#6134 ) This PR avoids runtime array bounds checks in places where it can trivially be done at compile time. None of these changes are of particular consequence: I mostly wanted to learn how much we do this, and what the obstacles are to doing it less.	2024-11-21 05:04:52 +00:00
Kim Morrison	f85c66789d	feat: Array.insertIdx/eraseIdx take a tactic-provided proof (#6133 ) This PR replaces `Array.feraseIdx` and `Array.insertAt` with `Array.eraseIdx` and `Array.insertIdx`, both of which take a `Nat` argument and a tactic-provided proof that it is in bounds. We also have `eraseIdxIfInBounds` and `insertIdxIfInBounds` which are no-ops if the index is out of bounds. We also provide a `Fin` valued version of `Array.findIdx?`. Together, these quite ergonomically improve the array indexing safety at a number of places in the compiler/elaborator.	2024-11-20 09:52:38 +00:00
Markus Himmel	688ee4c887	fix: constant folding for Nat.ble and Nat.blt (#6087 ) This PR fixes a bug in the constant folding for the `Nat.ble` and `Nat.blt` function in the old code generator, leading to a miscompilation. Closes #6086	2024-11-15 12:09:52 +00:00
Kim Morrison	3a408e0e54	feat: change `Array.get` to take a `Nat` and a proof (#6032 ) This PR changes the signature of `Array.get` to take a Nat and a proof, rather than a `Fin`, for consistency with the rest of the (planned) Array API. Note that because of bootstrapping issues we can't provide `get_elem_tactic` as an autoparameter for the proof. As users will mostly use the `xs[i]` notation provided by `GetElem`, this hopefully isn't a problem. We may restore `Fin` based versions, either here or downstream, as needed, but they won't be the "main" functions. --------- Co-authored-by: David Thrane Christiansen <david@davidchristiansen.dk>	2024-11-12 03:30:46 +00:00
Kim Morrison	218601009b	chore: rename Array.back to back! (#5897 )	2024-10-31 09:18:18 +00:00
Kyle Miller	03c6e99ef7	fix: bring elaborator in line with kernel for primitive projections (#5822 ) The kernel supports primitive projections for all inductive types with one constructor. The elaborator was assuming primitive projections only work for "structure-likes", non-recursive inductive types with no indices. Enables numeric projection notation for general one-constructor inductives. Extracted from #5783.	2024-10-31 03:16:52 +00:00
Kim Morrison	71122696a1	feat: rename Array.shrink to take, and relate to List.take (#5796 )	2024-10-21 23:35:32 +00:00
Joachim Breitner	60096e7d15	refactor: more idiomatic syntax for `if h:` (#5567 ) https://github.com/leanprover/lean4/pull/5552 introduced a fair number of `if h:`, but the slightly preferred style is `if h :`, with a space, so here goes a quick `sed`.	2024-10-01 15:23:54 +00:00
TomasPuverle	ddec5336e5	chore: switch obvious cases of array "bang"`[]!` indexing to rely on hypothesis (#5552 ) Update certain uses of `arr[i]!` to use the "provably correct" version `arr[i]`, in order to use "best practices". Some motivation and discussion on [Zulip](https://leanprover.zulipchat.com/#narrow/stream/270676-lean4/topic/Lean.20compiler.2Felaborator.20development.20question/near/472934715)	2024-10-01 11:12:22 +00:00
Kim Morrison	a6f0112fc5	feat: refactor of Array (#5452 ) This is a second attempt at #5446, first reverting parts of #5403.	2024-09-24 12:57:55 +00:00
euprunin	ba43ce18c3	chore: remove repeated words (#5438 ) Co-authored-by: euprunin <euprunin@users.noreply.github.com> Co-authored-by: Kim Morrison <scott.morrison@gmail.com>	2024-09-24 03:40:11 +00:00
euprunin	50339e38d9	chore: fix spelling mistakes in src/Lean/ (#5426 ) Co-authored-by: euprunin <euprunin@users.noreply.github.com>	2024-09-23 14:56:59 +00:00
euprunin	8b61dda964	chore: fix spelling mistakes in error messages/exceptions (#5425 ) Co-authored-by: euprunin <euprunin@users.noreply.github.com>	2024-09-23 13:19:05 +00:00
Kim Morrison	e551a366a0	feat: theorems about List.toArray (#5403 )	2024-09-23 05:24:03 +00:00
Eric Wieser	46b16b6df1	doc: explain the borrow syntax (#4305 ) Obviously a link to the web docs isn't ideal, but having hovers available on the symbol is much better than nothing. --------- Co-authored-by: David Thrane Christiansen <david@davidchristiansen.dk> Co-authored-by: Sebastian Ullrich <sebasti@nullri.ch>	2024-09-17 09:52:41 +00:00
Kim Morrison	4e0f6b8b45	feat: export Bool.and/or/not/xor	2024-09-16 12:45:51 +10:00
Kyle Miller	7a7440f59b	feat: have IR checker suggest `noncomputable` (#4729 ) Currently, `ll_infer_type` is responsible for telling the user about `noncomputable` when a definition depends on one without executable code. However, this is imperfect because type inference does not check every subexpression. This leads to errors later on that users find to be hard to interpret. Now, `Lean.IR.checkDecls` has a friendlier error message when it encounters constants without compiled definitions, suggesting to consider using `noncomputable`. While this function is an internal IR consistency check, it is also reasonable to have it give an informative error message in this particular case. The suggestion to use `noncomputable` is limited to just unknown constants. Some alternatives would be to either (1) create another checker just for missing constants, (2) change `ll_infer_type` to always visit every subexpression no matter if they are necessary for inferring the type, or (3) investigate whether `tests/lean/run/1785.lean` is due to a deeper issue. Closes #1785	2024-09-07 22:00:21 +00:00
Sebastian Ullrich	9d0302e749	chore: remove LEAN_EXPORT denylist workaround	2024-08-12 14:14:42 +02:00
David Thrane Christiansen	bcbd7299e9	fix: export more symbols needed by Verso (#4956 ) This enables the Verso LSP server extensions to work.	2024-08-08 13:31:34 +00:00
Markus Himmel	b144107ed5	chore: deprecate Lean.HashMap and Lean.HashSet (#4954 ) This restores all of the imports of `Lean.Data.HashMap` and `Lean.Data.HashSet` so that users actually see the deprecation warnings instead of a "declaration not found" error.	2024-08-08 12:46:10 +00:00
Markus Himmel	4bac74c4ac	chore: switch to Std.HashMap and Std.HashSet almost everywhere	2024-08-07 18:24:42 +02:00
David Thrane Christiansen	32b9de8c77	fix: export symbols needed by Verso (#4884 ) Verso needed a symbol that was unexported - this exposes it again.	2024-08-01 04:56:27 +00:00
Joachim Breitner	1a12f63f74	refactor: move Syntax.hasIdent, shake dependencies (#4766 ) I noticed that a change to `Lean.PrettyPrinter.Delaborator.Builtins` rebuilt more modules than I expected, so I moved a definition and reduced some dependencies. Further reduction would be possible by moving const-delaboration out of the big `Lean.PrettyPrinter` and importing from `Lean.PrettyPrinter` selectively.	2024-07-16 21:19:26 +00:00
Sebastian Ullrich	f167cfba71	chore: exclude more symbols to get below Windows symbol limit	2024-07-15 23:19:04 +02:00
Mac Malone	0d529e18a6	feat: expose flags for the bundled C compiler (#4477 ) Expose the C compiler and linker flags used with the bundled compiler (clang) to Lean code. This is needed to skip the use of `leanc` in Lake.	2024-06-22 01:23:33 +00:00
Mario Carneiro	0a1a855ba8	fix: validate UTF-8 at C++ -> Lean boundary (#3963 ) Continuation of #3958. To ensure that lean code is able to uphold the invariant that `String`s are valid UTF-8 (which is assumed by the lean model), we have to make sure that no lean objects are created with invalid UTF-8. #3958 covers the case of lean code creating strings via `fromUTF8Unchecked`, but there are still many cases where C++ code constructs strings from a `const char *` or `std::string` with unclear UTF-8 status. To address this and minimize accidental missed validation, the `(lean_)mk_string` function is modified to validate UTF-8. The original function is renamed to `mk_string_unchecked`, with several other variants depending on whether we know the string is UTF-8 or ASCII and whether we have the length and/or utf8 char count on hand. I reviewed every function which leads to `mk_string` or its variants in the C code, and used the appropriate validation function, defaulting to `mk_string` if the provenance is unclear. This PR adds no new error handling paths, meaning that incorrect UTF-8 will still produce incorrect results in e.g. IO functions, they are just not causing unsound behavior anymore. A subsequent PR will handle adding better error reporting for bad UTF-8.	2024-06-19 14:05:48 +00:00
Sebastian Ullrich	d85d3d5f3a	fix: accidental ownership with specialization	2024-06-07 13:59:22 +02:00
Leonardo de Moura	b53a74d6fd	fix: miscompilation in constant folding (#4311 ) closes #4306	2024-05-31 04:24:45 +00:00
Sebastian Ullrich	dfb496a271	fix: allow multiple `declareBuiltin` per declaration	2024-05-23 17:23:39 +02:00
Leonardo de Moura	6a040ab068	feat: propagate `maxHeartbeats` to kernel (#4113 ) Co-authored-by: Sebastian Ullrich <sebasti@nullri.ch>	2024-05-09 17:44:19 +00:00
Leonardo de Moura	93c06c0552	feat: relaxed reset/reuse in the code generator (#4100 ) closes #4089	2024-05-07 22:08:32 +00:00
Leonardo de Moura	5f1c4df07d	feat: display diagnostic information at term and tactic `set_option diagnostics true` (#4048 ) We don't need to include reduction info at `simp` diagnostic information.	2024-05-01 22:47:57 +00:00
Leonardo de Moura	27c79cb614	fix: double `reset` bug at `ResetReuse` (#4028 ) We conjecture this is the cause for the segfaults when compiling Mathlib with #4006	2024-04-29 23:26:07 +00:00
Leonardo de Moura	dc442ec137	fix: theorems should never be marked as `extern` (#4018 )	2024-04-29 00:01:49 +00:00
Leonardo de Moura	9d14c0456b	feat: add `set_option diag true` for diagnostic counters (#4016 ) It currently only reports how many times each declaration has been unfolded, and how often the `isDefEq` heuristic for `f a =?= f b` has been used. Only counters above the threshold are reported.	2024-04-28 22:14:08 +00:00
Sebastian Ullrich	2dcd42f395	feat: trace.profiler export to Firefox Profiler (#3801 ) Reusing the best profiling UI out there. Usage: ``` lean -Dtrace.profiler=true -Dtrace.profiler.output=profile.json foo.lean ... ``` then open `profile.json` in https://profiler.firefox.com/. See also `script/collideProfiles.lean` for minimizing and merging profiles.	2024-04-15 12:13:14 +00:00
Leonardo de Moura	0684c95d35	fix: do not lift `(<- ...)` over pure if-then-else (#3820 ) Now, only `(<- ...)`s occurring in the condition of a pure if-then-else are lifted. That is, `if (<- foo) then ... else ...` is ok, but `if ... then (<- foo) else ...` is not. See #3713 closes #3713 This PR also adjusts this repo. Note that some of the `(<- ...)` were harmless since they were just accessing some read-only state.	2024-04-01 21:33:59 +00:00
David Thrane Christiansen	966fa800f8	chore: remove the coercion from String to Name (#3589 ) This coercion caused difficult-to-diagnose bugs sometimes. Because there are some situations where converting a string to a name should be done by parsing the string, and others where it should not, an explicit choice seems better here. --------- Co-authored-by: Mac Malone <tydeu@hatpress.net>	2024-03-21 23:46:03 +00:00
Mario Carneiro	49f66dc485	perf: rewrite UnusedVariables lint (#3186 ) This is a rewrite of the `UnusedVariables` lint to inline and simplify many of the dependent functions to try to improve the performance of this lint, which quite often shows up in perf reports. * The mvar assignment scanning is one of the most expensive parts of the process, so we do two things to improve this: * Lazily perform the scan only if we need it * Use an object-pointer hashmap to ensure that we don't have quadratic behavior when there are many mvar assignments with slight differences. * The dependency on `Lean.Server` is removed, meaning we don't need to do the LSP conversion stuff anymore. The main logic of reference finding is inlined. * We take `fvarAliases` into account, and union together fvars which are aliases of a base fvar. (It would be great if we had `UnionFind` here.) More docs will be added once we confirm an actual perf improvement. --------- Co-authored-by: Sebastian Ullrich <sebasti@nullri.ch>	2024-03-21 12:28:57 +00:00
Leonardo de Moura	173b956961	feat: reserved names (#3675 ) - Add support for reserved declaration names. We use them for theorems generated on demand. - Equation theorems are not private declarations anymore. - Generate equation theorems on demand when resolving symbols. - Prevent users from creating declarations using reserved names. Users can bypass it using meta-programming. See next test for examples.	2024-03-15 00:33:22 +00:00
Leonardo de Moura	855fbed024	fix: regression on match expressions with builtin literals (#3521 )	2024-02-27 18:49:44 +00:00
Henrik Böving	23e49eb519	perf: add prelude to all Lean modules	2024-02-18 14:55:17 -08:00
Sebastian Ullrich	90a516de09	chore: avoid libleanshared symbol limit (#3346 )	2024-02-15 11:39:44 +00:00
Henrik Böving	50d661610d	perf: LLVM backend, put all allocas in the first BB to enable mem2reg (#3244 ) Again co-developed with @bollu. Based on top of: #3225 While hunting down the performance discrepancy on qsort.lean between C and LLVM we noticed there was a single, trivially optimizable, alloca (LLVM's stack memory allocation instruction) that had load/stores in the hot code path. We then found: https://groups.google.com/g/llvm-dev/c/e90HiFcFF7Y. TLDR: `mem2reg`, the pass responsible for getting rid of allocas if possible, only triggers on an alloca if it is in the first BB. The allocas of the current implementation get put right at the location where they are needed -> they are ignored by mem2reg. Thus we decided to add functionality that allows us to push all allocas up into the first BB. We initially wanted to write `buildPrologueAlloca` in a `withReader` style so: 1. get the current position of the builder 2. jump to first BB and do the thing 3. revert position to the original. However the LLVM C API does not expose an option to obtain the current position of an IR builder. Thus we ended up at the current implementation which resets the builder position to the end of the BB that the function was called from. This is valid because we never operate anywhere but the end of the current BB in the LLVM emitter. 
The numbers on the qsort benchmark got improved by the change as expected, however we are not fully there yet: ``` C: Benchmark 1: ./qsort.lean.out 400 Time (mean ± σ): 2.005 s ± 0.013 s [User: 1.996 s, System: 0.003 s] Range (min … max): 1.993 s … 2.036 s 10 runs LLVM before aligning the types Benchmark 1: ./qsort.lean.out 400 Time (mean ± σ): 2.151 s ± 0.007 s [User: 2.146 s, System: 0.001 s] Range (min … max): 2.142 s … 2.161 s 10 runs LLVM after aligning the types Benchmark 1: ./qsort.lean.out 400 Time (mean ± σ): 2.073 s ± 0.011 s [User: 2.067 s, System: 0.002 s] Range (min … max): 2.060 s … 2.097 s 10 runs LLVM after this Benchmark 1: ./qsort.lean.out 400 Time (mean ± σ): 2.038 s ± 0.009 s [User: 2.032 s, System: 0.001 s] Range (min … max): 2.027 s … 2.052 s 10 runs ``` Note: If you wish to merge this PR independently from its predecessor, there is no technical dependency between the two, I'm merely stacking them so we can see the performance impacts of each more clearly.	2024-02-13 14:54:40 +00:00
Henrik Böving	06f73d621b	fix: type mismatches in the LLVM backend (#3225 ) Debugged and authored in collaboration with @bollu. This PR fixes several performance regressions of the LLVM backend compared to the C backend as described in #3192. We are now at the point where some benchmarks from `tests/bench` achieve consistently equal and sometimes ever so slightly better performance when using LLVM instead of C. However there are still a few testcases where we are lagging behind ever so slightly. The PR contains two changes: 1. Using the same types for `lean.h` runtime functions in the LLVM backend as in `lean.h` it turns out that: a) LLVM does not throw an error if we declare a function with a different type than it actually has. This happened on multiple occasions here, in particular when the function used `unsigned`, as it was wrongfully assumed to be `size_t` sized. b) Refuses to inline a function to the call site if such a type mismatch occurs. This means that we did not inline important functionality such as `lean_ctor_set` and were thus slowed down compared to the C backend which did this correctly. 2. While developing this change we noticed that LLVM does treat the following as invalid: Having a function declared with a certain type but called with integers of a different type. However this will manifest in completely nonsensical errors upon optimizing the bitcode file through `leanc` such as: ``` error: Invalid record (Producer: 'LLVM15.0.7' Reader: 'LLVM 15.0.7') ``` Presumably because the generated .bc file is invalid in the first place. Thus we added a call to `LLVMVerifyModule` before serializing the module into a bitcode file. This ended up producing the expected type errors from LLVM and aborting the bitcode file generation as expected. We manually checked each function in `lean.h` that is mentioned in `EmitLLVM.lean` to make sure that all of their types align correctly now. 
Quick overview of the fast benchmarks as measured on my machine, 2 runs of LLVM and 2 runs of C to get a feeling for how far the averages move: - binarytrees: basically equal performance - binarytrees.st: basically equal performance - const_fold: equal if not slightly better for LLVM - deriv: LLVM has 8% more instructions than C but same wall clock time - liasolver: basically equal performance - qsort: LLVM is slower by 7% instructions, 4% time. We have identified why the generated code is slower (there is a store/load in a hot loop in LLVM that is not in C) but not figured out why that happens/how to address it. - rbmap: LLVM has 3% fewer instructions and 13% less wall-clock time than C (woop woop) - rbmap_1 and rbmap_10 show similar behavior - rbmap_fbip: LLVM has 2% more instructions but 2% better wall time - rbmap_library: equal if not slightly better for LLVM - unionfind: LLVM has 5% more instructions but 4% better wall time Leaving out benchmarks related to the compiler itself as I was too lazy to keep recompiling it from scratch until we are on a level with C. Summing things up, it appears that LLVM has now caught up or surpassed the C backend in the microbenchmarks for the most part. Next steps from our side are: - trying to win the qsort benchmark - figuring out why/how LLVM runs more instructions for less wall-clock time. My current guesses would be measurement noise and/or better use of microarchitecture? - measuring the larger benchmarks as well	2024-02-13 10:57:35 +00:00
Joachim Breitner	368ead54b2	refactor: `termination_by` changes in stdlib	2024-01-10 17:27:35 +01:00
Kyle Miller	a2226a43ac	feat: encode `let_fun` using a `letFun` function (#2973 ) Switches from encoding `let_fun` using an annotated `(fun x : t => b) v` expression to a function application `letFun v (fun x : t => b)`. --------- Co-authored-by: Sebastian Ullrich <sebasti@nullri.ch>	2023-12-18 09:01:42 +00:00
Joachim Breitner	b1f2fcf758	fix: Escape `?` in C literal strings to avoid trigraphs This fixes #3829	2023-11-06 16:25:00 +01:00
Siddharth Bhat	0b37bad2cb	feat: split bitcode optimization and object file building to be outside lean	2023-11-02 23:21:47 +01:00


832 commits