lean4-htt

Author	SHA1	Message	Date
Eric Paul	bb8e6801f0	chore: fix typo in parser docstring (#11753 ) Fix a typo in the docstring for checking the `lhsPrec`	2025-12-20 23:17:47 +00:00
Eric Wieser	9338aabed9	fix: move the monad argument for ForIn, ForIn', and ForM (#10204 ) This PR changes the interface of the `ForIn`, `ForIn'`, and `ForM` typeclasses to not take a `Monad m` parameter. This is a breaking change for most downstream `instance`s, which will will now need to assume `[Monad m]`. The rationale is that if the provider of an instance requires `m` to be a Monad, they should assume this up front. This makes it possible for the instanve to assume `LawfulMonad m` or some other stronger requirement, and also to provided a concrete instance for a particular `m` without assuming a non-canonical `Monad` structure on it. Zulip: [#lean4 > Monad assumptions in fields of other typeclasses @ 💬](https://leanprover.zulipchat.com/#narrow/channel/270676-lean4/topic/Monad.20assumptions.20in.20fields.20of.20other.20typeclasses/near/537102158)	2025-11-25 12:20:37 +00:00
Markus Himmel	fa5d08b7de	refactor: use `String.Slice` in `String.take` and variants (#11180 ) This PR redefines `String.take` and variants to operate on `String.Slice`. While previously functions returning a substring of the input sometimes returned `String` and sometimes returned `Substring.Raw`, they now uniformly return `String.Slice`. This is a BREAKING change, because many functions now have a different return type. So for example, if `s` is a string and `f` is a function accepting a string, `f (s.drop 1)` will no longer compile because `s.drop 1` is a `String.Slice`. To fix this, insert a call to `copy` to restore the old behavior: `f (s.drop 1).copy`. Of course, in many cases, there will be more efficient options. For example, don't write `f <\| s.drop 1 \|>.copy \|>.dropEnd 1 \|>.copy`, write `f <\| s.drop 1 \|>.dropEnd 1 \|>.copy` instead. Also, instead of `(s.drop 1).copy = "Hello"`, write `s.drop 1 == "Hello".toSlice` instead.	2025-11-18 16:13:48 +00:00
Markus Himmel	bf60550ce5	chore: rename `Substring` to `Substring.Raw` (#11154 ) This PR renames `Substring` to `Substring.Raw`. This is to signify its status as a second-class citizen (not deprecated, but no real plans for verification, like `String.Pos.Raw`) and to free up the name `Substring` for a possible future type `String.Substring : String -> Type` so that `s.Substring` is the type of substrings of `s`. The functions `String.toSubstring` and `String.toSubstring'` will remain for now for bootstrapping reasons.	2025-11-16 09:30:04 +00:00
Markus Himmel	b28daa6d60	chore: rename `String.endPos` -> `String.rawEndPos` (#10853 ) This PR renames `String.endPos` to `String.rawEndPos`, as in a future release the name `String.endPos` will be taken by the function that is currently called `String.endValidPos`.	2025-10-21 11:25:30 +00:00
Markus Himmel	dad541265c	refactor: move operations on `String.Pos.Raw` to the `String.Pos.Raw` namespace (#10735 ) This PR moves many operations involving `String.Pos.Raw` to a the `String.Pos.Raw` namespace with the eventual aim of freeing up the `String` namespace to contain operations using `String.ValidPos` (to be renamed to `String.Pos`) instead. This PR adds the `String.ValidPos.set` and `String.ValidPos.modify` functions. After this PR, `String.pos_lt_eq` is no longer a `simp` lemma. Add `String.Pos.Raw.lt_iff` as a `simp` lemma if your proofs break.	2025-10-18 12:12:55 +00:00
Markus Himmel	dca8d6d188	refactor: discipline around arithmetic of `String.Pos.Raw` (#10713 ) This PR enforces rules around arithmetic of `String.Pos.Raw`. Specifically, it adopts the following conventions: - Byte indices ("ordinals") in strings should be represented using `String.Pos.Raw` - Amounts of bytes ("cardinals") in strings should be represented using `Nat`. For example, `String.Slice.utf8ByteSize` now returns `Nat` instead of `String.Pos.Raw`, and there is a new function `String.Slice.rawEndPos`. Finally, the `HAdd` and `HSub` instances for `String.Pos.Raw` are reorganized. This is a breaking change. The `HAdd/HSub String.Pos.Raw String.Pos.Raw String.Pos.Raw` instances have been removed. For the use case of tracking positions relative to some other position, we instead provide `offsetBy` and `unoffsetBy` functions. For the use case of advancing/unadvancing a position by an arbitrary number of bytes, we instead provide `increaseBy` and `decreaseBy` functions. For offsetting/unoffsetting/advancing/unadvancing a position `p` by the size of a string `s` (resp. character `c`), use `s + p`/`p - s`/`p + s`/`p - s` (resp. `c + p`/`p - c`/`p + c`/`p - c`).	2025-10-09 07:47:45 +00:00
Leonardo de Moura	f9e140838e	feat: `hexnum` parser (#10716 ) This PR adds a new helper parser for implementing parsers that contain hexadecimal numbers. We are going to use it to implement anchors in the `grind` interactive mode.	2025-10-08 21:12:03 +00:00
Markus Himmel	81ea922025	chore: rename `String.Pos` to `String.Pos.Raw` (#10624 ) This PR renames `String.Pos` to `String.Pos.Raw`. After an abbreviated deprecation cycle, we will then rename `String.ValidPos` to `String.Pos`.	2025-10-01 07:45:24 +00:00
Robert J. Simmons	2231d9b488	feat: improve error messages for ambiguous `3.toDecmial` syntax (#10488 ) This PR changes the way that scientific numerals are parsed in order to give better error messages for (invalid) syntax like `32.succ`. Example: ```lean4 #check 32.succ ``` Before, the error message is: ``` unexpected identifier; expected command ``` This is because `32.` parses as a complete float, and `#check 32.` parses as a complete command, so `succ` is being read as the start of a new command. With this change, the error message will move from the `succ` token to the `32` token (which isn't totally ideal from my perspective) but gives a less misleading error message and corresponding suggestion: ``` unexpected identifier after decimal point; consider parenthesizing the number ```	2025-09-26 01:12:10 +00:00
David Thrane Christiansen	97464c9d7f	fix: trailing whitespace setting for string literals was ignored (#10389 ) This PR fixes a bug where string literal parsing ignored its trailing whitespace setting.	2025-09-15 09:51:56 +00:00
Kyle Miller	3e4fa12c72	feat: add `unicode(...)` parser syntax and `pp.unicode` option (#10373 ) This PR adds a `pp.unicode` option and a `unicode("→", "->")` syntax description alias for the lower-level `unicodeSymbol "→" "->"` parser. The syntax is added to the `notation` command as well. When `pp.unicode` is true (the default) then the first form is used when pretty printing, and otherwise the second ASCII form is used. A variant, `unicode("→", "->", preserveForPP)` causes the `->` form to be preferred; delaborators can insert `→` directly into the syntax, which will be pretty printed as-is; this allows notations like `fun` to use custom options such as `pp.unicode.fun` to opt into the unicode form when pretty printing. Additionally: - Adds more documentation for the `symbol` and `nonReservedSymbol` parser descriptions. - Adds documentation for the `infix`/`infixr`/`infixl`/`prefix`/`postfix` commands. - The parenthesizers for symbols are improved to backtrack if the atom doesn't match. - Fixes a bug where `&"..."` symbols aren't validated. This is partial progress for issue #1056. What remains is enabling `unicode(...)` for mixfix commands and then making use of it for core notation.	2025-09-14 04:40:03 +00:00
David Thrane Christiansen	3e2124bb48	feat: docstrings with Verso syntax (#10307 ) This PR upstreams the Verso parser and adds preliminary support for Verso in docstrings. This will allow the compiler to check examples and cross-references in documentation. After a `stage0` update, a follow-up PR will add the appropriate attributes that allow the feature to be used. The parser tests from Verso also remain to be upstreamed, and user-facing documentation will be added once the feature has been used on more internals.	2025-09-10 07:03:57 +00:00
Sebastian Ullrich	321af0e02b	fix: public structures with private field types under the module system (#10109 ) Fixes #10099	2025-08-25 14:48:23 +00:00
David Thrane Christiansen	c9727c2d19	feat: add a stop position field to the parser (#10043 ) This PR allows Lean's parser to run with a final position prior to the end of the string, so it can be invoked on a sub-region of the input. This has applications in Verso proper, which parses Lean syntax in contexts such as code blocks and docstrings, and it is a prerequisite to parsing the contents of Lean docstrings.	2025-08-23 18:29:51 +00:00
Sebastian Ullrich	ff1d3138bf	refactor: `module`-ize `Lean` (#9330 )	2025-07-25 12:02:51 +00:00
Rob23oba	e148871087	chore: fix spelling errors (#9175 ) (Almost) only typos in constant names and doc-strings were considered; grammar was not considered. Also, along others, `mkDefinitionValInferrringUnsafe` has been fixed :-)	2025-07-24 23:35:32 +00:00
Henrik Böving	09de5cd70e	refactor: remove Lean.RBMap usages (#9260 ) This PR removes uses of `Lean.RBMap` in Lean itself. Furthermore some massaging of the import graph is done in order to avoid having `Std.Data.TreeMap.AdditionalOperations` (which is quite expensive) be the critical path for a large chunk of Lean. In particular we can build `Lean.Meta.Simp` and `Lean.Meta.Grind` without it thanks to these changes. We did previously not conduct this change as `Std.TreeMap` was not outperforming `Lean.RBMap` yet, however this has changed with the new code generator.	2025-07-21 14:04:45 +00:00
Paul Reichert	70b4b2b36c	feat: polymorphic ranges (#8784 ) This PR introduces ranges that are polymorphic, in contrast to the existing `Std.Range` which only supports natural numbers. Breakdown of core changes: * `Lean.Parser.Basic`: Modified the number parser (`Lean.Parser.Basic`) so that it will only consider a single dot to be part of a decimal number. `1..` will no longer be parsed as `1.` followed by `.`, but as `1` followed by `..`. * The test `ellipsisProjIssue` ensures that `#check Nat.add ...succ` produces a syntax error. After introducing the new range notation (see below), it returns a different (less nice) error message. I updated the test to reflect the new error message. (The error message will become nicer as soon as a delaborator for the ranges is implemented. This is out of scope for this PR.) Breakdown of standard library changes: Modified modules: `Init.Data.Range.Polymorphic` (added), `Init.Data.Iterators`, `Std.Data.Iterators` * Introduced the type `Std.PRange` that is parameterized over the type in which the range operates and the shapes of the lower and upper bound. * Introduced a new notation for ranges. Examples for this notation are: `1...`, `1...=3`, `1...<3`, `1<...=2`, `...=3`. * Defined lots of typeclasses for different capabilities of ranges, depending on their shape and underlying type. * Introduced `Iter(M).size`. * Introduced the `Iter(M).stepSize n` combinator, which iterates over an iterator with the given step size `n`. It will drop `n - 1` values between every value it emits. * Replaced `LawfulPureIterator` with a new and better typeclass `LawfulDeterministicIterator`. * Simplified some lemma statements in the iterator library such as `IterM.toList_eq_match`, which unnecessarily matched over a `Subtype`, hindering rewrites due to type dependencies. Reasons for the concrete choice of notation: * `lean4-cli` uses `...`-based notation for the `Cmd` notation and it clashes with `...a` range notation. * test `2461` fails when using two-dot-based notation because of the existing `{ a.. }` notation.	2025-06-26 08:18:11 +00:00
Sebastian Ullrich	af1d8dd070	feat: `:= private` instance syntax	2025-05-28 10:18:04 +02:00
Eric Wieser	ae1ab94992	fix: replace bad simp lemmas for `Id` (#7352 ) This PR reworks the `simp` set around the `Id` monad, to not elide or unfold `pure` and `Id.run` In particular, it stops encoding the "defeq abuse" of `Id X = X` in the statements of theorems, instead using `Id.run` and `pure` to pass back and forth between these two spellings. Often when writing these with `pure`, they generalize to other lawful monads; though such changes were split off to other PRs. This fixes the problem with the current simp set where `Id.run (pure x)` is simplified to `Id.run x`, instead of the desirable `x`. This is particularly bad because the` x` is sometimes inferred with type `Id X` instead of `X`, which prevents other `simp` lemmas about `X` from firing. Making `Id` reducible instead is not an option, as then the `Monad` instances would have nothing to key on. --------- Co-authored-by: Sebastian Graf <sg@lean-fro.org> Co-authored-by: Kim Morrison <kim@tqft.net> Co-authored-by: Paul Reichert <6992158+datokrat@users.noreply.github.com>	2025-05-22 22:45:35 +00:00
euprunin	88078930a9	chore: fix spelling mistakes (#8324 ) Co-authored-by: euprunin <euprunin@users.noreply.github.com>	2025-05-14 06:52:16 +00:00
David Thrane Christiansen	a97813e11f	doc: review docstrings for syntax-related operators in manual (#7534 ) This PR adds missing `Syntax`-related docstrings and makes the existing ones consistent in style with the others.	2025-03-19 05:15:05 +00:00
Sebastian Ullrich	b3a8d5b04e	feat: async modes for environment access (#6852 ) This PR allows environment extensions to opt into access modes that do not block on the entire environment up to this point as a necessary prerequisite for parallel proof elaboration.	2025-01-31 16:35:50 +00:00
Kim Morrison	5b1c6b558a	feat: align `take/drop/extract` across `List/Array/Vector` (#6860 ) This PR makes `take`/`drop`/`extract` available for each of `List`/`Array`/`Vector`. The simp normal forms differ, however: in `List`, we simplify `extract` to `take+drop`, while in `Array` and `Vector` we simplify `take` and `drop` to `extract`. We also provide `Array/Vector.shrink`, which simplifies to `take`, but is implemented by repeatedly popping. Verification lemmas for `Array/Vector.extract` to follow in a subsequent PR.	2025-01-30 01:24:25 +00:00
Parth Shastri	0da3624ec9	fix: allow dot idents to resolve to local names (#6602 ) This PR allows the dot ident notation to resolve to the current definition, or to any of the other definitions in the same mutual block. Existing code that uses dot ident notation may need to have `nonrec` added if the ident has the same name as the definition. Closes #6601	2025-01-12 17:18:22 +00:00
Kyle Miller	63791f0177	feat: `_` separators in numeric literals (#6204 ) This PR lets `_` be used in numeric literals as a separator. For example, `1_000_000`, `0xff_ff` or `0b_10_11_01_00`. New lexical syntax: ```text numeral10 : [0-9]+ ("_"+ [0-9]+)* numeral2 : "0" [bB] ("_"* [0-1]+)+ numeral8 : "0" [oO] ("_"* [0-7]+)+ numeral16 : "0" [xX] ("_"* hex_char+)+ float : numeral10 "." numeral10? [eE[+-]numeral10] ``` Closes #6199	2024-12-08 22:23:12 +00:00
Kim Morrison	71122696a1	feat: rename Array.shrink to take, and relate to List.take (#5796 )	2024-10-21 23:35:32 +00:00
Mario Carneiro	ec98c92ba6	feat: @[builtin_doc] attribute (part 2) (#3918 ) This solves the issue where certain subexpressions are lacking syntax hovers because the hover text is not "builtin" - it only shows up if the `Parser` constant is imported in the environment. For top level syntaxes this is not a problem because `builtin_term_parser` will automatically add this doc information, but nested syntaxes don't get the same treatment. We could walk the expression and add builtin docs recursively, but this is somewhat expensive and unnecessary given that it's a fixed list of declarations in lean core. Moreover, there are reasons to want to control which syntax nodes actually get hovers, and while a better system for that is forthcoming, for now it can be achieved by strategically not applying the `@[builtin_doc]` attribute. Fixes #3842	2024-09-13 08:05:10 +00:00
Kyle Miller	a7338c5ad8	feat: make frontend normalize line endings to LF (#3903 ) To eliminate parsing differences between Windows and other platforms, the frontend now normalizes all CRLF line endings to LF, like [in Rust](https://github.com/rust-lang/rust/issues/62865). Effects: - This makes Lake hashes be faithful to what Lean sees (Lake already normalizes line endings before computing hashes). - Docstrings now have normalized line endings. In particular, this fixes `#guard_msgs` failing multiline tests for Windows users using CRLF. - Now strings don't have different lengths depending on the platform. Before this PR, the following theorem is true for LF and false for CRLF files. ```lean example : " ".length = 1 := rfl ``` Note: the normalization will take `\r\r\n` and turn it into `\r\n`. In the elaborator, we reject loose `\r`'s that appear in whitespace. Rust instead takes the approach of making the normalization routine fail. They do this so that there's no downstream confusion about any `\r\n` that appears. Implementation note: the LSP maintains its own copy of a source file that it updates when edit operations are applied. We are assuming that edit operations never split or join CRLFs. If this assumption is not correct, then the LSP copy of a source file can become slightly out of sync. If this is an issue, there is some discussion [here](https://github.com/leanprover/lean4/pull/3903#discussion_r1592930085).	2024-05-20 17:13:08 +00:00
David Thrane Christiansen	74e7886ce7	feat: custom error recovery in parser (#3413 ) Adds a simple error-recovery mechanism to Lean's parser, similar to those used in other combinator parsing libraries. Lean itself isn't very amenable to error recovery with this mechanism, as it requires global knowledge of the grammar in question to write recovery rules that don't break backtracking or `<\|>`. I only found a few opportunities. But for DSLs, this is really important. In particular, Verso parse errors interacted very badly with Lean parse errors in a way that required frequent "restart file" commands, but this mechanism allows me to both recover from Verso parse errors and to have Lean skip the rest of the file rather than repeatedly trying to parse it as Lean commands.	2024-02-21 14:29:54 +00:00
Henrik Böving	23e49eb519	perf: add prelude to all Lean modules	2024-02-18 14:55:17 -08:00
Kyle Miller	ae6fe098cb	feat: Rust-style raw string literals (#2929 ) For example, `r"\n"` and `r#"The word "this" is in quotes."#`. Implements #1422	2023-12-20 16:53:08 +00:00
Kyle Miller	bcbcf50442	feat: string gaps for continuing string literals across multiple lines (#2821 ) Implements "gaps" in string literals. These are escape sequences of the form `"\" newline whitespace+` that have the interpretation of an empty string. For example, ``` "this is \ a string" ``` is equivalent to `"this is a string"`. These are modeled after string continuations in [Rust](https://doc.rust-lang.org/beta/reference/tokens.html#string-literals). Implements RFC #2838	2023-12-07 08:17:00 +00:00
int-y1	8d7520b36f	chore: fix typos in comments	2023-10-08 10:46:05 +02:00
Joachim Breitner	b2d668c340	perf: Use flat ByteArrays in Trie (#2529 )	2023-09-20 13:22:37 +02:00
Sebastian Ullrich	241430aa03	perf: avoid calculating position, revert building `unexpected` message in `mkUnexpectedTokenErrors`	2023-09-12 11:42:24 +02:00
Sebastian Ullrich	6c0baf4aed	feat: support reporting range for parser errors, report ranges for expected token errors	2023-09-12 11:42:24 +02:00
Sebastian Ullrich	f4fc8b3e15	refactor: parser error setters	2023-09-12 11:42:24 +02:00
Mario Carneiro	2037094f8c	doc: document all parser aliases (#2499 )	2023-09-06 09:02:25 +00:00
Marcus Rossel	7ee7595637	doc: fix typos (#2467 )	2023-08-28 15:40:33 +10:00
Sebastian Ullrich	8fc1af650a	fix: symmetry in orelse antiquotation parsing	2023-07-28 08:36:33 -07:00
Sebastian Ullrich	eceac9f12a	perf: avoid syntax stack copy at `orelseFn`	2023-07-28 08:36:33 -07:00
Mario Carneiro	e64a2e1a12	fix: misleading indentation	2023-06-17 06:56:53 -07:00
Mario Carneiro	b139a97825	fix: hygieneInfo should not consume whitespace	2023-06-09 15:05:19 +02:00
Mario Carneiro	c20a7bf305	feat: `hygieneInfo` parser (aka `this` 2.0)	2023-06-02 16:19:02 +02:00
Sebastian Ullrich	9c9cc017df	fix: ignore empty character literals	2022-12-12 22:59:06 +01:00
Sebastian Ullrich	42a080fae2	fix: comments ending in `--/` Fixes #1883	2022-11-25 10:32:49 +01:00
Sebastian Ullrich	1f447efa54	doc: update Lean.Parser.Basic	2022-11-11 14:17:21 +01:00
Sebastian Ullrich	30dd28480d	fix: `suppressInsideQuot` inside quotation	2022-11-11 13:45:41 +01:00

1 2 3 4 5

224 commits