lean4-htt

Author	SHA1	Message	Date
Garmelon	6b7f0ad5fc	chore: check test output before exit code in piles (#12947 ) This improves the feedback when tests fail. Getting a diff is more useful than a vague exit code.	2026-03-17 16:34:21 +00:00
Garmelon	49715fe63c	chore: improve how test suite interacts with stages (#12913 ) The tests need to run with certain environment variables set that only cmake really knows and that differ between stages. Cmake could just set the variables directly when running the tests and benchmarks, but that would leave no good way to manually run a single benchmark. So cmake generates some stage-specific scripts instead that set the required environment variables. Previously, those scripts were sourced directly by the individual `run_` scripts, so the env scripts of different stages would overwrite each other. This PR changes the setup so they can instead be generated next to each other. This also simplifies the `run_` scripts themselves a bit, and makes `tests/bench/build` less of a hack.	2026-03-16 15:20:03 +00:00
Wojciech Nawrocki	47b3be0524	feat: update RPC wire format (#12905 ) This PR adjusts the JSON encoding of RPC references from `{"p": "n"}` to `{"__rpcref": "n"}`. Existing clients will continue to work unchanged, but should eventually move to the new format by advertising the `rpcWireFormat` client capability. - This came up in leanprover/vscode-lean4#712. - The new encoding is far less likely to clash with real-world names, and is now documented as a "reserved internal name". - At 8 bytes vs. 1 byte, it incurs a ~5% size increase on the JSON size of interactive terms, e.g. from 868KiB to 903KiB on the leanprover/vscode-lean4#500 test. - Make `deriving RpcEncodable` throw an error when it encounters the reserved name. We cannot easily guard against clashes in user-provided JSON, however, so we just assume it does not clash. - Add a notion of RPC wire format with corresponding `rpcWireFormat` client and server capabilities. The format before this PR is now called `v0`, whereas here we implement `v1`. Existing clients should eventually implement compatibility with `v1` (because doing so fixes the above bug), but will continue to work in the meantime. The format may be revised again in the future (but we don't expect to revise it so often that semver would be useful). - Document everything. ## Alternative designs (abandoned for now) - Option 1. Add a method `$/lean/rpc/metadata` which, given the name of an RPC method `foo`, returns metadata containing a description of where the RPC refs in any return value of `foo` would be (essentially a description of the structure of the return type). - Option 2. Wrap every response to `$/lean/rpc/call` in such metadata. This would be a different change to the wire format. - To implement this in an extensible way, we extend `RpcEncodable` by a `refPaths` field. But how does `refPaths` describe where the refs are? - Option A. Emit the code of a JS method that extracts the refs. This is maybe simplest, but it would leave non-JS clients (e.g. `lean.nvim`) behind. - Option B. Give the description in some query language. The query language must be able to describe paths into arbitrary inductive types. - The most popular option, [JSONPath](https://www.rfc-editor.org/rfc/rfc9535), seemingly cannot describe non-uniform paths (e.g. both the `a`s in `{a: 1, {b: {a: 2}}}`). - [JMESPath](https://jmespath.org/) can describe non-uniform paths, and has 'fully compliant' implementations in many languages, but doesn't seem to handle recursive paths. - The most expressive option is [jq](https://github.com/jqlang/jq), but the most popular way to run it is via an Emscripten WASM blob in [jq-web](https://github.com/fiatjaf/jq-web) which seems heavy. There is [jqjs](https://github.com/mwh/jqjs) as well; I'm not sure how production-ready that is.	2026-03-13 23:46:16 +00:00
Garmelon	6a2a884372	chore: migrate pkg tests (#12889 ) Also refactor util.sh in the process, so test scripts become easier to write (inspired in part by lake's test suite).	2026-03-11 18:55:46 +00:00
Kim Morrison	e01cbf2b8f	feat: add structured TraceResult to TraceData (#12698 ) This PR adds a `result? : Option TraceResult` field to `TraceData` and populates it in `withTraceNode` and `withTraceNodeBefore`, so that metaprograms walking trace trees can determine success/failure structurally instead of string-matching on emoji. `TraceResult` has three cases: `.success` (checkEmoji), `.failure` (crossEmoji), and `.error` (bombEmoji, exception thrown). An `ExceptToTraceResult` typeclass converts `Except` results to `TraceResult` directly, with instances for `Bool` and `Option`. `TraceResult.toEmoji` converts back to emoji for display. This replaces the previous `ExceptToEmoji` typeclass — `TraceResult` is now the primary representation rather than being derived from emoji strings. `withTraceNodeBefore` (used by `isDefEq`) uses `ExceptToTraceResult.toTraceResult` directly, correctly handling `Bool` (`.ok false` = failure) and `Option` (`.ok none` = failure), with `Except.error` mapping to `.error`. For `withTraceNode`, `result?` defaults to `none`. Callers can pass `mkResult?` to provide structured results; when set, the corresponding emoji is auto-prepended to the message. Motivated by mathlib's `#defeq_abuse` diagnostic tactic (https://github.com/leanprover-community/mathlib4/pull/35750) which currently string-matches on emoji to determine trace node outcomes. See https://leanprover.zulipchat.com/#narrow/channel/113488-general/topic/backward.2EisDefEq.2ErespectTransparency 🤖 Prepared with Claude Code --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-10 02:42:57 +00:00
Garmelon	a3cb39eac9	chore: migrate more tests to new test suite (#12809 ) This PR migrates most remaining tests to the new test suite. It also completes the migration of directories like `tests/lean/run`, meaning that PRs trying to add tests to those old directories will now fail.	2026-03-06 16:52:01 +00:00

6 commits