History

Paul Reichert 68ea28c24f feat: `Array.mergeSort` (#12385 ) This PR implements a merge sort algorithm on arrays. It has been measured to be about twice as fast as `List.mergeSort` for large arrays with random elements, but for small or almost sorted ones, the list implementation is faster. Compared to `Array.qsort`, it is stable and has O(n log n) worst-case cost. Note: There is still a lot of potential for optimization. The current implementation allocates O(n log n) arrays, one per recursive call. --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>		2026-03-06 13:18:13 +00:00
..
.gitignore
Bench.lean	feat: `Array.mergeSort` (#12385 )	2026-03-06 13:18:13 +00:00
bench.py	feat: `Array.mergeSort` (#12385 )	2026-03-06 13:18:13 +00:00
bench2.py	feat: `Array.mergeSort` (#12385 )	2026-03-06 13:18:13 +00:00
lakefile.lean
lean-toolchain	chore: relative lean-toolchains (#12652 )	2026-02-25 10:23:35 +00:00
README.md	feat: `Array.mergeSort` (#12385 )	2026-03-06 13:18:13 +00:00

README.md

mergeSortBenchmark

Benchmarking List.mergeSort and Array.mergeSort.

Run lake exe mergeSort k to run a benchmark on collections of size k * 10^5. This reports the total and average time (in milliseconds) to sort:

an already sorted list/array
a reverse sorted list/array
an almost sorted list/array
and a random list/array with duplicates

The benchmark also reports the comparative performance between the two implementations.

Performance Characteristics

In many cases, List.mergeSort is faster. However, for large, random collections (>= 600k elements), Array.mergeSort scales better.

Run python3 bench.py to run this for k = 1, .., 10, and calculate a best fit of the model A * k + B * k * log k to the observed runtimes. (This isn't really what one should do: fitting a log to data across a single order of magnitude is not helpful.)

More detailed comparisons can be generated using python3 bench2.py.