kakoune.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2025-07-08	Replace std::unique_ptr with a custom implementation	Maxime Coste
	<memory> is a costly header we can avoid by just implementing UniquePtr ourselves, which is a pretty straightforward in modern C++, this saves around 10% of the compilation time here.
2025-07-07	Copy instruction to local variable in step_current_thread	Maxime Coste
	This helps the compiler realize that data cannot change and does not need reloading, improving codegen slightly.
2025-07-07	Use uint32_t for DualThreadStack indices	Maxime Coste

2025-07-07	Add a CharRange regex op to optimize the common simple range case	Maxime Coste
	Instead of jumping into the general CharClass code, detect simple [a-z] style ranges and use a specific op. Also detect when a range can be converted to ignore case
2025-07-07	Avoid branches in ThreadedRegexVM::DualThreadStack iteration	Maxime Coste
	decrement and post_increment do not get cmov optimised as expected, we can avoid this altogether by taking advantage of the fact that capacity is always a power-of-two and we can hence use a bitwise and we can use a bitwise and to loop around capacity.
2025-02-04	Revert "Use uint64_t for regex step"	Maxime Coste
	This got pushed by accident This reverts commit d92496449d0c9655253ad16363685bb8446dc582.
2025-01-22	Use uint64_t for regex step	Maxime Coste
	Its unclear that maintaining a small instruction size outweigh the cost of handling wrapping of the current_step every 64K codepoints, this makes the code simpler.
2024-12-10	Code style tweak in regex_impl	Maxime Coste

2024-12-09	Rework split between exec and exec_program method	Maxime Coste
	Split last iteration out of the loop so that optimizer can elide most comparisons between pos and config.end as its always different in the loop and equal at last call.
2024-12-09	Tweak inlining around thread stack push/pulls	Maxime Coste
	Ensure push/pulls operations are inlined except for the uncommon grow.
2024-12-05	Fix parameter passing in find_next_start	Maxime Coste

2024-12-04	Various small code simplifications/tweaks in ThreadedRegexVM	Maxime Coste

2024-12-01	Add specific start desc optimization for single possible start byte	Maxime Coste
	Use tighter codegen for that pretty common use case.
2024-11-28	Raise the regex idle function call period to every 16M codepoint	Maxime Coste

2024-11-04	Fix backward regex search ending in DOTALL	Johannes Altmanninger
	I noticed that reverse searches ending in "." stopped working in version 2024.05.08: kak -n -e "exec %{%cfoobar<ret><esc>gj<a-/>foo.<ret>}' Bisects ca7471c25 (Compute StartDesc with an offset to effective start, 2024-03-18) which updated the find_next_start() logic for the forward case but not for backward case. Add a symmetrical change and test case, that seems to fix it. Not 100% sure if this is correct but feels so.
2024-08-12	Reduce headers dependency graph	Maxime Coste
	Move more code into the implementation files to reduce the amount of code pulled by headers.
2024-08-12	Remove void_t and use requires instead	Maxime Coste

2024-06-15	Small code style tweak	Maxime Coste

2024-06-15	Store instruction pointers directly in ThreadedRegexVM::Thread	Maxime Coste
	The previous tradeoff of having a very small Thread struct is not necessary anymore as we do not memcpy Threads on swap_next since d708b77186c1685dcbd2298246ada7d204acec2f. This requires offsets to be used instead of indices for jump/split ops.
2024-05-31	Small regex code cleanup	Maxime Coste

2024-04-01	Add missing <bit> include	Maxime Coste

2024-03-22	Match Op declaration order in switches	Maxime Coste

2024-03-22	Make CompiledRegex not a RefCountable	Maxime Coste
	Keep this closer to the point of use, avoid pull ref_ptr.hpp into regex_impl.hpp
2024-03-21	Compute StartDesc with an offset to effective start	Maxime Coste
	This means `.{2,4}foo` will now consider 4 or less before f as a start candidate instead of every characters
2024-03-21	Only push a first instruction thread when on a potential start	Maxime Coste
	There is no need to push threads for each codepoint when we know they will fail as the current codepoint is not a start candidate.
2024-03-15	Revert "Always allocate saves"	Maxime Coste
	This crashes in unit tests This reverts commit cde5f5a25838b2c9a2bf198b819a58d723b434a3.
2024-03-15	Always allocate saves	Maxime Coste
	This sometimes allocates saves too eagerly, but it removes a branch in release saves that executes on every thread failing which seems slightly better.
2024-03-13	Avoid clearing iterator buffer on saves allocation	Maxime Coste
	When creating a new save, we had to clear all iterators to have valid values. This operation is relatively costly because it gets optimized to a memset whose call overhead is pretty high (as we usually have less than 32 bytes to clear). Bypass this by storing a bitmap of valid iterators.
2024-03-13	Simplify and accelerate start desc map	Maxime Coste
	Store values for all possible bytes and fill utf8 multi byte start values when necessary.
2024-03-12	Small cleanup	Maxime Coste

2024-03-11	Simplify Split regex op handling by swapping target	Maxime Coste

2024-03-11	flatten ThreadedRegexVM::codepoint	Maxime Coste
	Profiling shows that this does not always get the utf8::read_codepoint call inlined and that almost doubles the time spent in the function.
2024-03-07	Reduce Save access indirections	Maxime Coste
	Most Save access are to modify the refcount. Now that the freelist is index based it is not necessary to keep Save objects at fixed memory locations.
2024-03-05	Slight simplification of ThreadedRegexVM::exec	Maxime Coste
	Remove redundant checking for end and double indirection to get instructions pointer.
2024-02-12	Early reject regex instructions that were already scheduled this step	Maxime Coste

2024-02-11	Do not decode utf8 while looking for next regex match start candidate	Maxime Coste
	If the first byte in the multi-byte utf8 sequence does not match, it means the "other" character is not set, so none of the sequence byte will match (as they are all with the MSB set). This tightens the critical loop which ends up running faster in most cases.
2023-06-27	Unbreak build on ppc	Sergey Fedorov
	Fixes: https://github.com/mawww/kakoune/issues/4937
2023-05-21	Add an idle callback to be called regularly while regex matching	Maxime Coste
	This paves the way towards being able to cancel long regex matching operations
2023-03-13	Grow dual thread stack after pushing a thread on the next queue	Maxime Coste
	The previous code was assuming it was fine to push_next without growing, which used to be the case with the previous implementation because we always have poped the current thread that we try to push. However now that we use a ring-buffer, m_next_begin == m_next_end can either mean full, or empty. We solve this by assuming it means empty and never allowing the buffer to become full, which means we need to grow after pushing to next if we get full. Fixes #4859
2023-02-19	Only decode current codepoint once per step	Maxime Coste
	Instead of potentially decoding for each thread, always decode as its only slightly slower than finding next codepoint (which will be necessary anyway) and pass the codepoint to each thread.
2023-02-19	Remove instructions from ExecConfig	Maxime Coste
	We can just compute whenever we reset last_step, which does not happen often and we know `forward` at compile time anyway
2023-02-19	Optimize Regex CharacterClass matching	Maxime Coste
	Take advantage of ranges sorting to early out, make the logic inline.
2023-02-14	Fix broken corner cases in DualThreadStack::grow_ifn	Maxime Coste
	We only grow when the ring buffer is full, which allows for a nice simplification of the code. Tell grow_ifn if we pushed in current or next so that we can distinguish between filled by next or filled by current when m_current == m_next_begin
2023-02-14	Refactor DualThreadStack as a RingBuffer	Maxime Coste
	Instead of two stacks growing from the two ends of a buffer, use a ring buffer growing from the same mid spot. This avoids the costly memory copy every step when we set next threads as the current ones.
2023-02-13	Remove scheduled optimization from ThreadedRegexVM	Maxime Coste
	This does not seem to actually speed up execution as threads will be dropped on next step anyway
2023-01-23	Fix incorrect use of subject end/begin in regex execution	Maxime Coste
	This could lead to reading past subject string end in certain conditions Fixes #4794
2022-08-20	Slight code style tweak	Maxime Coste

2022-08-20	Remove unnecessary utf8 decoding when looking for EOL in regex	Maxime Coste

2022-08-20	Refactor RegionsHighlighter to share regexes	Maxime Coste
	Instead of storing regexes in each regions, move them to the core highlighter in a hash map so that shared regexes between different regions are only applied once per update instead of once per region Also change iteration logic to apply all regex together to each changed lines to improve memory locality on big buffers. For the big_markdown.md file described in #4685 this reduces initial display time from 3.55s to 2.41s on my machine.
2022-08-05	Reuse existing character classes when possible in regex	Maxime Coste