Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix handling of backtracking stack with some loops #79353

Merged
merged 1 commit into from
Dec 7, 2022

Commits on Dec 7, 2022

  1. Fix handling of backtracking stack with some loops

    With both RegexOptions.Compiled and the Regex source generator, Regex greedy loops with
    - a minimum bound of at least 2
    - no child constructs that backtrack
    - and a child that's more than a one/notone/set (aka things that match a single character)
    
    are possibly leaving state on the backtracking stack when:
    - at least one iteration of the loop successfully matches
    - but not enough iterations match to make the loop successful such that matching the loop fails
    
    In that case, if a previous construct in the pattern pushed any state onto the backtracking stack such that it expects to be able to pop off and use that state upon backtracking to it, it will potentially pop the erroneously leftover state.  This can then cause execution to go awry, as it's getting back an unexpected value.  That can lead to false positives, false negatives, or exceptions such as an IndexOutOfRangeException due to trying to pop too much from the backtracking stack.
    
    We already have the ability to remember the backtracking stack position when we initially enter the loop so that we can reset to that position later on.  The fix is simply to extend that to also perform that reset when failing the match of such a loop in such circumstances.
    stephentoub committed Dec 7, 2022
    Configuration menu
    Copy the full SHA
    362bae7 View commit details
    Browse the repository at this point in the history