reimplement PCRE2_UNREACHABLE() assertions with a safer approach #490

carenas · 2024-09-22T10:27:49Z

Posted mainly for discussion on the implementation details and split on three patches to make discussion easier.

First patch is just a somehow unrelated cleanup that is useful to test the change with cmake

Second patch shows the proposed changes that implement safe assertions that could be also used in non debug builds as optimization hints, and adds a simpler version of the fix proposed in #489 on top as a prove of concept. Ideally, this series (once agreed upon, fully audited for safety and tested) could be rebased on top of that change.

Third patch is a workaround for a gcc bug that the proposed fix will trigger and that affects CI.

alexdowad · 2024-09-22T11:08:43Z

One comment: the PCRE2_UNREACHABLE() macro was intended to serve 2 different purposes:

In debug builds, it's a debugging tool which can help fuzzers, etc. to find bugs where control flow goes somewhere it shouldn't;
In release builds, it's an optimization hint for the compiler.

I think the implementation in this PR only meets point 1 but not point 2; but if that is not correct, please mention.

carenas · 2024-09-22T11:19:42Z

You are correct, there is no production implementation and therefore there is no optimization hint either (except for DEBUG builds by the mentioned attribute that is expected from abort(), but that of course is implementation dependent).

Interestingly enough from all the places where it was used, the only place where it would make a difference was the function fixed in #489, and the difference wouldn't be good for our users.

If the assertion was ever reached, and their library was built with a recent enough compiler in an x86 system, they will get a crash, instead of getting an error, because as you clearly pointed out, the "return" call itself would be removed at compile time; was that behaviour what was expected from that "optimization"?

alexdowad · 2024-09-22T11:43:41Z

If the assertion was ever reached, and their library was built with a recent enough compiler in an x86 system, they will get a crash, instead of getting an error, because as you clearly pointed out, the "return" call itself would be removed at compile time; was that behaviour what was expected from that "optimization"?

In answer to that question, what was expected was that if we assert a certain path is unreachable, that path should really be unreachable, and because the compiler is told that is unreachable, it can (in some cases) generate better code. For example, if you tell the compiler that the default: label in a switch statement is unreachable, it can assume that there will always be a matching label and doesn't need to generate code to handle the "default" case.

If the unreachable path is really unreachable, then there will be no crashes in production (because it will never be reached). On the other hand, if we are not able to guarantee, through whatever method, that our "unreachable" code is really unreachable... that is a problem. 😦 Since PCRE2 has significant installed base, it would be nice if the codebase was at a high enough level of quality that we could have full confidence that the "unreachable" paths are really unreachable. In that case, they could safely benefit from __builtin_unreachable's optimization hint.

However, if the codebase is not at that level of quality yet, then you are very right that PCRE2_UNREACHABLE must not do anything unsafe in production builds.

In any case, thanks very much for looking into this issue.

Another thought: It would be nice if, in production builds, all instances of PCRE2_ASSERT and PCRE2_UNREACHABLE would make public API functions return PCRE2_INTERNAL_ERROR if an assertion fails. That could even mitigate possible vulnerabilities by turning them into error returns instead. However, it would probably require some trickery with longjmp, which seems inadvisable.

alexdowad · 2024-09-22T11:45:11Z

Just one suggestion: I think the assertion failure message in this PR would be easier to understand if it said something like "Execution must not reach this point" or "Execution reached unexpected point".

Allow showing the internal value for PCRE2_DEBUG in the summary, just like is done for ./configure

The original asserts weren't very useful in debug mode as they were lacking information on where they were being triggered and were also unreliable and dangerous as they could result in important code being removed and trigger crashes. Instead of implementing one generic assert for both modes, build a more useful one for each mode, but to make sure that the non production paths are not being unnecessarily eliminated, allow for a parameter that could be used to indicate if that functionality is desired or not. Reinstate all original assertions to use that instead, and make sure to set the parameter to a safe value in the one that is known to cause problems with the previous code.

Most of the uses of `PCRE2_UNREACHABLE(0)` are at the end of `case` and therefore in non debug builds, the assertion "should" tell the compiler that a "fall back" is not possible, but the version of gcc used in Ubuntu 22.04 has a bug and will instead see the assertion as additional code that doesn't have a `break` after and therefore trigger `-Wimplicit-fallthrough` warnings instead. Update the job to use Ubuntu 24.04 that provides gcc 13.2 and that doesn't have the bug anymore, and while at it update all jobs to error on warnings so that failures will be more visible.

zherczeg · 2024-09-23T03:34:39Z

Reaching an unreachable code is always a big problem, since you cannot write tests to see what happens. Hence whatever we do, we probably have issues. I don't mind unreachable in matching code, because its purpose is high performance. For pattern compilation, we could use errors, although the problem is the same: we cannot test them.

carenas · 2024-09-23T04:24:21Z

I would prefer no unreachables in production, as they are additional code, their behaviour changes from compiler to compiler, and even between versions of the same compiler and have bugs, but agree that since they are meant to be really unreachable (assuming the original selection was done carefully enough) they could be used to maybe improve performance.

Anyway, as requested by Alex updated the series for further discussion.

zherczeg · 2024-09-23T04:26:36Z

Btw I would prefer an assertion before all internal errors. When I develop code, and get those errors, it is too hard to find their sources.

.github/workflows/build.yml

carenas · 2024-09-23T04:31:32Z

src/pcre2_util.h

 #endif

 #ifdef PCRE2_DEBUG
+
 #if defined(HAVE_ASSERT_H) && !defined(NDEBUG)
 #include <assert.h>
 #define PCRE2_ASSERT(x) assert(x)
 #elif defined(HAVE_STDLIB_H) && defined(HAVE_STDIO_H)


this additional checks were apparently introduced by mistake, which is why the final version might also remove them and why they are not being used in the new debug assert for PCRE2_UNREACHABLE() below.

the code was just reformatted so it can be easier to read and also compare with the proposed similar implementation.

alexdowad · 2024-09-23T05:19:10Z

agree that since they are meant to be really unreachable (assuming the original selection was done carefully enough) they could be used to maybe improve performance.

Carlo, how would this be:

We agree that an UNREACHABLE assertion should only be used when 1) the code has been carefully audited to make sure that the "unreachable" path is really unreachable, and 2) the code in question is heavily tested by the existing test suite.
UNREACHABLE assertions compile to an optimization hint in release builds, a failing assertion (i.e. abort) in debug builds.
For now, all calls to PCRE2_UNREACHABLE can be converted to a comment saying /* We should not get here */ and PCRE2_ERROR_INTERNAL return. I will go through one by one and check each site carefully before converting it back to an assertion.

Just a suggestion.

alexdowad · 2024-09-23T05:21:11Z

Btw I would prefer an assertion before all internal errors. When I develop code, and get those errors, it is too hard to find their sources.

Suggestion: Aside from PCRE2_ASSERT, can we add another macro which expands to a failing assertion (abort) in debug builds, but return PCRE2_ERROR_INTERNAL; in release builds?

This macro must only be used in functions which return int status code (zero for success, non-zero for error code).

alexdowad · 2024-09-23T05:24:15Z

src/pcre2_util.h

+
+#ifndef PCRE2_UNREACHABLE
+#ifdef PCRE2_DEADTRAP
+#define PCRE2_UNREACHABLE(d) if (d == 0) PCRE2_DEADTRAP()


Ah, I see. So the parameter to PCRE2_UNREACHABLE indicates whether we have confidence that the code is "really unreachable" or not.

Interesting.

Guess that is a way to put it, but what it really indicates to me is, "this unreachable should NEVER abort in non debug builds and should NEVER be considered as a hint for code elimination either, it is there ONLY to be used in debug builds to catch bugs, as all assertions should be.

carenas · 2024-09-23T06:13:08Z

can we add another macro

sure, fork away and go ahead, I promise not to rebase this until all discussion is settled and review your code to suggest improvements.

note though that having different assertion per type of function is going to be messy, since each function has different ways to report those errors, so IMHO might be easier to open code each as needed using PCRE2_ASSERT()

carenas mentioned this pull request Sep 22, 2024

Implement PCRE2_UNREACHABLE assertion for MS Visual C++ #465

Merged

carenas added 3 commits September 22, 2024 20:00

cmake: add PCRE2_DEBUG information to summary

d4e5ba0

Allow showing the internal value for PCRE2_DEBUG in the summary, just like is done for ./configure

carenas force-pushed the debugass branch from 91961b3 to cb04c63 Compare September 23, 2024 04:05

carenas changed the title ~~reimplement assertions as a debug helper~~ reimplement assertions with a safer approach Sep 23, 2024

zherczeg reviewed Sep 23, 2024

View reviewed changes

.github/workflows/build.yml Show resolved Hide resolved

carenas commented Sep 23, 2024

View reviewed changes

alexdowad reviewed Sep 23, 2024

View reviewed changes

carenas changed the title ~~reimplement assertions with a safer approach~~ reimplement PCRE2_UNREACHABLE() assertions with a safer approach Sep 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reimplement PCRE2_UNREACHABLE() assertions with a safer approach #490

reimplement PCRE2_UNREACHABLE() assertions with a safer approach #490

carenas commented Sep 22, 2024 •

edited

Loading

alexdowad commented Sep 22, 2024

carenas commented Sep 22, 2024 •

edited

Loading

alexdowad commented Sep 22, 2024

alexdowad commented Sep 22, 2024

zherczeg commented Sep 23, 2024

carenas commented Sep 23, 2024

zherczeg commented Sep 23, 2024

carenas Sep 23, 2024 •

edited

Loading

alexdowad commented Sep 23, 2024 •

edited

Loading

alexdowad commented Sep 23, 2024

alexdowad Sep 23, 2024

carenas Sep 23, 2024

carenas commented Sep 23, 2024

reimplement PCRE2_UNREACHABLE() assertions with a safer approach #490

Are you sure you want to change the base?

reimplement PCRE2_UNREACHABLE() assertions with a safer approach #490

Conversation

carenas commented Sep 22, 2024 • edited Loading

alexdowad commented Sep 22, 2024

carenas commented Sep 22, 2024 • edited Loading

alexdowad commented Sep 22, 2024

alexdowad commented Sep 22, 2024

zherczeg commented Sep 23, 2024

carenas commented Sep 23, 2024

zherczeg commented Sep 23, 2024

carenas Sep 23, 2024 • edited Loading

Choose a reason for hiding this comment

alexdowad commented Sep 23, 2024 • edited Loading

alexdowad commented Sep 23, 2024

alexdowad Sep 23, 2024

Choose a reason for hiding this comment

carenas Sep 23, 2024

Choose a reason for hiding this comment

carenas commented Sep 23, 2024

carenas commented Sep 22, 2024 •

edited

Loading

carenas commented Sep 22, 2024 •

edited

Loading

carenas Sep 23, 2024 •

edited

Loading

alexdowad commented Sep 23, 2024 •

edited

Loading