feat: blocking detection #2709

bruno-garcia · 2023-10-09T21:07:30Z

Just playing around after a thread on Twitter (https://twitter.com/brungarc/status/1711484812702171309). Based on Ben Adams' BlockingDetector.

Suppressing Blocking Detection

If users are intentionally making blocking calls in their code for some reason, the blocking detection can be suppressed (temporarily) via:

using (new SuppressBlockingDetection())
{
    // Some blocking code
}
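A minimal sketch of how a disposable suppression scope like this could work, assuming a per-thread depth counter. `BlockingSuppression` and `SuppressionScope` are hypothetical names for illustration, not the types added in this PR.

```csharp
using System;

// Hypothetical illustration only; not the SDK's implementation.
public static class BlockingSuppression
{
    // One counter per thread, so nested scopes on the same thread compose.
    [ThreadStatic] private static int _depth;

    // A detector would check this before reporting a blocking call.
    public static bool IsSuppressed => _depth > 0;

    public static IDisposable Begin() => new SuppressionScope();

    private sealed class SuppressionScope : IDisposable
    {
        private bool _disposed;

        public SuppressionScope() => _depth++;

        public void Dispose()
        {
            if (_disposed) return; // make a double Dispose harmless
            _disposed = true;
            _depth--;              // assumes Dispose runs on the creating thread
        }
    }
}
```

Because the counter is thread-static, the scope has to be disposed on the thread that created it, which a synchronous `using` block guarantees.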

Volume Concerns

If blocking calls exist on a hot path, we may end up sending lots of events.

We created a solution to this in #3174, but decided to roll it back in this PR because of concerns about using metrics.

Top Frame Decision

Bruno had experimented, in the initial code, with the number of frames to skip when capturing a stack trace. The thinking was that it might be nice to see one or two calls into the Sentry blocking detection code. This turns out to be problematic when none of the stack frames are InApp, since Sentry doesn't know which frame to highlight. So I've reverted to Ben Adams' original code that leaves the culprit blocking call at the top of the stack. This makes it easy to find regardless of whether there are InApp frames present in the stack trace or not.

Grouping

There were concerns about grouping and fingerprinting. By default, Sentry uses the stack trace as the fingerprint, and we can get different stack traces depending on how the async state machine schedules the task. However, this is also related to the top frame decision: given that we're now skipping any frames related to the TPL, the grouping problem has gone away.
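To illustrate the idea of leaving the culprit call at the top of the stack, here is a small sketch that drops leading "noise" frames by prefix. `StackTrimmer` and the string-based frame representation are assumptions for illustration; the PR's own logic lives in BlockingMonitor.cs.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical helper, not the PR's code.
public static class StackTrimmer
{
    // Skips frames from the top of the stack while they start with any of the
    // given prefixes, so the first remaining frame is the first non-noise frame.
    public static IReadOnlyList<string> TrimLeading(
        IEnumerable<string> frames, params string[] noisePrefixes)
        => frames
            .SkipWhile(f => noisePrefixes.Any(
                p => f.StartsWith(p, StringComparison.Ordinal)))
            .ToList();
}
```

Only leading frames are removed; a matching frame further down the stack is kept, so the rest of the trace stays intact.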

Resources

It's all about the SynchronizationContext - Stephen Cleary
ASP.NET SynchronizationContext - Stephen Cleary
ExecutionContext vs SynchronizationContext - Stephen Toub

Apparently you have to be called Stephen to understand Synchronization 😜

github-actions · 2023-10-09T21:08:40Z

	Messages
📖	Do not forget to update Sentry-docs with your feature once the pull request gets approved.

Generated by 🚫 dangerJS against ffef982

bruno-garcia · 2024-02-05T16:34:07Z

Since I haven't had a chance to look into this for a long time, some notes I have:

Stack trace in SentryEvent.Thread and also in Sentry.Exception? Seems like it's done so the UI renders in some way
Some thread info only shows if there are more than 1 thread in the array (Create 1 manually to see it in the UI)

If all frames are InApp=false, it groups by the whole stack trace and creates a new issue for each event. Group by URL in that case? (Also risky because of parameterized URLs.)

jamescrosswell · 2024-02-07T02:38:58Z

@bruno-garcia are we sure we want this as the call stack?

I know you intentionally changed skipframes from 3:6 to 0:3, but it seems weird to have some Sentry code at the top of the stack (rather than highlighting the culprit code, i.e. the InApp frame).

bruno-garcia · 2024-02-07T03:17:48Z

Braindump during chat with @jamescrosswell

- Concern with volume: if there's a blocking call on a hot path (like GET /product), this will fire on each request and burn through quota.
  - We could keep state on the client and fire it once.
  - Folks can use the error sample rate, or sample in beforeSend specifically for this error.
- Use metrics and code locations? But here we really need the full stack trace.
- What should be the top frame? In other words, how many frames to skip? (Original code: DetectionSource.SynchronizationContext ? 3 : 6.)
  - Sentry focuses on (expands) the first InApp frame, so it's OK to include some framework code that's called from it. But it's not clear how many frames to include or where to stop.
- The SDK itself triggers at least 2 events on the first request. This only happens once, but if we can, we need to fix these before going ahead.
  - If this is intentional, we need to opt out of reporting this one.
- We need a mechanism to drop them other than giving folks code to write (e.g. beforeSend snippets); see the point above. We need an API to drop certain errors.
- SentryThreads vs SentryException. I found it odd, looking at events, that we had stack traces on both (duplicated), and the UI rendered differently depending on which one was used. I tried to capture this in this note (feat: blocking detection #2709 (comment)).
  - Not really related to the blocking detector, but it came up while doing this.
  - To better understand this, uncomment the code in the PR (SentryThreads = new []{new SentryThread) and see how the UI changes. Also inspect the JSON and note that we have 1 dupe stack trace, matching by ThreadId IIRC.
- Grouping problems. We'd need to make sure each detected blocking call groups properly. If all frames are inApp=false, the async stack trace machinery can change and result in different groups. Not sure how to solve this; we might need to fingerprint as BlockingDetectorNotInAPp or group by URL in that case? (Also risky because of parameterized URLs; transaction might work if it's not based on the raw URL.)
- Tests: the ones from the lib might help, but mainly we need to test how it groups, etc. How does it work within Sentry?
- Documentation: not all things are detected, as Ben Adams listed in detail in the original code's repo.
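The "keep state on the client and fire it once" idea from the notes above could look roughly like this. It is a sketch under assumptions: `ReportOnceFilter` is a hypothetical name, and the choice of key (for example, the culprit frame's method name) is left to the caller.

```csharp
using System.Collections.Concurrent;

// Hypothetical helper, not part of the SDK.
public sealed class ReportOnceFilter
{
    // Keys already reported; the byte value is unused.
    private readonly ConcurrentDictionary<string, byte> _seen =
        new ConcurrentDictionary<string, byte>();

    // Returns true only the first time a given call-site key is observed.
    // TryAdd is atomic, so concurrent requests report at most once per key.
    public bool ShouldReport(string callSiteKey) => _seen.TryAdd(callSiteKey, 0);
}
```

The set grows with the number of distinct keys and is never trimmed here, so a real implementation would likely want a bound or an expiry.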

jamescrosswell · 2024-02-28T02:17:42Z

Won't this potentially hide useful information about where the problem is? We could mark these on the backend as grouping=false although I don't have a lot of details on how to do that. @adinauer I believe did some stuff for Java and might know (I believe getsentry/sentry#45185)

These frames all come after the blocking call so I don't think they're relevant to the event.

adinauer · 2024-02-28T06:54:10Z

@bruno-garcia @jamescrosswell glancing at the code here this seems different from what I changed for Java SDK. We removed some auto generated IDs from class names in the stacktrace so they don't mess up grouping in the issue you linked.

src/Sentry/Ben.BlockingDetector/StaticRecursionTracker.cs

bruno-garcia · 2024-03-01T23:12:14Z

@bruno-garcia @jamescrosswell glancing at the code here this seems different from what I changed for Java SDK. We removed some auto generated IDs from class names in the stacktrace so they don't mess up grouping in the issue you linked.

Yes, it's a totally different use case. But the goal here is to send frames that we'd like to show but not group by, which can be achieved by changing the server similarly to how you did it.

bruno-garcia · 2024-03-05T22:25:17Z

@vaind added u as reviewer since @jamescrosswell wrote the code since the initial push I made so I guess both of us are like "not sure I should review or merge this" :D

src/Sentry/Ben.BlockingDetector/README.md

bitsandfoxes · 2024-03-06T11:18:20Z

src/Sentry/Internal/MainExceptionProcessor.cs

@@ -155,7 +155,7 @@ private SentryException BuildSentryException(Exception exception, int id, int? p
            sentryEx.Mechanism = mechanism;
        }

-        sentryEx.Stacktrace = SentryStackTraceFactoryAccessor().Create(exception);
+        sentryEx.Stacktrace ??= SentryStackTraceFactoryAccessor().Create(exception);


What's the implication here? Previously, the stacktrace would get overwritten. Why is this change necessary? Where would the stacktrace come from?

The stack trace is already set in the event. We get it from the blocking detection logic so we won't want it overwritten

Suggested change

-        sentryEx.Stacktrace ??= SentryStackTraceFactoryAccessor().Create(exception);
+        sentryEx.Stacktrace = SentryStackTraceFactoryAccessor().Create(exception);

The sentryEx gets newly instantiated in this method and the StackTrace has to be set explicitly. I think this change does nothing.

We get it from the blocking detection logic so we won't want it overwritten

That all happens in the event processor.
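The point about `??=` in this thread can be checked in isolation. The sketch below uses a hypothetical `ExceptionInfo` class as a stand-in for a freshly created object with an unset property; it is not the SDK's `SentryException`.

```csharp
#nullable enable
using System;

// Hypothetical stand-in type, for illustration only.
public sealed class ExceptionInfo
{
    public string? Stacktrace { get; set; }
}

public static class NullCoalescingDemo
{
    // Assigns only when the property is still null, then returns its value.
    public static string AssignIfNull(ExceptionInfo info, Func<string> create)
    {
        info.Stacktrace ??= create();
        return info.Stacktrace;
    }
}
```

On a newly instantiated object the property is null, so `??=` behaves exactly like `=`. It only differs when something has set the property beforehand, in which case the factory is not invoked at all.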

src/Sentry/Internal/MainSentryEventProcessor.cs

bitsandfoxes · 2024-03-06T12:18:19Z

src/Sentry/Ben.BlockingDetector/BlockingMonitor.cs

+            // Skip frames relating to the async state machine
+            || frameInfo?.StartsWith("System.Threading") == true;


I think we can skip doing this and improve grouping on the server side for this kind of stuff in general.

@bitsandfoxes could you link to the associated PR where this logic gets changed on the server?

Also, if we do skip this here, we'll need to make some changes to set a custom fingerprint that excludes all this stuff (since by default the entire stacktrace is used as the fingerprint).

I'll follow up on this. The blocking detection is opt-in and the improvements to the grouping are not blocking this PR. #3202

I'll follow up on this. The blocking detection is opt-in and the improvements to the grouping are not blocking this PR. #3202

If we remove the logic above from the client, the grouping will be broken unless some similar logic exists on the server.

I had a chat to Bruno though... Not sure we need to be doing this on the server.
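For the custom fingerprint option mentioned in this thread, one possible shape is a hash over only the frames that stay stable between runs. This is a sketch under assumptions: `BlockingFingerprint` is a hypothetical helper, frames are represented as plain strings, and how the resulting value would be attached to an event is not shown.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Security.Cryptography;
using System.Text;

// Hypothetical helper, not part of the SDK.
public static class BlockingFingerprint
{
    // Hashes the stack trace while ignoring frames with the given prefix,
    // so traces that differ only in those frames get the same value.
    public static string Compute(
        IEnumerable<string> frames, string ignoredPrefix = "System.Threading")
    {
        var stable = frames.Where(
            f => !f.StartsWith(ignoredPrefix, StringComparison.Ordinal));
        var joined = string.Join("\n", stable);

        using var sha = SHA256.Create();
        var hash = sha.ComputeHash(Encoding.UTF8.GetBytes(joined));
        // Hex-encode in a way that also works on older target frameworks.
        return BitConverter.ToString(hash).Replace("-", "").ToLowerInvariant();
    }
}
```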

src/Sentry/Ben.BlockingDetector/BlockingMonitor.cs

bitsandfoxes

This looks really good to me!

bruno-garcia · 2024-03-08T03:10:27Z

Thanks folks!

jamescrosswell · 2024-03-10T21:28:37Z

src/Sentry.AspNetCore/SentryMiddleware.cs

+                if (_options.CaptureBlockingCalls && _monitor is not null)
+                {
+                    var syncCtx = SynchronizationContext.Current;
+                    SynchronizationContext.SetSynchronizationContext(syncCtx == null ? _detectBlockingSyncCtx : new DetectBlockingSynchronizationContext(_monitor, syncCtx));


@bruno-garcia won't the SynchronizationContext in ASP.NET Core always be null? Is this code here just in case SDK users have implemented a custom sync context for some reason?
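For readers unfamiliar with the wrapping pattern in the diff above, here is a reduced sketch of a synchronization context that is notified of blocking waits and otherwise delegates to an optional inner context. `NotifyingSynchronizationContext` and its `onWait` callback are hypothetical; the PR's `DetectBlockingSynchronizationContext` takes a monitor instead and may differ in other details.

```csharp
#nullable enable
using System;
using System.Threading;

// Hypothetical wrapper, for illustration only.
public sealed class NotifyingSynchronizationContext : SynchronizationContext
{
    private readonly Action _onWait;
    private readonly SynchronizationContext? _inner;

    public NotifyingSynchronizationContext(Action onWait, SynchronizationContext? inner = null)
    {
        _onWait = onWait;
        _inner = inner;
        // Ask the runtime to route blocking waits on this context through Wait().
        SetWaitNotificationRequired();
    }

    public override void Post(SendOrPostCallback d, object? state)
    {
        if (_inner != null) _inner.Post(d, state);
        else base.Post(d, state);
    }

    public override void Send(SendOrPostCallback d, object? state)
    {
        if (_inner != null) _inner.Send(d, state);
        else base.Send(d, state);
    }

    public override int Wait(IntPtr[] waitHandles, bool waitAll, int millisecondsTimeout)
    {
        _onWait(); // a detector would record the blocking call here
        return _inner != null
            ? _inner.Wait(waitHandles, waitAll, millisecondsTimeout)
            : base.Wait(waitHandles, waitAll, millisecondsTimeout);
    }
}
```

Passing `null` as the inner context covers the case discussed in this comment, where no context is installed and the wrapper falls back to the base class behaviour.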

feat: blocking detection

5a29e10

bruno-garcia mentioned this pull request Oct 9, 2023

Events for blocking calls getsentry/sentry-dart#1671

Open

bruno-garcia added 6 commits October 10, 2023 21:09

half assed stuff from the weekend

17bd0bf

Merge branch 'main' into feat/blocking-detector

a25993c

verify

7173711

Merge branch 'main' into feat/blocking-detector

ef951da

ref

e1173fa

Merge remote-tracking branch 'origin' into feat/blocking-detector

579fc98

bruno-garcia force-pushed the feat/blocking-detector branch from 720a534 to 579fc98 Compare December 22, 2023 15:44

bruno-garcia added 2 commits December 22, 2023 16:57

binding

43515be

context

d83362b

bruno-garcia mentioned this pull request Dec 22, 2023

Replace SdkVersion from reflection to source generator #2992

Open

wip

047c0dc

bruno-garcia mentioned this pull request Jan 2, 2024

Capture logcat output as attachment getsentry/sentry-java#3075

Open

wip

fc1d708

jamescrosswell self-assigned this Feb 6, 2024

jamescrosswell linked an issue Feb 7, 2024 that may be closed by this pull request

Blocking detection #3123

Closed

jamescrosswell and others added 5 commits February 7, 2024 14:33

Merge branch 'main' into feat/blocking-detector

869168f

Fixed compiler errors

9859b21

Update CHANGELOG.md

3fa8364

Format code

3cb5bb6

Merge branch 'feat/blocking-detector' of github.com:getsentry/sentry-…

50f5b6d

…dotnet into feat/blocking-detector

jamescrosswell added 4 commits February 13, 2024 21:53

Reintroducing Bruno's changes with tests

85509fd

Reintroduced changes to MainSentryEventProcessor

c75eb29

Reintroduced the core blocking capability

0c15b10

Update SentryStackTraceFactory.cs

0c855f4

bruno-garcia marked this pull request as ready for review February 27, 2024 18:41

bruno-garcia requested review from bitsandfoxes and jamescrosswell as code owners February 27, 2024 18:41

bruno-garcia and others added 5 commits February 27, 2024 16:57

Merge branch 'main' into feat/blocking-detector

2e8b05a

merge issues

df2a8d6

Refactored to improve testability

42ed4a8

Merge branch 'feat/blocking-detector' of github.com:getsentry/sentry-…

c87472e

…dotnet into feat/blocking-detector

Format code

1cf1d18

jamescrosswell added 2 commits February 28, 2024 15:34

Moved IBlockingMonitor to a separate file

45c67b9

Merge branch 'feat/blocking-detector' of github.com:getsentry/sentry-…

f8d872a

…dotnet into feat/blocking-detector

Merge branch 'main' into feat/blocking-detector

9846d5a

bruno-garcia commented Feb 28, 2024

src/Sentry/Ben.BlockingDetector/StaticRecursionTracker.cs

Merge branch 'main' into feat/blocking-detector

322afff

bruno-garcia requested a review from vaind March 5, 2024 22:24

bitsandfoxes reviewed Mar 6, 2024

src/Sentry/Ben.BlockingDetector/BlockingMonitor.cs

jamescrosswell added 2 commits March 7, 2024 10:35

Merge branch 'main' into feat/blocking-detector

80be4dd

Applied review feedback

ffef982

bitsandfoxes mentioned this pull request Mar 7, 2024

Improve grouping #3202

Open

bitsandfoxes approved these changes Mar 7, 2024


bruno-garcia merged commit d1e5efc into main Mar 8, 2024
30 checks passed

bruno-garcia deleted the feat/blocking-detector branch March 8, 2024 03:10

jamescrosswell reviewed Mar 10, 2024


jamescrosswell mentioned this pull request Apr 16, 2024

Added documentation for Blocking Detection in ASP.NET Core getsentry/sentry-docs#9712

Merged

3 tasks
