Skip to content

Conversation

@kddubey
Copy link
Contributor

@kddubey kddubey commented Jan 30, 2025

Background

possible_cause_confidence increases when there's less speculation (which correlates w/ slightly more accurate causes)

possible_cause_novelty increases when there's more novelty / less redundancy wrt whats_wrong

thresholds led to 50% of possible causes getting dropped on our autofix sentry issues (notebook here)

Backend changes

corresponding backend change to sentry: #84346

corresponding backend change to seer: https://github.com/getsentry/seer/pull/1788

without the sentry backend change (there are no data.scores), default to current behavior: always show possible cause

before

with the change, only show it if both scores are greater than the threshold

after

(note: this example is actually considered novel and confident-enough. I hardcoded the threshold to test the behavior)

@kddubey kddubey requested a review from roaga January 30, 2025 23:10
@github-actions github-actions bot added the Scope: Frontend Automatically applied to PRs that change frontend components label Jan 30, 2025
@codecov
Copy link

codecov bot commented Jan 30, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

✅ All tests successful. No failed tests found.

Additional details and impacted files
@@             Coverage Diff             @@
##           master   #84349       +/-   ##
===========================================
+ Coverage   60.35%   87.62%   +27.27%     
===========================================
  Files        9606     9601        -5     
  Lines      543349   543049      -300     
  Branches    21333    21275       -58     
===========================================
+ Hits       327945   475860   +147915     
+ Misses     215050    66836   -148214     
+ Partials      354      353        -1     

Copy link
Member

@roaga roaga left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The code looks fine, but wanna add a test?

@kddubey kddubey requested a review from a team as a code owner January 31, 2025 00:08
@JoshFerge
Copy link
Member

no review notes, but this is awesome. great work

@kddubey kddubey force-pushed the kddubey/issue-summary/thresholds-possible-cause-frontend branch from 19c3449 to 1fc50e7 Compare January 31, 2025 20:56
@kddubey kddubey merged commit e84cc72 into master Jan 31, 2025
43 checks passed
@kddubey kddubey deleted the kddubey/issue-summary/thresholds-possible-cause-frontend branch January 31, 2025 21:09
andrewshie-sentry pushed a commit that referenced this pull request Feb 5, 2025
…84349)

## Background

`possible_cause_confidence` increases when there's less speculation
(which correlates w/ slightly more accurate causes)

`possible_cause_novelty` increases when there's more novelty / less
redundancy wrt `whats_wrong`

thresholds led to 50% of possible causes getting dropped on our autofix
sentry issues ([notebook
here](https://github.com/getsentry/data-analysis/blob/main/autofix/issue_summary/issue_summary_confidence.ipynb))

## Backend changes

corresponding backend change to sentry: #84346

corresponding backend change to seer:
getsentry/seer#1788

without the sentry backend change (there are no `data.scores`), default
to current behavior: always show possible cause


![before](https://github.com/user-attachments/assets/9f632a94-690b-4602-bdea-cbd2b95824b0)

with the change, only show it if both scores are greater than the
threshold


![after](https://github.com/user-attachments/assets/0b1e84af-cc0b-4d0a-8106-58cc615ea602)

(note: this example is actually considered novel and confident-enough. I
hardcoded the threshold to test the behavior)
@github-actions github-actions bot locked and limited conversation to collaborators Feb 16, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

Scope: Frontend Automatically applied to PRs that change frontend components

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants