[WIP] Draft `grouphashmetadata` table #75980

lobsterkatie · 2024-08-12T17:20:09Z

NOTE: This is an overview of what the eventual state of the table might look like, but it is not meant to be merged. It's just a place to have a conversation about the overall plan.

My first stab at creating the new table. General feedback is welcome, but I also have two general questions:

1. These are all of the fields that, as of now, are likely to be added in this first go-round. I'm sure there'll be changes, though. Do you think it's better to create the table with all the fields up front and possibly write migrations to change them later, or to add each field as I add the code related to it?
2. Are there any bits I'm missing, things besides just the fields?

github-actions · 2024-08-12T17:22:53Z

This PR has a migration; here is the generated SQL for src/sentry/migrations/0748_create_grouphashmetadata_table.py

--
-- Create model GroupHashMetadata
--
CREATE TABLE "sentry_grouphashmetadata" ("id" bigint NOT NULL PRIMARY KEY GENERATED BY DEFAULT AS IDENTITY, "date_added" timestamp with time zone NOT NULL, "grouping_method" integer NOT NULL CHECK ("grouping_method" >= 0), "grouping_config" varchar NOT NULL, "hash_basis" varchar NULL, "hashing_metadata" text NULL, "enhancements" text NULL, "fingerprint" text NULL, "seer_model" varchar NULL, "seer_date_sent" timestamp with time zone NULL, "seer_results" text NULL, "event_sent" varchar(32) NULL, "grouphash_id" bigint NOT NULL UNIQUE, "secondary_hash_match_id" bigint NULL);
ALTER TABLE "sentry_grouphashmetadata" ADD CONSTRAINT "sentry_grouphashmeta_grouphash_id_c47122d9_fk_sentry_gr" FOREIGN KEY ("grouphash_id") REFERENCES "sentry_grouphash" ("id") DEFERRABLE INITIALLY DEFERRED NOT VALID;
ALTER TABLE "sentry_grouphashmetadata" VALIDATE CONSTRAINT "sentry_grouphashmeta_grouphash_id_c47122d9_fk_sentry_gr";
ALTER TABLE "sentry_grouphashmetadata" ADD CONSTRAINT "sentry_grouphashmeta_secondary_hash_match_f564cbf0_fk_sentry_gr" FOREIGN KEY ("secondary_hash_match_id") REFERENCES "sentry_grouphash" ("id") DEFERRABLE INITIALLY DEFERRED NOT VALID;
ALTER TABLE "sentry_grouphashmetadata" VALIDATE CONSTRAINT "sentry_grouphashmeta_secondary_hash_match_f564cbf0_fk_sentry_gr";
CREATE INDEX CONCURRENTLY "sentry_grouphashmetadata_secondary_hash_match_id_f564cbf0" ON "sentry_grouphashmetadata" ("secondary_hash_match_id");
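
The `UNIQUE` constraint on `grouphash_id` above means each `sentry_grouphash` row can have at most one metadata row, while `secondary_hash_match_id` is an optional second reference to the same table. The following is a minimal sketch of that behavior using Python's built-in `sqlite3` rather than Postgres. It assumes a heavily trimmed schema (only the key columns, immediate rather than deferred foreign keys), and the helper names `build_demo_db` and `try_insert` are hypothetical, not part of Sentry.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a trimmed-down copy of the two tables."""
    conn = sqlite3.connect(":memory:")
    # SQLite only enforces foreign keys when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(
        """
        CREATE TABLE sentry_grouphash (id INTEGER PRIMARY KEY);
        CREATE TABLE sentry_grouphashmetadata (
            id INTEGER PRIMARY KEY,
            grouphash_id INTEGER NOT NULL UNIQUE
                REFERENCES sentry_grouphash (id),
            secondary_hash_match_id INTEGER NULL
                REFERENCES sentry_grouphash (id)
        );
        INSERT INTO sentry_grouphash (id) VALUES (1), (2);
        """
    )
    return conn


def try_insert(conn, grouphash_id, secondary_hash_match_id=None) -> bool:
    """Insert a metadata row; return False if a constraint rejects it."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO sentry_grouphashmetadata "
                "(grouphash_id, secondary_hash_match_id) VALUES (?, ?)",
                (grouphash_id, secondary_hash_match_id),
            )
        return True
    except sqlite3.IntegrityError:
        return False


if __name__ == "__main__":
    conn = build_demo_db()
    print(try_insert(conn, 1, 2))  # first metadata row for grouphash 1
    print(try_insert(conn, 1))     # second row for grouphash 1 (UNIQUE)
    print(try_insert(conn, 99))    # grouphash 99 does not exist (FK)
```

In the real migration the foreign keys are `DEFERRABLE INITIALLY DEFERRED`, so a violation would surface at commit time rather than at the `INSERT`; the sketch does not reproduce that.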

codecov · 2024-08-12T17:27:53Z

Test Failures Detected: Due to failing tests, we cannot provide coverage reports at this time.

❌ Failed Test Results:

Completed 21748 tests with 5 failed, 21542 passed and 201 skipped.

pytest

Class name: tests.sentry.backup.test_comparators
Test name: test_default_comparators
Flags:

backend

.../sentry/backup/test_comparators.py:2162: in test_default_comparators
    insta_snapshot(serialized)
E   Failed: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
E   Snapshot .../snapshots/test_comparators/test_default_comparators.pysnap changed!
E
E   Re-run pytest with SENTRY_SNAPSHOTS_WRITEBACK=new and then use 'make review-python-snapshots' to review.
E
E   Or: Use SENTRY_SNAPSHOTS_WRITEBACK=1 to update snapshots directly.
E
E   ---
E   +++
E   @@ -569,6 +569,12 @@
E        - group_tombstone_id
E        - project
E      model_name: sentry.grouphash
E   +- comparators:
E   +  - class: ForeignKeyComparator
E   +    fields:
E   +    - grouphash
E   +    - secondary_hash_match
E   +  model_name: sentry.grouphashmetadata
E    - comparators:
E      - class: ForeignKeyComparator
E        fields:
E   ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Class name: tests.sentry.backup.test_dependencies
Test name: test_detailed
Flags:

backend

.../sentry/backup/test_dependencies.py:54: in test_detailed
    assert_model_dependencies(expect, actual)
.../sentry/backup/test_dependencies.py:42: in assert_model_dependencies
    raise AssertionError(
E   AssertionError: Model dependency graph does not match fixture. This means that you have changed the model dependency graph in some load bearing way. If you are seeing this in CI, and the dependency changes are intentional, please run `bin/generate-model-dependency-fixtures` and re-upload:
E
E   ---
E   +++
E   @@ -2306,6 +2306,33 @@
E          ]
E        ]
E      },
E   +  "sentry.grouphashmetadata": {
E   +    "dangling": false,
E   +    "foreign_keys": {
E   +      "grouphash": {
E   +        "kind": "FlexibleForeignKey",
E   +        "model": "sentry.grouphash",
E   +        "nullable": false
E   +      },
E   +      "secondary_hash_match": {
E   +        "kind": "FlexibleForeignKey",
E   +        "model": "sentry.grouphash",
E   +        "nullable": true
E   +      }
E   +    },
E   +    "model": "sentry.grouphashmetadata",
E   +    "relocation_dependencies": [],
E   +    "relocation_scope": "Excluded",
E   +    "silos": [
E   +      "Region"
E   +    ],
E   +    "table_name": "sentry_grouphashmetadata",
E   +    "uniques": [
E   +      [
E   +        "grouphash"
E   +      ]
E   +    ]
E   +  },
E      "sentry.grouphistory": {
E        "dangling": false,
E        "foreign_keys": {

Class name: tests.sentry.backup.test_dependencies
Test name: test_flat
Flags:

backend

.../sentry/backup/test_dependencies.py:63: in test_flat
    assert_model_dependencies(expect, actual)
.../sentry/backup/test_dependencies.py:42: in assert_model_dependencies
    raise AssertionError(
E   AssertionError: Model dependency graph does not match fixture. This means that you have changed the model dependency graph in some load bearing way. If you are seeing this in CI, and the dependency changes are intentional, please run `bin/generate-model-dependency-fixtures` and re-upload:
E
E   ---
E   +++
E   @@ -316,6 +316,9 @@
E        "sentry.grouptombstone",
E        "sentry.project"
E      ],
E   +  "sentry.grouphashmetadata": [
E   +    "sentry.grouphash"
E   +  ],
E      "sentry.grouphistory": [
E        "sentry.group",
E        "sentry.organization",

Class name: tests.sentry.backup.test_dependencies
Test name: test_sorted
Flags:

backend

.../sentry/backup/test_dependencies.py:72: in test_sorted
    assert_model_dependencies(expect, actual)
.../sentry/backup/test_dependencies.py:42: in assert_model_dependencies
    raise AssertionError(
E   AssertionError: Model dependency graph does not match fixture. This means that you have changed the model dependency graph in some load bearing way. If you are seeing this in CI, and the dependency changes are intentional, please run `bin/generate-model-dependency-fixtures` and re-upload:
E
E   ---
E   +++
E   @@ -217,6 +217,7 @@
E      "sentry.sentryappinstallationtoken",
E      "sentry.sentryappinstallationforprovider",
E      "sentry.incident",
E   +  "sentry.grouphashmetadata",
E      "sentry.dashboardwidgetqueryondemand",
E      "sentry.alertruletriggerexclusion",
E      "sentry.alertruletriggeraction",

Class name: tests.sentry.backup.test_dependencies
Test name: test_truncate
Flags:

backend

.../sentry/backup/test_dependencies.py:81: in test_truncate
    assert_model_dependencies(expect, actual)
.../sentry/backup/test_dependencies.py:42: in assert_model_dependencies
    raise AssertionError(
E   AssertionError: Model dependency graph does not match fixture. This means that you have changed the model dependency graph in some load bearing way. If you are seeing this in CI, and the dependency changes are intentional, please run `bin/generate-model-dependency-fixtures` and re-upload:
E
E   ---
E   +++
E   @@ -217,6 +217,7 @@
E      "sentry_sentryappinstallationtoken",
E      "sentry_sentryappinstallationforprovider",
E      "sentry_incident",
E   +  "sentry_grouphashmetadata",
E      "sentry_dashboardwidgetqueryondemand",
E      "sentry_alertruletriggerexclusion",
E      "sentry_alertruletriggeraction",
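
The `test_flat` and `test_sorted` diffs above are related: the flat fixture records that `sentry.grouphashmetadata` depends on `sentry.grouphash`, and the sorted fixture places the new model somewhere after the models it depends on. As a rough illustration of how a flat dependency map can yield such an ordering, here is a sketch using the standard library's `graphlib`. This is not how Sentry generates its fixtures (that is done by `bin/generate-model-dependency-fixtures`), and the `dependency_order` helper is hypothetical.

```python
from graphlib import TopologicalSorter

# Flat dependency map: model name -> models it references via foreign keys.
# Only the entry added in the test_flat diff is reproduced here.
flat_dependencies = {
    "sentry.grouphashmetadata": ["sentry.grouphash"],
}


def dependency_order(flat: dict[str, list[str]]) -> list[str]:
    """Return model names ordered so that dependencies come first."""
    # TopologicalSorter treats each value list as the key's predecessors.
    return list(TopologicalSorter(flat).static_order())


if __name__ == "__main__":
    for model in dependency_order(flat_dependencies):
        print(model)
```

Any ordering that keeps `sentry.grouphash` ahead of `sentry.grouphashmetadata` satisfies the constraint, which is presumably why the exact position in the sorted fixture has to be regenerated rather than hand-edited.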

armenzg

This is pretty cool to see. Good start 👍🏻

armenzg · 2024-08-12T19:54:09Z