Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use hash splitting for deterministic patient split assignments upon dataset updates. #45

Open
mmcdermott opened this issue Jul 19, 2024 · 1 comment
Labels
MEDS-Extract priority:low A low priority issue.

Comments

@mmcdermott
Copy link
Owner

See here for example:

https://github.com/som-shahlab/femr/blob/main/src/femr/splits.py#L58

@mmcdermott mmcdermott added priority:low A low priority issue. MEDS-Extract labels Jul 30, 2024
@mmcdermott
Copy link
Owner Author

See #132 for an example of something like hash splitting for ordering shards in this codebase. It is not about patient splits, so it is not a direct analog, but it is related..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
MEDS-Extract priority:low A low priority issue.
Projects
None yet
Development

No branches or pull requests

1 participant