Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Avoid memory over usage on GPU nodes in the SparkPlan #7252

Closed
7 tasks done
revans2 opened this issue Dec 5, 2022 · 3 comments
Closed
7 tasks done

[FEA] Avoid memory over usage on GPU nodes in the SparkPlan #7252

revans2 opened this issue Dec 5, 2022 · 3 comments
Labels
epic Issue that encompasses a significant feature or body of work reliability Features to improve reliability or bugs that severly impact the reliability of the plugin

Comments

@revans2
Copy link
Collaborator

revans2 commented Dec 5, 2022

Is your feature request related to a problem? Please describe.
The goal of this epic is to provide a framework and update a few SparkPlan nodes so that they can intelligently retry tasks when OOM failures are encountered.

@revans2 revans2 added feature request New feature or request ? - Needs Triage Need team to review and classify epic Issue that encompasses a significant feature or body of work labels Dec 5, 2022
@revans2 revans2 removed the feature request New feature or request label Dec 5, 2022
@sameerz sameerz added the reliability Features to improve reliability or bugs that severly impact the reliability of the plugin label Dec 6, 2022
@mattahrens mattahrens removed the ? - Needs Triage Need team to review and classify label Dec 8, 2022
@tgravescs
Copy link
Collaborator

do we want to add #7934 to this?

@revans2 revans2 closed this as completed Apr 4, 2023
@ttnghia
Copy link
Collaborator

ttnghia commented Apr 6, 2023

Should we consider adding retry for Parquet writer into this too?

@revans2
Copy link
Collaborator Author

revans2 commented Apr 7, 2023

@ttnghia this epic is closed. It was a bit of a dumping ground for issues, and I wanted to clean it up. #8029 has parquet writes it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
epic Issue that encompasses a significant feature or body of work reliability Features to improve reliability or bugs that severly impact the reliability of the plugin
Projects
None yet
Development

No branches or pull requests

5 participants