Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Qualification tool: update SQL Df value used and look at jobs in SQL #5612

Merged
merged 141 commits into from
May 24, 2022

Conversation

tgravescs
Copy link
Collaborator

Fixes #5570

Ok this has a couple of changes.

  1. Change to use the spark reported SQL DF duration instead of the stage duration since we might not have all stages.
  2. I found that the sql mapping to stages can miss quite a few stages, especially in the case an entire SQL query doesn't have any mappings. In these cases look at the SQL to Job mapping and get the stages from those and just average everything in that SQL query across those times.
  3. Fixed a couple bugs.
  4. removed a decimal check we should have removed a while.

This change makes the DF times much closer then what they were in previous version of the tool.

@tgravescs tgravescs added this to the May 23 - Jun 3 milestone May 24, 2022
@tgravescs tgravescs self-assigned this May 24, 2022
@tgravescs
Copy link
Collaborator Author

build

@tgravescs
Copy link
Collaborator Author

build

@tgravescs
Copy link
Collaborator Author

build

@tgravescs tgravescs merged commit 808bcc2 into NVIDIA:branch-22.06 May 24, 2022
@tgravescs tgravescs deleted the qualDFTime branch May 24, 2022 21:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Qualification tool investigate SQL Df value reported
3 participants