Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Report equivilant stages/sql ids as a part of compare #2793

Merged
merged 3 commits into from
Jun 28, 2021

Conversation

revans2
Copy link
Collaborator

@revans2 revans2 commented Jun 23, 2021

For the Profile tool this helps a lot when trying to compare large complicated queries to each other.

This refactored dot generation because I wanted to reused the stage matching part of it, but I needed a better way to do it. I also wanted to be able to output dot graphs as a apart of compare, even if they were not producing a graph that compared the different applications.

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
@revans2 revans2 added the task Work required that improves the product but is not user facing label Jun 23, 2021
@revans2 revans2 added this to the June 21 - July 2 milestone Jun 23, 2021
@revans2 revans2 self-assigned this Jun 23, 2021

import SparkPlanInfoWithStage._

def debugEquals(other: Any, depth: Int = 0): Boolean = {
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left this in, because our plugin is constantly changing and we have not really tested this with databricks or others, so if we run into issues, this is a quick way to track down what parts of a plan need to be normalized for the comparison to work properly.

+-----------------------+-----------------------+
|app-20210329165943-0103|app-20210329170243-0018|
+-----------------------+-----------------------+
|0 |0 |
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

would it be helpful to also add time each took? I guess we can always add more later.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ya I thought about that, but I didn't want to make it too complicated to start out with.

@tgravescs
Copy link
Collaborator

build

@revans2 revans2 merged commit 087dbc9 into NVIDIA:branch-21.08 Jun 28, 2021
@revans2 revans2 deleted the equivilant_stages branch June 28, 2021 14:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
task Work required that improves the product but is not user facing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants