[BUG] Profiling tool generate dot file fails to convert #3066

Closed
tgravescs opened this issue Jul 28, 2021 · 2 comments · Fixed by #3068

@tgravescs
Collaborator

Describe the bug

Running the profiling tool to generate the dot graph with:

$SPARK_HOME/bin/spark-submit --master local[*] --driver-memory 30g --class com.nvidia.spark.rapids.tool.profiling.ProfileMain ./tools/target/rapids-4-spark-tools_2.12-21.08.0-SNAPSHOT.jar -g application_1616746343401_0025
Converting the resulting file to either PDF or SVG then fails:

$ dot -Tpdf  application_1616746343401_0025-query-0.dot > app.pdf
Error: not well-formed (invalid token) in line 95
...
                                    :           :              +- GpuFileGpuScan parquet default.insight_baiduapp_follow_history[cuid#5,uid#6,third_id#7,type#8,ext#13,event_day#14] Batched: true, DataFilters: [isnotnull(type#8), isnotnull(cuid#5), NOT (cuid#5 = ), (type#8 = ugc), (NOT (cast(ext#13[fake] a..., Format: Parquet, Location: InMemoryFileIndex[hdfs://bdbl-rpm-1106-57451.bdbl:9000/user/turing/spark/bbs/work/fengchao/ubs_da..., PartitionFilters: [isnotnull(event_day#14), (event_day#14 = 20201118)], PushedFilters: [IsNotNull(type), IsNotNull(cuid), Not(EqualTo(cuid,)), EqualTo(type,ugc)], ReadSchema: struct<cuid:string,uid:string,third_id:string,type:string,ext:map<string,string> ...
in label of graph G
@tgravescs added the bug and ? - Needs Triage labels Jul 28, 2021
@tgravescs self-assigned this Jul 28, 2021
@tgravescs removed the ? - Needs Triage label Jul 28, 2021
@tgravescs
Collaborator Author

The issue is with the label generation introduced here: 13b740c#diff-8dc61637872038b82e8dda72007efa65d71ff93aae90b312c7185abd320dbe87

It looks like certain characters in the plan string can mess it up.

@tgravescs
Collaborator Author

ReadSchema: struct<_c1:string,_c2:string><br align="left"/>

causes issues, since the `<` and `>` from the schema string end up unescaped inside the HTML-like label.
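
For illustration only, here is a minimal Python sketch of one way to avoid this class of failure. It assumes the label is a Graphviz HTML-like label (suggested by the `<br align="left"/>` in the snippet above), which Graphviz parses as XML, so `<`, `>` and `&` in the plan text need to be escaped before the line-break tags are appended. The function names `html_label` and `to_dot` are hypothetical. This is not the profiling tool's code and not necessarily how #3068 fixed it.

```python
from html import escape


def html_label(plan_text: str) -> str:
    """Build a Graphviz HTML-like label from raw plan text (hypothetical helper).

    Each plan line is escaped first, so characters such as '<' and '>' in
    'struct<...>' cannot be mistaken for markup, and only then is the
    left-aligned line break tag appended.
    """
    body = "".join(
        escape(line, quote=False) + '<br align="left"/>'
        for line in plan_text.splitlines()
    )
    # HTML-like labels are delimited by angle brackets instead of quotes.
    return "<" + body + ">"


def to_dot(plan_text: str) -> str:
    """Wrap the label in a minimal graph (hypothetical helper)."""
    return "digraph G {\n  label=" + html_label(plan_text) + "\n}\n"


if __name__ == "__main__":
    print(to_dot("ReadSchema: struct<_c1:string,_c2:string>"))
```

With the escaping in place, the schema line is emitted as `struct&lt;_c1:string,_c2:string&gt;` followed by the `<br align="left"/>` tag, which should be well-formed for `dot -Tpdf` or `dot -Tsvg`.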
