Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] test_non_empty_ctas fails on yarn #3476

Closed
tgravescs opened this issue Sep 14, 2021 · 6 comments · Fixed by #3517 or #3632
Closed

[BUG] test_non_empty_ctas fails on yarn #3476

tgravescs opened this issue Sep 14, 2021 · 6 comments · Fixed by #3517 or #3632
Assignees
Labels
bug Something isn't working P1 Nice to have for release

Comments

@tgravescs
Copy link
Collaborator

Describe the bug
06:31:10 integration_tests/src/main/python/parquet_write_test.py::test_non_empty_ctas[True][ALLOW_NON_GPU(DataWritingCommandExec,HiveTableScanExec)] FAILED [ 99%]
06:31:10 integration_tests/src/main/python/parquet_write_test.py::test_non_empty_ctas[False][ALLOW_NON_GPU(DataWritingCommandExec,HiveTableScanExec)] FAILED [100%]
06:31:10

E                   py4j.protocol.Py4JJavaError: An error occurred while calling o106.sql.
06:31:10  E                   : java.lang.ClassNotFoundException: org.apache.hadoop.hive.ql.plan.LoadTableDesc$LoadFileType
06:31:10  E                   	at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
06:31:10  E                   	at java.lang.ClassLoader.loadClass(ClassLoader.java:418)
06:31:10  E                   	at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:352)
06:31:10  E                   	at java.lang.ClassLoader.loadClass(ClassLoader.java:351)
06:31:10  E                   	at org.apache.****.sql.hive.client.Shim_v3_0.clazzLoadFileType$lzycompute(HiveShim.scala:1272)
06:31:10  E                   	at org.apache.****.sql.hive.client.Shim_v3_0.clazzLoadFileType(HiveShim.scala:1271)
06:31:10  E                   	at org.apache.****.sql.hive.client.Shim_v3_0.loadTable(HiveShim.scala:1355)
06:31:10  E                   	at org.apache.****.sql.hive.client.HiveClientImpl.$anonfun$loadTable$1(HiveClientImpl.scala:885)
06:31:10  E                   	at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
06:31:10  E                   	at org.apache.****.sql.hive.client.HiveClientImpl.$anonfun$withHiveState$1(HiveClientImpl.scala:294)
06:31:10  E                   	at org.apache.****.sql.hive.client.HiveClientImpl.liftedTree1$1(HiveClientImpl.scala:227)
06:31:10  E                   	at org.apache.****.sql.hive.client.HiveClientImpl.retryLocked(HiveClientImpl.scala:226)
06:31:10  E                   	at org.apache.****.sql.hive.client.HiveClientImpl.withHiveState(HiveClientImpl.scala:276)
06:31:10  E                   	at org.apache.****.sql.hive.client.HiveClientImpl.loadTable(HiveClientImpl.scala:880)
06:31:10  E                   	at org.apache.****.sql.hive.HiveExternalCatalog.$anonfun$loadTable$1(HiveExternalCatalog.scala:884)
06:31:10  E                   	at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
06:31:10  E                   	at org.apache.****.sql.hive.HiveExternalCatalog.withClient(HiveExternalCatalog.scala:103)
06:31:10  E                   	at org.apache.****.sql.hive.HiveExternalCatalog.loadTable(HiveExternalCatalog.scala:878)
06:31:10  E                   	at org.apache.****.sql.catalyst.catalog.ExternalCatalogWithListener.loadTable(ExternalCatalogWithListener.scala:167)
06:31:10  E        
@tgravescs tgravescs added bug Something isn't working ? - Needs Triage Need team to review and classify P0 Must have for release labels Sep 14, 2021
@Salonijain27 Salonijain27 removed the ? - Needs Triage Need team to review and classify label Sep 14, 2021
@tgravescs
Copy link
Collaborator Author

this failed again after kicking a second time.

@jlowe jlowe self-assigned this Sep 16, 2021
@jlowe
Copy link
Member

jlowe commented Sep 16, 2021

This looks like the same stacktrace as reported at apache/spark#26619 (comment). The stacktrace shows that a Hive 3.x shim is getting loaded, but it looks like a Hive 2.x jar is being used underneath it which goes about as well as one would expect.

@tgravescs
Copy link
Collaborator Author

seeing some tests on ucx egx yarn after this:

[2021-09-17T08:09:26.387Z] integration_tests/src/main/python/parquet_write_test.py::test_non_empty_ctas[True][ALLOW_NON_GPU(DataWritingCommandExec,HiveTableScanExec)] [31mFAILED[0m[31m [ 99%][0m
[2021-09-17T08:09:26.387Z] integration_tests/src/main/python/parquet_write_test.py::test_non_empty_ctas[False][ALLOW_NON_GPU(DataWritingCommandExec,HiveTableScanExec)] [31mFAILED[0m[31m [100%][0m

[2021-09-17T08:09:26.388Z] �[1m�[31mE : java.lang.ClassNotFoundException: org.apache.hadoop.hive.ql.plan.LoadTableDesc$LoadFileType�[0m

@tgravescs
Copy link
Collaborator Author

actually the nightly didn't push last night so reclose this and see if it fails again

@jlowe
Copy link
Member

jlowe commented Sep 20, 2021

This is still happening.

@jlowe jlowe reopened this Sep 20, 2021
@Salonijain27 Salonijain27 added P1 Nice to have for release and removed P0 Must have for release labels Sep 21, 2021
@jlowe jlowe changed the title [BUG]parquet_write_test fails on yarn [BUG] test_non_empty_ctas fails on yarn Sep 21, 2021
@jlowe
Copy link
Member

jlowe commented Sep 21, 2021

Note that the failure that is occurring happens when the test is performing a CPU-only table write. There's no GPU operations being performed, so it seems like the Spark YARN cluster is misconfigured somehow to cause this type of error.

@jlowe jlowe assigned abellina and unassigned jlowe Sep 23, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working P1 Nice to have for release
Projects
None yet
4 participants