Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]test_cache_expand_exec is failing on 330 #5673

Closed
firestarman opened this issue May 27, 2022 · 4 comments
Closed

[BUG]test_cache_expand_exec is failing on 330 #5673

firestarman opened this issue May 27, 2022 · 4 comments
Labels
bug Something isn't working invalid This doesn't seem right

Comments

@firestarman
Copy link
Collaborator

Describe the bug
test_cache_expand_exec is failing according to the 330 nightly IT.

[2022-05-07T10:58:56.458Z] FAILED ../../src/main/python/cache_test.py::test_cache_expand_exec[{'spark.sql.inMemoryColumnarStorage.enableVectorizedReader': 'true'}-Decimal(20,2)][IGNORE_ORDER]
[2022-05-07T10:58:56.458Z] FAILED ../../src/main/python/cache_test.py::test_cache_expand_exec[{'spark.sql.inMemoryColumnarStorage.enableVectorizedReader': 'false'}-String][IGNORE_ORDER]


  Caused by: java.lang.NoSuchMethodException: org.apache.spark.sql.execution.datasources.parquet.VectorizedColumnReader.readBatch(int, org.apache.spark.sql.execution.vectorized.WritableColumnVector)
[2022-05-07T10:58:56.457Z] E                   	at java.lang.Class.getDeclaredMethod(Class.java:2130)
[2022-05-07T10:58:56.457Z] E                   	at com.nvidia.spark.rapids.ParquetCachedBatchSerializer$CachedBatchIteratorConsumer.<init>(ParquetCachedBatchSerializer.scala:693)
[2022-05-07T10:58:56.457Z] E                   	at com.nvidia.spark.rapids.ParquetCachedBatchSerializer.$anonfun$convertCachedBatchToInternalRow$1(ParquetCachedBatchSerializer.scala:595)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.$anonfun$mapPartitions$2(RDD.scala:855)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.$anonfun$mapPartitions$2$adapted(RDD.scala:855)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:365)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:365)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:365)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.rdd.RDD.iterator(RDD.scala:329)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.shuffle.ShuffleWriteProcessor.write(ShuffleWriteProcessor.scala:59)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:99)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:52)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.scheduler.Task.run(Task.scala:136)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:548)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1504)
[2022-05-07T10:58:56.457Z] E                   	at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:551)
[2022-05-07T10:58:56.457Z] E                   	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
[2022-05-07T10:58:56.457Z] E                   	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
[2022-05-07T10:58:56.457Z] E                   	... 1 more
@firestarman firestarman added bug Something isn't working ? - Needs Triage Need team to review and classify labels May 27, 2022
@tgravescs
Copy link
Collaborator

This is odd, these tests have been failing since April 9th.

These tests passed for me running against official rc3 spark 3.3 image yesterday though.

@tgravescs
Copy link
Collaborator

this should have been fixed with #5425, my guess is something wrong with integration env, the line number if complains about is from before that change

@tgravescs
Copy link
Collaborator

I think you may have looked at the old IT build, last nights passed as well, closing.

@sameerz sameerz added invalid This doesn't seem right and removed ? - Needs Triage Need team to review and classify labels May 28, 2022
@firestarman
Copy link
Collaborator Author

firestarman commented May 30, 2022

The IT number I checked were 81 and 82, but maybe they still referred to the old spark images.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working invalid This doesn't seem right
Projects
None yet
Development

No branches or pull requests

3 participants