Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Predicate push down when converting CachedBatch to InternalRow #1019

Open
razajafri opened this issue Oct 23, 2020 · 2 comments
Open
Labels
P2 Not required for release performance A performance related task/issue

Comments

@razajafri
Copy link
Collaborator

This concerns me a little it indicates that we are throwing out rows after parquet read them instead of doing predicate push down. I am fine with this for now, but we should have a follow on issue to fix it.

Originally posted by @revans2 in #1001 (comment)

@revans2 revans2 changed the title Converting CachedBatch to InternalRow on CPU is throwing away rows which could be made more efficient Support Predicate push down when converting CachedBatch to InternalRow Oct 26, 2020
@sameerz
Copy link
Collaborator

sameerz commented Dec 10, 2020

@razajafri should this be tracked in #1143?

Please add a label to this.

@razajafri razajafri added the performance A performance related task/issue label Dec 10, 2020
@razajafri
Copy link
Collaborator Author

I have added this to #1143

@mattahrens mattahrens added the P2 Not required for release label Apr 27, 2022
tgravescs pushed a commit to tgravescs/spark-rapids that referenced this issue Nov 30, 2023
…IDIA#1019)

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
P2 Not required for release performance A performance related task/issue
Projects
None yet
Development

No branches or pull requests

3 participants