Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize ParquetCacheBatchSerializer #2234

Closed
razajafri opened this issue Apr 22, 2021 · 2 comments
Closed

Optimize ParquetCacheBatchSerializer #2234

razajafri opened this issue Apr 22, 2021 · 2 comments
Assignees
Labels
feature request New feature or request performance A performance related task/issue

Comments

@razajafri
Copy link
Collaborator

razajafri commented Apr 22, 2021

This is an uber issue to track optimizations needed in Cache Serializer

  • Only pass the needed conf instead of broadcasting the Map
  • Compression is AUTO right now which may compromise performance
  • Filter method is a noop at the moment, this might impact performance
@razajafri razajafri added feature request New feature or request ? - Needs Triage Need team to review and classify labels Apr 22, 2021
@razajafri razajafri self-assigned this Apr 22, 2021
@razajafri razajafri added the performance A performance related task/issue label Apr 22, 2021
@sameerz sameerz removed the ? - Needs Triage Need team to review and classify label Apr 27, 2021
@sameerz
Copy link
Collaborator

sameerz commented Apr 27, 2021

Is this a duplicate of #1143? If not, can this be combined with the already open issue?

@razajafri
Copy link
Collaborator Author

Yes, thanks for pointing that out @sameerz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request performance A performance related task/issue
Projects
None yet
Development

No branches or pull requests

2 participants