PARQUET-1126: Write unencrypted Parquet files without Hadoop #1376
+97
−10
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
If you want to write an unencrypted Parquet file without Hadoop, the existing code will use Hadoop to try to get encryption properties.
parquet-java/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetWriter.java
Lines 388 to 393 in fbe13d8
However, if you have these
null
, we really didn't need to go through Hadoop. Also, it calls a helper method inParquetOutputFormat
. This class inherits from Hadoop'sFileOutputFormat
. So calling this method at all, requires Hadoop classes. To resolve this, I moved this helper into a package-protectedEncryptionPropertiesHelper
class.Closes #1497