Skip to content

Support multi-threaded writing of encrypted Parquet files #16737

@rok

Description

@rok

Is your feature request related to a problem or challenge?

#16351 added modular encryption reading and writing. We should enable multi threaded encrypted writing.

Describe the solution you'd like

output_single_parquet_file_parallelized in ParquetSink should use encryption properties passed with WriterProperties.

Describe alternatives you've considered

Current arrow-rs API does not provide public API for passing FileEncryptor (for security reasons) so no good alternatives were identified.

Additional context

This will require an upstream change in arrow-rs that is currently in draft apache/arrow-rs#7818.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions