mod parquet_source
- module parquet_source
ParquetMarkerSource: marker stream backed by a Parquet temp file.
Used when the predicted working set would not safely fit in RAM. The Parquet payload has the same schema as the in-memory variant (id, sequence, ind1, …, indN), so the per-row decoding is shared via super::arrow_source::ArrowMarkerSource::from_batches.
Enums
- enum ParquetSourceError
- EmptyInput
- TempFile(String)
- Write(String)
Traits implemented
- impl std::fmt::Display for ParquetSourceError
- impl std::error::Error for ParquetSourceError
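As a rough illustration of how the enum and its two trait implementations fit together, here is a minimal sketch using only the standard library. The variant names come from the listing above; the message wording and the `temp_file_error` helper are assumptions made for the example, not the crate's actual text or API.

```rust
use std::error::Error;
use std::fmt;

/// Mirror of the documented variants; the String payloads carry
/// a human-readable description of the underlying failure.
#[derive(Debug)]
pub enum ParquetSourceError {
    EmptyInput,
    TempFile(String),
    Write(String),
}

impl fmt::Display for ParquetSourceError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        // The message wording below is assumed for illustration only.
        match self {
            Self::EmptyInput => write!(f, "no record batches to spill"),
            Self::TempFile(msg) => write!(f, "temp file error: {msg}"),
            Self::Write(msg) => write!(f, "Parquet write error: {msg}"),
        }
    }
}

// `Error` needs only `Debug` + `Display`; the default methods suffice.
impl Error for ParquetSourceError {}

/// Hypothetical helper: map an I/O failure onto the `TempFile` variant.
pub fn temp_file_error(e: std::io::Error) -> ParquetSourceError {
    ParquetSourceError::TempFile(e.to_string())
}
```

Because the type implements std::error::Error, it can be boxed as `Box<dyn Error>` or propagated with `?` from functions that return a more general error type.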
Structs and Unions
- struct ParquetMarkerSource
Spilled-to-Parquet marker source. Owns the underlying temp file so it is removed automatically when the source is dropped.
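The remove-on-drop behaviour described here follows the usual RAII pattern. The sketch below shows that pattern with the standard library only; `SpillFile` and its `create` constructor are hypothetical stand-ins, and the real struct may well delegate cleanup to a temp-file crate rather than a hand-written `Drop`.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical stand-in for the temp file owned by `ParquetMarkerSource`.
pub struct SpillFile {
    path: PathBuf,
}

impl SpillFile {
    /// Write `payload` to `path` and take ownership of the resulting file.
    pub fn create(path: PathBuf, payload: &[u8]) -> io::Result<Self> {
        fs::write(&path, payload)?;
        Ok(Self { path })
    }

    /// Location of the on-disk payload, e.g. for diagnostics.
    pub fn path(&self) -> &Path {
        &self.path
    }
}

impl Drop for SpillFile {
    fn drop(&mut self) {
        // Best-effort cleanup: `drop` cannot return an error, so a failed
        // removal is deliberately ignored.
        let _ = fs::remove_file(&self.path);
    }
}
```

Tying the file's lifetime to the owning value means callers never have to remember an explicit cleanup step, including on early returns and error paths.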
Implementations
- impl ParquetMarkerSource
Functions
- fn path(&self) -> &std::path::Path
Path of the on-disk Parquet payload (e.g. for diagnostics).
- fn spill_from_arrow(in_mem: &ArrowMarkerSource) -> Result<Self, ParquetSourceError>
Materialise an in-memory ArrowMarkerSource as a Parquet temp file and wrap it as a ParquetMarkerSource. The original batches are dropped once the file is written.