API Reference

ParquetConfig


class monad.config.ParquetConfig

Defines the data connection to parquet files.

from monad.config import ParquetConfig

ParquetConfig(
    path="/path/to/parquet/file",
    cache_path="/path/to/cache/directory/",
)
Parameters

path : pathlib.Path
The full path to the parquet file. For parquet files divided into parts, the path should be <path_to_directory_with_parquet_files>/*.parquet. Example: /home/data/customers.parquet or /home/data/customers/*.parquet.


cache_path: Path | None
Default: None
The full path to the cache directory to store temporary query results and persisted data to optimize performance and reduce redundant computations. Example: /home/data/cache/.


config_overrides : dict[str, Any]
Default: dict
Connection configuration. Pass, e.g. {"max_memory": "100GB"} to limit RAM usage.