Apache DataFusion
Using External Indexes, Metadata Stores, Catalogs and Caches to Accelerate Queries on Apache Parquet by Andrew Lamb (InfluxData) Embedding User-Defined Indexes in Apache Parquet Files by Qi Zhu (Cloudera), Jigao Luo (Systems Group at TU Darmstadt), and Andrew Lamb (InfluxData) Apache DataFusion 49.0.0 Released Highlights I found impressive
Dynamic Filters and TopK pushdown Async User-Defined Functions (ask_llm?!) Better Cancellation for Certain Long-Running Queries