Non-fiction
"DataFusion Python Bindings in Practice"
"DataFusion Python Bindings in Practice" offers a definitive, hands-on guide to harnessing the power of Apache DataFusion from within the Python ecosystem. The book begins by grounding readers in DataFusion’s robust Rust-based architecture, highlighting its modular design and its relevance for analytic workloads. Through clear explanations and practical walkthroughs, it guides data professionals through environment setup, schema management, and an insightful comparison with leading alternatives such as PySpark and Dask, establishing how DataFusion stands out in terms of architecture and performance.
Delving deeper, the book meticulously explores data source integration, expressive query composition, and advanced workflow creation using Python. It details a wide range of supported formats—CSV, Parquet, JSON, Avro—and provides thorough guidance on schema evolution, custom data sources, and optimizing data ingestion. Readers are equipped with patterns for constructing complex data pipelines, extending DataFusion with custom user-defined functions (UDFs), and orchestrating distributed execution with fault tolerance, logging, and resource management best practices.
For developers and data engineers seeking to implement scalable, secure, and production-ready analytics, this book addresses critical concerns such as performance profiling, parallelism, security, and compliance. It rounds off with case studies, real-world applications, and discussion of the ecosystem’s future, providing practical insights into contributing to the DataFusion project and building unified analytics workflows. Whether applied in industry or research, "DataFusion Python Bindings in Practice" is an essential resource for anyone leveraging Python for high-performance, flexible big data processing.
© 2025 HiTeX Press (Ebook): 6610001025885
Data di uscita
Ebook: 20 agosto 2025
Più di 400.000 titoli
Kids Mode (accesso sicuro per bambini)
Scarica e ascolta offline
Disdici quando vuoi
Ascolto illimitato. Dove vuoi, quando vuoi.
9.99 € /mese
Disdici quando vuoi
Paghi subito 89.99€/anno, l'equivalente di 7.49€/mese, per 1 anno di ascolto illimitato.
89.99 € /anno
Disdici quando vuoi
Risparmia con più account. Ognuno con le proprie storie.
14.99 € /mese
Disdici quando vuoi
Le tue prime storie, al prezzo più basso.
6.49 € /mese
Disdici quando vuoi