09-29-2024, 11:58 AM
English | 2024 | ISBN: 9798868806018 | 631 pages | true PDF, EPUB | 51 MB
Quote:This book covers modern data engineering functions and important Python libraries, to help you develop state-of-the-art ML pipelines and integration code.
The book begins by explaining data analytics and transformation, delving into the Pandas library, its capabilities, and nuances. It then explores emerging libraries such as Polars and CuDF, providing insights into GPU-based computing and cutting-edge data manipulation techniques. The text discusses the importance of data validation in engineering processes, introducing tools such as Great Expectations and Pandera to ensure data quality and reliability. The book delves into API design and development, with a specific focus on leveraging the power of FastAPI. It covers authentication, authorization, and real-world applications, enabling you to construct efficient and secure APIs using FastAPI. Also explored is concurrency in data engineering, examining Dask's capabilities from basic setup to crafting advanced machine learning pipelines. The book includes development and delivery of data engineering pipelines using leading cloud platforms such as AWS, Google Cloud, and Microsoft Azure. The concluding chapters concentrate on real-time and streaming data engineering pipelines, emphasizing Apache Kafka and workflow orchestration in data engineering.
🌞 Contents of Download:
📌 979-8-8688-0602-5.pdf (33 MB)
-----------------------------***[ softwarez.info (OP) ]***-----------------------------
⭐️ Data Engineering For Machine Learning Pipelines From Python Libraries To ML Pipelines And Cloud Platforms ✅ (33 MB)
NitroFlare Link(s)
RapidGator Link(s)