Op werkdagen voor 23:00 besteld, morgen in huis Gratis verzending vanaf €20

The Definitive Guide to Azure Data Engineering

Modern ELT, DevOps, and Analytics on the Azure Cloud Platform

Paperback Engels 2021 9781484271810
Verwachte levertijd ongeveer 9 werkdagen

Samenvatting

Build efficient and scalable batch and real-time data ingestion pipelines, DevOps continuous integration and deployment pipelines, and advanced analytics solutions on the Azure Data Platform. This book teaches you to design and implement robust data engineering solutions using Data Factory, Databricks, Synapse Analytics, Snowflake, Azure SQL database, Stream Analytics, Cosmos database, and Data Lake Storage Gen2. You will learn how to engineer your use of these Azure Data Platform components for optimal performance and scalability. You will also learn to design self-service capabilities to maintain and drive the pipelines and your workloads.

The approach in this book is to guide you through a hands-on, scenario-based learning process that will empower you to promote digital innovation best practices while you work through your organization’s projects, challenges, and needs. The clear examples enable you to use this book as a reference and guide for building data engineering solutions in Azure. After reading this book, you will have a far stronger skill set and confidence level in getting hands on with the Azure Data Platform.

What You Will Learn

-Build dynamic, parameterized ELT data ingestion orchestration pipelines in Azure Data Factory
-Create data ingestion pipelines that integrate control tables for self-service ELT
-Implement a reusable logging framework that can be applied to multiple pipelines
-Integrate Azure Data Factory pipelines with a variety of Azure data sources and tools
-Transform data with Mapping Data Flows in Azure Data Factory
-Apply Azure DevOps continuous integration and deployment practices to your Azure Data Factory pipelines and development SQL databases
-Design and implement real-time streaming and advanced analytics solutions using Databricks, Stream Analytics, and Synapse Analytics
-Get started with a variety of Azure data services through hands-on examples

Who This Book Is For
Data engineers and data architects who are interested in learning architectural and engineering best practices around ELT and ETL on the Azure Data Platform, those who are creating complex Azure data engineering projects and are searching for patterns of success, and aspiring cloud and data professionals involved in data engineering, data governance, continuous integration and deployment of DevOps practices, and advanced analytics who want a full understanding of the many different tools and technologies that Azure Data Platform provides

Specificaties

ISBN13:9781484271810
Taal:Engels
Bindwijze:paperback
Uitgever:Apress
Verschijningsdatum:7-8-2021
Hoofdrubriek:IT-management / ICT

Lezersrecensies

Wees de eerste die een lezersrecensie schrijft!

Inhoudsopgave

Introduction
Part I. Getting Started1. The Tools and Pre-Requisites2. Data Factory vs SSIS vs Databricks3. Design a Data Lake Storage Gen2 Account
Part II. Azure Data Factory for ELT4. Dynamically Load SQL Database to Data Lake Storage Gen 25. Use COPY INTO to Load Synapse Analytics Dedicated SQL Pool6. Load Data Lake Storage Gen2 Files into Synapse Analytics Dedicated SQL Pool7. Create and Load Synapse Analytics Dedicated SQL Pool Tables Dynamically8. Build Custom Logs in SQL Database for Pipeline Activity Metrics9. Capture Pipeline Error Logs in SQL Database10. Dynamically Load Snowflake Data Warehouse11. Mapping Data Flows for Data Warehouse ETL 12. Aggregate and Transform Big Data Using Mapping Data Flows13. Incrementally Upsert Data14. Loading Excel Sheets into Azure SQL Database Tables15. Delta Lake
Part III. Real-Time Analytics in Azure16. Stream Analytics Anomaly Detection17. Real-time IoT Analytics Using Apache Spark 18. Azure Synapse Link for Cosmos DB
Part IV. DevOps for Continuous Integration and Deployment 19. Deploy Data Factory Changes 20. Deploy SQL Database
Part V. Advanced Analytics21. Graph Analytics Using Apache Spark’s GraphFrame API22. Synapse Analytics Workspaces23. Machine Learning in Databricks
Part VI. Data Governance24. Purview for Data Governance

Managementboek Top 100

Rubrieken

Populaire producten

    Personen

      Trefwoorden

        The Definitive Guide to Azure Data Engineering