Ascolta e leggi

Entra in un mondo di storie: prova Storytel gratis per 14 giorni

  • Ascolta e leggi quanto vuoi
  • Oltre 400.000 titoli
  • Disdici quando vuoi
  • Ascolta titoli esclusivi e Storytel Original
Prova gratis per 14 giorni
Device Banner Block 894x1036
Cover for Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines

Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines

Lingua
Inglese
Formato
Categoria

Non-fiction

"Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines"

Data quality failures in big data systems rarely look like broken code—they look like “successful” jobs shipping quietly corrupted tables. This book is for experienced data engineers, platform engineers, and analytics/ML practitioners who need enforceable guarantees, not ad‑hoc SQL spot checks. It treats data quality as an engineering discipline: explicit contracts, measurable signals, and operational response patterns that keep pipelines trustworthy without freezing delivery.

You’ll learn Deequ’s core model—metrics plus assertions—and how it maps onto Spark execution, cost, and reproducibility. The book goes deep on authoring production-grade constraints (completeness, uniqueness, validity, ranges, patterns, proportions), composing checks with stable thresholds, and turning failures into actionable diagnostics. It then operationalizes validation via VerificationSuite, showing how to plan analyzer execution, interpret VerificationResult edge cases, and implement gating strategies such as fail-fast, quarantine, and partial publishes. Profiling and constraint suggestion are covered as accelerators—followed by governance and rollout workflows that keep rules maintainable as data and business semantics evolve.

A strong working knowledge of Spark and DataFrames is assumed. Coverage includes longitudinal quality via metrics repositories, regression detection, and alerting, plus advanced patterns for partitioned/incremental data, late arrivals, custom analyzers, and real-world version compatibility across

© 2026 NobleTrex Press (Ebook): 6610001179250

Data di uscita

Ebook: 9 marzo 2026

Tag

    Scegli il piano che fa per te

    • Più di 400.000 titoli

    • Kids Mode (accesso sicuro per bambini)

    • Scarica e ascolta offline

    • Disdici quando vuoi

    Basic

    Le tue prime storie, al prezzo più basso.

    6.49 € /mese

    • Disdici quando vuoi

    Prova gratis per 7 giorni
    Il più popolare

    Unlimited

    Ascolto illimitato. Dove vuoi, quando vuoi.

    9.99 € /mese

    • Disdici quando vuoi

    Prova gratis per 14 giorni

    Unlimited Annuale

    Paghi subito 89.99€/anno, l'equivalente di 7.49€/mese, per 1 anno di ascolto illimitato.

    89.99 € /anno

    12 mesi al prezzo di 9
    • Disdici quando vuoi

    Prova gratis per 14 giorni

    Unlimited Family

    Risparmia con più account. Ognuno con le proprie storie.

    14.99 € /mese

    • Disdici quando vuoi

    Prova gratis per 14 giorni