Escucha y lee

Entra en un mundo infinito de historias

  • Vive la experiencia de leer y escuchar todo lo que quieras
  • Más de 650.000 títulos
  • Títulos en exclusiva y Storytel Originals
  • Primeros 14 días gratis, luego 8,99 €/mes
  • Cancela cuando quieras
Suscríbete ahora
Details page - Device banner - 894x1036
Cover for Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines

Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines

Idioma
Inglés
Formato
Categoría

No ficción

"Deequ Data Quality: Constraint‑Based Validation for Big Data Pipelines"

Data quality failures in big data systems rarely look like broken code—they look like “successful” jobs shipping quietly corrupted tables. This book is for experienced data engineers, platform engineers, and analytics/ML practitioners who need enforceable guarantees, not ad‑hoc SQL spot checks. It treats data quality as an engineering discipline: explicit contracts, measurable signals, and operational response patterns that keep pipelines trustworthy without freezing delivery.

You’ll learn Deequ’s core model—metrics plus assertions—and how it maps onto Spark execution, cost, and reproducibility. The book goes deep on authoring production-grade constraints (completeness, uniqueness, validity, ranges, patterns, proportions), composing checks with stable thresholds, and turning failures into actionable diagnostics. It then operationalizes validation via VerificationSuite, showing how to plan analyzer execution, interpret VerificationResult edge cases, and implement gating strategies such as fail-fast, quarantine, and partial publishes. Profiling and constraint suggestion are covered as accelerators—followed by governance and rollout workflows that keep rules maintainable as data and business semantics evolve.

A strong working knowledge of Spark and DataFrames is assumed. Coverage includes longitudinal quality via metrics repositories, regression detection, and alerting, plus advanced patterns for partitioned/incremental data, late arrivals, custom analyzers, and real-world version compatibility across

© 2026 NobleTrex Press (Ebook): 6610001179250

Fecha de lanzamiento

Ebook: 9 de marzo de 2026

Etiquetas

    Elige el plan:

    • Más de 650.000 títulos

    • Kids mode

    • Modo sin conexión

    • Cancela cuando quieras

    ¡Más popular!

    Unlimited

    Nada mejor que un audiolibro para estos días.

    8.99 € /mes

    Ahorra 78%
    • Escucha y lee los títulos que quieras

    • Modo sin conexión + Kids Mode

    • Cancela en cualquier momento

    Suscríbete ahora

    Family

    Para los que quieren compartir historias con su familia y amigos.

    Desde 15.99 € /mes

    • Escucha y lee los títulos que quieras

    • Modo sin conexión + Kids Mode

    • Cancela en cualquier momento

    Tú + 1 miembro de la familia2 cuentas

    15.99 € /mes

    Pruébalo ahora