Listen and read

Step into an infinite world of stories

  • Listen and read as much as you want
  • Over 400 000+ titles
  • Bestsellers in 10+ Indian languages
  • Exclusive titles + Storytel Originals
  • Easy to cancel anytime
Details page - Device banner - 894x1036
Cover for Data Lake Development with Big Data: Explore architectural approaches to building Data Lakes that ingest, index, manage, and analyze massive amounts of data using Big Data technologies

Data Lake Development with Big Data: Explore architectural approaches to building Data Lakes that ingest, index, manage, and analyze massive amounts of data using Big Data technologies

Language
English
Format
Category

Non-Fiction

A Data Lake is a highly scalable platform for storing huge volumes of multistructured data from disparate sources with centralized data management services. This book explores the potential of Data Lakes and explores architectural approaches to building data lakes that ingest, index, manage, and analyze massive amounts of data using batch and real-time processing frameworks. It guides you on how to go about building a Data Lake that is managed by Hadoop and accessed as required by other Big Data applications. This book will guide readers (using best practices) in developing Data Lake's capabilities. It will focus on architect data governance, security, data quality, data lineage tracking, metadata management, and semantic data tagging. By the end of this book, you will have a good understanding of building a Data Lake for Big Data.

© 2015 Packt Publishing (Ebook): 9781785881664

Release date

Ebook: 26 November 2015