Mastering Apache Spark 2.x - Second Edition

Language
English
Format
Category

Non-fiction

Advanced analytics on your Big Data with the latest Apache Spark 2.x

About This Book

• An advanced guide that combines instructions and practical examples to extend the most up-to-date Spark functionalities.

• Extend your data processing capabilities to process huge chunks of data in minimal time using advanced concepts in Spark.

• Master the art of real-time processing with the help of Apache Spark 2.x

Who This Book Is For

If you are a developer with some experience with Spark and want to strengthen your knowledge of how to get around in the world of Spark, then this book is ideal for you. Basic knowledge of Linux, Hadoop and Spark is assumed. Reasonable knowledge of Scala is expected.

What You Will Learn

• Examine advanced Machine Learning and Deep Learning with MLlib, SparkML, SystemML, H2O and Deeplearning4j

• Study highly optimised unified batch and real-time data processing using SparkSQL and Structured Streaming

• Evaluate large-scale Graph Processing and Analysis using GraphX and GraphFrames

• Apply Apache Spark in Elastic deployments using Jupyter and Zeppelin Notebooks, Docker, Kubernetes and the IBM Cloud

• Understand internal details of the cost-based optimizers used in Catalyst, SystemML and GraphFrames

• Learn how specific parameter settings affect overall performance of an Apache Spark cluster

• Leverage Scala, R and Python for your data science projects

In Detail

Apache Spark is an in-memory cluster-based parallel processing system that provides a wide range of functionalities such as graph processing, machine learning, stream processing, and SQL. This book aims to take your knowledge of Spark to the next level by teaching you how to expand Spark's functionality and implement your data flows and machine/deep learning programs on top of the platform.

The book commences with an overview of the Spark ecosystem. It will introduce you to Project Tungsten and Catalyst, two of the major advancements of Apache Spark 2.x.

You will understand how memory management and binary processing, cache-aware computation, and code generation are used to speed things up dramatically. The book then shows how to incorporate H2O, SystemML, and Deeplearning4j for machine learning, and Jupyter Notebooks and Kubernetes/Docker for cloud-based Spark. During the course of the book, you will learn about the latest enhancements to Apache Spark 2.x, such as interactive querying of live data and unifying DataFrames and Datasets.

You will also learn about the updates on the APIs and how DataFrames and Datasets affect SQL, machine learning, graph processing, and streaming. You will learn to use Spark as a big data operating system, understand how to implement advanced analytics on the new APIs, and explore how easy it is to use Spark in day-to-day tasks.

Style and approach

This book is an extensive guide to Apache Spark modules and tools. It uses worked examples to show how Spark's functionality can be extended for real-time processing and storage.

© 2017 Packt Publishing (Book): 9781785285226

Release date

Book: 26 July 2017
