
PySpark Cookbook: Over 60 recipes for implementing big data processing and analytics using Apache Spark and Python

Language
English
Format
Category

Nonfiction

Combine the power of Apache Spark and Python to build effective big data applications

Key Features

  • Perform effective data processing, machine learning, and analytics using PySpark
  • Overcome challenges in developing and deploying Spark solutions using Python
  • Explore recipes for efficiently combining Python and Apache Spark to process data

Book Description

Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. The PySpark Cookbook presents effective and time-saving recipes for leveraging the power of Python and putting it to use in the Spark ecosystem.

You’ll start by learning the Apache Spark architecture and how to set up a Python environment for Spark. You’ll then get familiar with the modules available in PySpark and start using them effortlessly. In addition to this, you’ll discover how to abstract data with RDDs and DataFrames, and understand the streaming capabilities of PySpark. You’ll then move on to using ML and MLlib in order to solve any problems related to the machine learning capabilities of PySpark and use GraphFrames to solve graph-processing problems. Finally, you will explore how to deploy your applications to the cloud using the spark-submit command.

By the end of this book, you will be able to use the Python API for Apache Spark to solve any problems associated with building data-intensive applications.

What you will learn

  • Configure a local instance of PySpark in a virtual environment
  • Install and configure Jupyter in local and multi-node environments
  • Create DataFrames from JSON and a dictionary using pyspark.sql
  • Explore regression and clustering models available in the ML module
  • Use DataFrames to transform data used for modeling
  • Connect to PubNub and perform aggregations on streams

Who this book is for

The PySpark Cookbook is for you if you are a Python developer looking for hands-on recipes for using the Apache Spark 2.x ecosystem in the best possible way. A thorough understanding of Python (and some familiarity with Spark) will help you get the best out of the book.

Denny Lee is a technology evangelist at Databricks. He is a hands-on data science engineer with 15+ years of experience. His key focus is solving complex large-scale data problems, providing not only architectural direction but also hands-on implementation of such systems. He has extensive experience building greenfield teams as well as being a turnaround/change catalyst. Prior to joining Databricks, he was a senior director of data science engineering at Concur and was part of the incubation team that built Hadoop on Windows and Azure (currently known as HDInsight).

Tomasz Drabas is a data scientist specializing in data mining, deep learning, machine learning, choice modeling, natural language processing, and operations research. He is the author of Learning PySpark and Practical Data Analysis Cookbook. He has a PhD from the University of New South Wales, School of Aviation. His research areas are machine learning and choice modeling for airline revenue management.

© 2018 Packt Publishing (Book): 9781788834254

Release date

Book: June 29, 2018

