#516: Accelerating Python Data Science at NVIDIA

Episode: 515 of 521
Duration: 1H 5min
Language: English
Category: Non-fiction

Python’s data stack is getting a serious GPU turbo boost. In this episode, Ben Zaitlen from NVIDIA joins us to unpack RAPIDS, the open source toolkit that lets pandas, scikit-learn, Spark, Polars, and even NetworkX execute on GPUs. We trace the project’s origin and why NVIDIA built it in the open, then dig into the pieces that matter in practice: cuDF for DataFrames, cuML for ML, cuGraph for graphs, cuXfilter for dashboards, and friends like cuSpatial and cuSignal. We talk real speedups, how the pandas accelerator works without a rewrite, and what becomes possible when jobs that used to take hours finish in minutes. You’ll hear strategies for datasets bigger than GPU memory, scaling out with Dask or Ray, Spark acceleration, and the growing role of vector search with cuVS for AI workloads. If you know the CPU tools, this is your on-ramp to the same APIs at GPU speed.

Episode sponsors

Posit

Talk Python Courses

Links from the show

RAPIDS: github.com/rapidsai

Example notebooks showing drop-in accelerators: github.com

Benjamin Zaitlen - LinkedIn: linkedin.com

RAPIDS Deployment Guide (Stable): docs.rapids.ai

RAPIDS cuDF API Docs (Stable): docs.rapids.ai

Asianometry YouTube Video: youtube.com

cuDF pandas Accelerator (Stable): docs.rapids.ai

Watch this episode on YouTube: youtube.com

Episode #516 deep-dive: talkpython.fm/516

Episode transcripts: talkpython.fm

Theme Song: Developer Rap

🥁 Served in a Flask 🎸: talkpython.fm/flasksong

---== Don't be a stranger ==---

YouTube: youtube.com/@talkpython

Bluesky: @talkpython.fm

Mastodon: @talkpython@fosstodon.org

X.com: @talkpython

Michael on Bluesky: @mkennedy.codes

Michael on Mastodon: @mkennedy@fosstodon.org

Michael on X.com: @mkennedy

