#516: Accelerating Python Data Science at NVIDIA

#516: Accelerating Python Data Science at NVIDIA

0 Calificaciones
0
Episodio
515 of 521
Duración
1H 5min
Idioma
Inglés
Formato
Categoría
No ficción

Python’s data stack is getting a serious GPU turbo boost. In this episode, Ben Zaitlen from NVIDIA joins us to unpack RAPIDS, the open source toolkit that lets pandas, scikit-learn, Spark, Polars, and even NetworkX execute on GPUs. We trace the project’s origin and why NVIDIA built it in the open, then dig into the pieces that matter in practice: cuDF for DataFrames, cuML for ML, cuGraph for graphs, cuXfilter for dashboards, and friends like cuSpatial and cuSignal. We talk real speedups, how the pandas accelerator works without a rewrite, and what becomes possible when jobs that used to take hours finish in minutes. You’ll hear strategies for datasets bigger than GPU memory, scaling out with Dask or Ray, Spark acceleration, and the growing role of vector search with cuVS for AI workloads. If you know the CPU tools, this is your on-ramp to the same APIs at GPU speed.

Episode sponsors

Posit

Talk Python Courses

Links from the show RAPIDS: github.com/rapidsai

Example notebooks showing drop-in accelerators: github.com

Benjamin Zaitlen - LinkedIn: linkedin.com

RAPIDS Deployment Guide (Stable): docs.rapids.ai

RAPIDS cuDF API Docs (Stable): docs.rapids.ai

Asianometry YouTube Video: youtube.com

cuDF pandas Accelerator (Stable): docs.rapids.ai

Watch this episode on YouTube: youtube.com

Episode #516 deep-dive: talkpython.fm/516

Episode transcripts: talkpython.fm

Theme Song: Developer Rap

🥁 Served in a Flask 🎸: talkpython.fm/flasksong

---== Don't be a stranger ==---

YouTube: youtube.com/@talkpython

Bluesky: @talkpython.fm

Mastodon: @talkpython@fosstodon.org

X.com: @talkpython

Michael on Bluesky: @mkennedy.codes

Michael on Mastodon: @mkennedy@fosstodon.org

Michael on X.com: @mkennedy


Escucha y lee

Descubre un mundo infinito de historias

  • Lee y escucha todo lo que quieras
  • Más de 900,000 títulos
  • Títulos exclusivos + Storytel Originals
  • 7 días de prueba gratis, luego $169 MXN al mes
  • Cancela cuando quieras
Suscríbete ahora
Copy of Device Banner Block 894x1036 3
Cover for #516: Accelerating Python Data Science at NVIDIA

Otros podcasts que te pueden gustar...