Generative models: exploration to deployment

Generative models: exploration to deployment

0 Calificaciones
0
Episodio
242 of 336
Duración
49min
Idioma
Inglés
Formato
Categoría
No ficción

What is the model lifecycle like for experimenting with and then deploying generative AI models? Although there are some similarities, this lifecycle differs somewhat from previous data science practices in that models are typically not trained from scratch (or even fine-tuned). Chris and Daniel give a high level overview in this effort and discuss model optimization and serving.

Join the discussion

Changelog++ members save 2 minutes on this episode because they made the ads disappear. Join today!

Sponsors:

Neo4j • – NODES 2023 is coming in October! Fastly • – Our bandwidth partner. • Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.comFly.io • – The home of Changelog.com • — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog • and check out the speedrun in their docs • .

Featuring:

• Chris Benson – Website • , GitHub • , LinkedIn • , X • Daniel Whitenack – Website • , GitHub • , X Show Notes:

BigDLArticle: Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRAPrevious episode: Running large models on CPUsBaseten’s TrussSeldonHugging Face’s TGIIntel Gaudi 2Intel TDX Something missing or broken? PRs welcome!


Escucha y lee

Descubre un mundo infinito de historias

  • Lee y escucha todo lo que quieras
  • Más de 1 millón de títulos
  • Títulos exclusivos + Storytel Originals
  • Precio regular: CLP 7,990 al mes
  • Cancela cuando quieras
Suscríbete ahora
Copy of Device Banner Block 894x1036 3
Cover for Generative models: exploration to deployment

Otros podcasts que te pueden gustar...