Collaboration & evaluation for LLM apps

Collaboration & evaluation for LLM apps

0 Recensioner
0
Episod
255 of 336
Längd
46min
Språk
Engelska
Format
Kategori
Fakta

Small changes in prompts can create large changes in the output behavior of generative AI models. Add to that the confusion around proper evaluation of LLM applications, and you have a recipe for confusion and frustration. Raza and the Humanloop team have been diving into these problems, and, in this episode, Raza helps us understand how non-technical prompt engineers can productively collaborate with technical software engineers while building AI-driven apps.

Join the discussion

Changelog++ members save 4 minutes on this episode because they made the ads disappear. Join today!

Sponsors:

Read Write Own • – Read, Write, Own: Building the Next Era of the Internet—a new book from entrepreneur and investor Chris Dixon—explores one possible solution to the internet’s authenticity problem: Blockchains. From AI that tracks its source material to generative programs that compensate—rather than cannibalize—creators. It’s a call to action for a more open, transparent, and democratic internet. One that opens the black box of AI, tracks the origins we see online, and much more. Order your copy of Read, Write, Own today at readwriteown.comChangelog News • – A podcast+newsletter combo that’s brief, entertaining & always on-point. Subscribe today • . Fly.io • – The home of Changelog.com • — Deploy your apps and databases close to your users. In minutes you can run your Ruby, Go, Node, Deno, Python, or Elixir app (and databases!) all over the world. No ops required. Learn more at fly.io/changelog • and check out the speedrun in their docs • .

Featuring:

• Raza Habib – LinkedIn • , X • Daniel Whitenack – Website • , GitHub • , X Show Notes:

Humanloop Something missing or broken? PRs welcome!


Lyssna när som helst, var som helst

Kliv in i en oändlig värld av stories

  • 1 miljon stories
  • Hundratals nya stories varje vecka
  • Få tillgång till exklusivt innehåll
  • Avsluta när du vill
Starta erbjudandet
SE - Details page - Device banner - 894x1036
Cover for Collaboration & evaluation for LLM apps

Andra podcasts som du kanske gillar...