Measuring LLMs with Jodie Burchell
.NET Rocks! (3 Apr 2025)

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems those benchmarks have. A broader conversation about quality digs into the idea that an LLM should be targeted to the particular topic area it is being used for; often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.

Episodes (1994)

Episode 2000!

Recorded live at the Tavern Hall in Bellevue during the Party with Palermo for the MVP Summit, it's episode 2000! Carl and Richard take questions from the audience and play clips from past guests and ...

30 Apr 1h 38min

How We Beat the Y2K Bug

The Y2K bug turned out to be a non-event on January 1, 2000. How did that happen? Carl and Richard bring together a number of stories from folks who were there, fixing the software and updating system...

23 Apr 48min

How AI Changes Development with Rob Conery

How are LLMs changing software development? Carl and Richard talk to Rob Conery about his experiences as a consultant bringing the new AI tools and techniques into companies. Rob talks about focusing ...

15 Apr 58min

Agentic RAG with Ed Charbeneau

How do you make your agents more knowledgeable about your company data? Carl and Richard talk to Ed Charbeneau about Progress Agentic RAG-as-a-Service, using NucliaDB as a vector data store to organiz...

8 Apr 1h 4min

ASP.NET Core in 2026 with Daniel Roth

ASP.NET Core continues to evolve in 2026! Carl and Richard talk to Daniel Roth about all the goodness in the ASP.NET Core space, including MVC, Razor, and Blazor! Daniel talks about the publicly visib...

2 Apr 1h

Coding for Security with Chris Ayers

What does secure coding look like today? Carl and Richard talk to Chris Ayers about the MITRE ATT&CK matrix, a comprehensive breakdown of the tactics, techniques, and procedures black hats use to expl...

25 Mar 52min

Building Software using Squad with Brady Gaster

Let the squad help you build your application! Carl and Richard talk to Brady Gaster about Squad, a tool for creating an AI development team using GitHub Copilot. Brady discusses creating specialist a...

19 Mar 56min

Avalonia 12 with Mike James & Matt Lacey

Avalonia continues to evolve! Carl and Richard talk to Avalonia CEO Mike James & Matt Lacey about the latest version of Avalonia, the open source UI framework for building cross-platform applications ...

12 Mar 58min
