Measuring LLMs with Jodie Burchell

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems they have. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.

Episodes (1973)

Paul Sheriff

Paul talks with Mark and Carl about ASP.NET performance tuning, tips, viewstate info, smart navigation, layout mode and more, and Paul shares some great insights and stories from the field. Lots of great tips and information here. Support this podcast at — https://redcircle.com/net-rocks/donations

14 Apr 2003, 1h 9min

Andrew Brust

Carl and Mark talk to Andrew about data binding, data adapters, connected and disconnected models, stored procedures, performance and encapsulation. Support this podcast at — https://redcircle.com/net-rocks/donations

31 Mar 2003, 1h 8min

Ethan Winer and Bob Zale

Carl and Mark talk with Ethan and Bob about the Good Old Days of the BASIC language, and some of their experiences early on in the first days of the industry, as well as PowerBASIC past, present, and future, crazy tech-support calls, and other stories. Support this podcast at — https://redcircle.com/net-rocks/donations

24 Mar 2003, 52min

Stephen Forte

Stephen talks about the International .NET Association (INETA - www.ineta.org), relates his .NET success stories, and talks about design patterns, COM Interop, Performance Anxiety, ASP.NET Forms Authentication, ViewState, Caching, and the DataGrid control. The DataGrid Girl (www.datagridgirl.com) calls and yaks with Carl, Mark, and Stephen about the ASP.NET DataGrid. Support this podcast at — https://redcircle.com/net-rocks/donations

17 Mar 2003, 1h 11min

Bill Vaughn (Again)

Bill talks with us again, picking up where our last conversation with him left off, talking about ADO.NET concurrency, SQL database design, dealing with Data Adapters, and a few other interesting tangents. Always good stuff with Mr. V! Support this podcast at — https://redcircle.com/net-rocks/donations

10 Mar 2003, 53min

Scott Hanselman

Mark Dunn is on temporary leave in Redmond, WA this week teaching a beta Architecture class. Scott and Carl chat about .NET, C#, Reflection, Regular Expressions, Freeware, Code Sharing, Config Files, Sockets, Multi-Threaded programming, and a laundry list of Scott's favorite utilities (shown below) that you just have to check out. We had a few comments that the shows were not loud enough. So, starting with this show the volume has been maximized. Support this podcast at — https://redcircle.com/net-rocks/donations

3 Mar 2003, 1h 7min

Alan Cooper

Alan has a lot to say about programming, programmers, and focuses intently on what's wrong with programming as we know it. Why do businesspeople fear programmers? Is the construction of software managed? These topics and more are the focus of this monumental episode of .NET Rocks! Alan tells the story of that meeting where he showed Visual Basic (then code-named Ruby) to Bill Gates and his people. Support this podcast at — https://redcircle.com/net-rocks/donations

27 Jan 2003, 1h 22min

Michèle Leroux Bustamante

Michele discusses the differences between programming in the Java space vs. .NET from her own first-hand experience. Support this podcast at — https://redcircle.com/net-rocks/donations

20 Jan 2003, 1h 10min
