Measuring LLMs with Jodie Burchell

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems they have. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.

Oppdag Premium

Prøv 14 dager gratis

Kjøp Premium

Episoder(1973)

.NET Sucess Stories Part 1

This is the first of a series of shows we are going to do this year highlighting .NET Success Stories; companies that have implemented .NET applications successfully as either pilot projects or production projects. In this show we hear from two different companiesSupport this podcast at — https://redcircle.com/net-rocks/donations

13 Jan 20031h 11min

Ken Getz

Ken talks about asynchronous calls, events, delegates, the next version of Microsoft Office, printing, C# vs VB.NET, being a control freak, Visual Studio.NET 2003, and training videos.Support this podcast at — https://redcircle.com/net-rocks/donations

23 Des 20021h 11min

Nickolas Landry

Carl and Mark talk with Nick about Mobile computing, and how .NET fully supports mobile devices. A surprise phone call from Tim Huckaby forces us to extend this lousy show for an extra half-hour, but you'll be glad you stuck it out to hear his Bill Gates story.Support this podcast at — https://redcircle.com/net-rocks/donations

16 Des 20021h 40min

Scott Stanfield

Scott talks about the human side of software development, the .NET Pet Shop wars; a now famous head-to-head battle between .NET and Java, and talks about his company, Vertigo Software, and the work that they do with .NET including the Pet Shop application.Support this podcast at — https://redcircle.com/net-rocks/donations

9 Des 20021h 2min

Chris Sells

Chris talks with Test with Carl and Mark about COM and .NET components, finalizers, disposing, Smart Client Windows Forms Applications (we still don't know what to call these things), how to navigate sellsbrothers.com, interview weed-out questions, and why he calls it Sells Brothers. Also, Chris finally clears up the age-old question of why C++ programmers feel so superior to mere humans. Chris' main web page is www.sellsbrothers.com. There you can subscribe to his many lists and take advantage of all the great free information and code he shares with the community. Chris writes regularly for MSDN Magazine on advanced topics. Trust us, you will save yourself hours of guesswork by taking an hour and a half of your day to listen to this interview. This show was actually recorded in mid-November, and its still fresh!Support this podcast at — https://redcircle.com/net-rocks/donations

2 Des 20021h 32min

Dev Connections

.NET Rocks! went on the road to VS.NET Connections, an awesome .NET developer conference in Orlando, FL that took place October 27-30, 2002. We have two hours of interviews with speakers and attendees, which make for a really enjoyable and informative show. Support this podcast at — https://redcircle.com/net-rocks/donations

28 Okt 200255min

Bill Vaughn

Carl and Mark talk to bill about ADO.NET, DataAdapters, Yukon, Concurrency, JET (Bill Hates Jet), MSDE, DTS, and other critical stuff. This is classic Vaughn!This is also the genesis of the messy hair joke that permeates our shows and videos.Support this podcast at — https://redcircle.com/net-rocks/donations

14 Okt 20021h 10min

Mark Anders

Carl and Mark talk with Mark Anders about ASP.NET, Framework v1.1, Languages, IIS 6.0, and other great topics. This week we had some celebrity callers: Chris Sells and MSDN Regional Director Stephen Forte ring the show, making for some great tech talk.Support this podcast at — https://redcircle.com/net-rocks/donations

7 Okt 20021h 7min

Premium

99 kr/ måned

Tilgang til alle våre Premium-podkaster
Alle podkaster fra VG, Aftenposten, BT og SA
Reklamefritt Premium-innhold
Ingen bindingstid. Avslutt når du ønsker

Prøv 14 dager gratis

Premium

129 kr/ måned

Tilgang til alle Premium-podkaster
Alle podkaster fra VG, Aftenposten, BT og SA
Reklamefritt Premium-innhold
Ingen bindingstid. Avslutt når du ønsker
En Ekstra bruker

Prøv 14 dager gratis

Measuring LLMs with Jodie Burchell

Oppdag Premium

Episoder(1973)

.NET Sucess Stories Part 1

Ken Getz

Nickolas Landry

Scott Stanfield

Chris Sells

Dev Connections

Bill Vaughn

Mark Anders

Reklamefrie Premium-podkaster

Skap din egen podkastboble

Prøv 14 dager gratis

Premium

Premium

Populært innen Teknologi

Historiene og stemmene du vil høre