Measuring LLMs with Jodie Burchell
.NET Rocks!3 Huhti

Measuring LLMs with Jodie Burchell

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems they have. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.

Jaksot(1953)

Scott Ambler on Agile

Scott Ambler on Agile

Agile Methodology is a wide topic, and this discussion is quite unlike the others that we've had. Scott Ambler brings his experiences from his career at IBM and elsewhere to this lively discussion. Richard was particularly interested in his ideas on database refactoring.Support this podcast at — https://redcircle.com/net-rocks/donations

16 Tammi 20071h 8min

Mark Dunn and Mark Berry on Biztalk 2006

Mark Dunn and Mark Berry on Biztalk 2006

Mark Berry and Mark Dunn from DUNN Training join us this week to dig a little deeper into BizTalk Server 2006. In this gripping sequel to Mark's earlier BizTalk shows, we delved much deeper into the inner workings of BizTalk than we've ever gone before. These guys know their stuff and you can bet this show will be educational and even a wee bit entertaining.Support this podcast at — https://redcircle.com/net-rocks/donations

9 Tammi 20071h

Scott Guthrie Looks Ahead

Scott Guthrie Looks Ahead

ASP.NET Co-creator Scott Guthrie talks about the present state and the future of ASP.NET, and also touches on WPFE, the Expression suite, and Windows Vista.Support this podcast at — https://redcircle.com/net-rocks/donations

2 Tammi 20071h 7min

Live from the Canadian Vista Launch Events!

Live from the Canadian Vista Launch Events!

Carl and Richard were the emcees at the Toronto, Montreal, and Ottawa launch events in December, 2006. They were hired to generate excitement and give away swag! What a gig! Along the way they got to talk to some of the attendees and locals in the community. Great stuff! Featuring the following guests: Shane Miskin, Scott Howlett, Allan Vander-spek, Mohammad Akif, Robert Achmann, Jean-Luc David, and Tony DavidsonSupport this podcast at — https://redcircle.com/net-rocks/donations

19 Joulu 200652min

Ted Neward on Interoperability

Ted Neward on Interoperability

Ted Neward discusses the present state and future of interoperability. Java and .NET compatability are disucssed, Ted touches on a wide range of topics ranging from XML's shortcomings as a messenger format to proprietary systems in .NET 3.0.Support this podcast at — https://redcircle.com/net-rocks/donations

12 Joulu 20061h 8min

Venkat Subramaniam and Andrew Hunt Talk Agile

Venkat Subramaniam and Andrew Hunt Talk Agile

Carl and Richard talk with Venkat Subramaniam, who has had a string of successful dnrTV episodes, and Andrew Hunt about Agile dos and don'ts. Agile Methodology can be overwhelming, and these guys introduce a much-needed dose of reality.Support this podcast at — https://redcircle.com/net-rocks/donations

5 Joulu 20061h 12min

TechEd Europe Interviews

TechEd Europe Interviews

Carl and Richard interview speakers and special guests at Tech Ed: Developer in Barcelona, Spain.Support this podcast at — https://redcircle.com/net-rocks/donations

28 Marras 20061h 17min

Bill Wagner and Diane Marsh on Non-MS Technology

Bill Wagner and Diane Marsh on Non-MS Technology

The discussion this week is on what .NET developers can learn by working with non-MS technology. Our guests' experience with technology includes .NET as well as non-Microsoft development technology.Support this podcast at — https://redcircle.com/net-rocks/donations

21 Marras 20061h 6min