Measuring LLMs with Jodie Burchell

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems they have. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.

Episodes (1955)

Bret Piatt and Josh Odom RackSpace Clouds

Bret Piatt and Josh Odom from Rackspace talk about their company's offerings for hosting in the cloud. Support this podcast at https://redcircle.com/net-rocks/donations

11 Feb 2010, 45min

Ron Jacobs on Azure AppFabric

Ron Jacobs talks to Carl and Richard about the Windows Azure platform AppFabric, which provides secure connectivity as a service to help developers bridge cloud, on-premises, and hosted deployments.

9 Feb 2010, 1h 5min

Walling and Taber on Micropreneur Academy.

Rob Walling and Mike Taber talk about the Micropreneur Academy, a web-based community with the mission to assist one-man tech companies in reaching their goals. Membership is not free, but it's not expensive either.

4 Feb 2010, 47min

Kent Brown and Ed Pinto on WCF 4.0

Richard and Carl talk to Kent Brown and Ed Pinto about the new features of WCF (Windows Communication Foundation) 4.0, which will be released April 12, 2010.

2 Feb 2010, 1h

Catching up with Juval Löwy

Juval is back to talk to Carl and Richard. They get to the bottom of his "Every Object Should be a WCF Service" argument, and get his insights into the current state of .NET development.

28 Jan 2010, 1h 1min

oData

Carl and Richard get the word on oData from Brad Abrams, Bob Dimpsey and Lance Olson.

26 Jan 2010, 53min

My .NET Story

Carl and Richard pick the winner for the My .NET Story contest held by Microsoft. Contestants submitted their projects and were judged by a panel of experts. Includes short interviews with the finalists.

21 Jan 2010, 52min

Jason Olson Digs into the CLR 4.0

What's new in the CLR? Jason Olson goes through some of the most important new features.

19 Jan 2010, 1h 2min
