Inferact: Building the Infrastructure That Runs Modern AI

Inferact is a new AI infrastructure company founded by the creators and core maintainers of vLLM. Its mission is to build a universal, open-source inference layer that makes large AI models faster, cheaper, and more reliable to run across any hardware, model architecture, or deployment environment. This episode breaks down how modern AI models are actually run in production, why “inference” has quietly become one of the hardest problems in AI infrastructure, and how the open-source project vLLM emerged to solve it. The conversation also covers why the vLLM team started Inferact and their vision for a universal inference layer that can run any model, on any chip, efficiently.

Follow Matt Bornstein on X: https://twitter.com/BornsteinMatt

Follow Simon Mo on X: https://twitter.com/simon_mo_

Follow Woosuk Kwon on X: https://twitter.com/woosuk_k

Follow vLLM on X: https://twitter.com/vllm_project

Stay Updated:

Find a16z on YouTube

Find a16z on X

Find a16z on LinkedIn

Listen to the a16z Show on Spotify

Listen to the a16z Show on Apple Podcasts

Follow our host: https://twitter.com/eriktorenberg

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.


Episodes (1000)

The $3 Trillion AI Coding Opportunity

Originally published on the a16z Infra podcast. We're resurfacing it here for our main feed audience. AI coding is already actively changing how software gets built. a16z Infra Partners Yoko Li and Guid...

9 Dec 2025 (37 min)

The 80-Year Bet: Why Naveen Rao Is Rebuilding the Computer from Scratch

Naveen Rao is cofounder and CEO of Unconventional AI, an AI chip startup building analog computing systems designed specifically for intelligence. Previously, Naveen led AI at Databricks and founded t...

8 Dec 2025 (30 min)

What Comes After ChatGPT? The Mother of ImageNet Predicts The Future

Fei-Fei Li is a Stanford professor, co-director of the Stanford Institute for Human-Centered Artificial Intelligence, and co-founder of World Labs. She created ImageNet, the dataset that sparked the deep ...

5 Dec 2025 (1 h 1 min)

How AI Created the Fastest Product Cycle in History

Recently, a16z General Partner Anish Acharya joined Ollie Forsyth on NEW ECONOMIES. They talked about why consumer tech is surging again, how AI is enabling 100M-user products at unprecedented speed, ...

4 Dec 2025 (50 min)

Why AI Moats Still Matter (And How They've Changed)

a16z General Partners David Haber, Alex Rampell, and Erik Torenberg discuss why 19 out of 20 AI startups building the same thing will die, and why the survivor might charge $20,000 for what used to c...

3 Dec 2025 (51 min)

How To Lead | Ben Horowitz on My First Million

a16z co-founder Ben Horowitz joins Shaan Puri and Sam Parr on My First Million to talk about how to be a great leader. Resources: Follow Ben on X: https://x.com/bhorowitz Follow Shaan on X: https://x.co...

2 Dec 2025 (1 h 10 min)

The $700 Billion AI Productivity Problem No One's Talking About

Russ Fradin sold his first company for $300M. He’s back in the arena with Larridin, helping companies measure just how successful their AI actually is. In this episode, Russ sits down with a16z General...

1 Dec 2025 (58 min)

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

In this episode, a16z GP Martin Casado sits down with Sherwin Wu, Head of Engineering for the OpenAI Platform, to break down how OpenAI organizes its platform across models, pricing, and infrastructur...

28 Nov 2025 (53 min)
