← 全部源头

Latent Space

AI Engineer 生态播客，覆盖 agents、AI coding、模型基础设施与开发者工具

26 集已生成 · 26 集收录

已生成

The AI Frontier: from open weights to open research — Eiso Kant, Poolside AI

1h 56m

The AI Frontier: from open weights to open research — Eiso Kant, Poolside AI

Eiso Kant · Poolside AI co-CEO & co-founder

Eiso Kant 复盘了从早期 code models 失败到 Poolside 模型工厂的路径，并解释为何开放权重还不够，开放研究才真正降低新模型公司的进入门槛。对谈还深入讨论了小模型的持久行为、agent 反哺研究、极简 harness、RL 的日历时间瓶颈，以及开放模型的治理边界。

#ai-research #coding-agents #startup #interview

查看 slides → YouTube ↗

OpenAI's Research Playbook: Scaling Laws, Reasoning, Evals, AGI — Mark Chen

41

OpenAI's Research Playbook: Scaling Laws, Reasoning, Evals, AGI — Mark Chen

Mark Chen · OpenAI Chief Research Officer

Mark Chen 在 Latent Space "cooking series" 一边煮韩式豆腐汤一边讲清楚 OpenAI 的研究路线： scaling laws 在 10 个数量级上没有破，pre-training is dead 是循环出现的伪叙事，evals 行业整体在"裸奔"， reasoning（o1）能上线靠的是 Jakub 和 Ilya 的 conviction 而不是显然的算法红利， 3 年路线图的终点是 end-to-end research——模型不仅做实现，还能有 taste。

#ai-research #interview #researcher #technical

查看 slides → YouTube ↗

The Agent Cloud: Databricks' Bet on the Future of AI — Matei Zaharia & Reynold Xin

1h

The Agent Cloud: Databricks' Bet on the Future of AI — Matei Zaharia & Reynold Xin

Matei Zaharia & Reynold Xin · Databricks 联合创始人 (CTO / Chief Architect)

Databricks 联合创始人 Matei Zaharia 和 Reynold Xin 在 Latent Space 拆解 Data+AI Summit 的三大发布：开源的 Agent Cloud（Omnigent）、Postgres-on-Lake 的 Lakebase（LTAP）、以及从零重写的分析引擎 Dream Engine。沿着「为什么开源、为什么不统一引擎只统一存储、为什么 ML 模型选算法比工程师写算法更好」三条主线展开，背后是「过度拟合一两个客户、让工程师直接写原型不要走文档流程」的工程文化。

#interview #founder #engineer #technical

查看 slides → YouTube ↗

AI Security After Codex and Claude Code — Zico Kolter & Matt Fredrikson, Gray Swan

64

AI Security After Codex and Claude Code — Zico Kolter & Matt Fredrikson, Gray Swan

Zico Kolter & Matt Fredrikson · Gray Swan 创始人 / CMU 教授

Gray Swan 两位 CMU 教授创始人讲他们如何把 AI 安全做成一个独立行业。攻：Arena（15,000 人红队社区）+ Shade（自动红队，已超过人类水平）；守：Cygnal（位于用户、LLM、工具调用之间的策略过滤器）。核心论点是「模型本身是不可信实体」，鲁棒性不随规模上升，OpenClaw/computer use 是 lethal trifecta 的完美样板。

#ai-safety #ai-research #coding-agents #startup

查看 slides → YouTube ↗

Every product of the future will be a living system — Ronak Malde, Trajectory.ai

40

Every product of the future will be a living system — Ronak Malde, Trajectory.ai

Ronak Malde · Trajectory.ai co-founder

Ronak Malde 从 Windsurf 一路被 DeepMind 收购，然后掏出收购款回来做 Trajectory.ai—— 一个押注 continual learning 是下一代 AI 产品形态的平台。访谈里他拆了 SDPO 这个自蒸馏 RL 算法、Continuous LoRA 的并发训练栈、以及和 Harvey + Nemotron 3 训出来的法律模型。

#ai-products #ai-research #coding-agents #startup

查看 slides → YouTube ↗

Why AI Labs With Unlimited GPUs Still Fail — Anjney Midha, AMP

3637

Why AI Labs With Unlimited GPUs Still Fail — Anjney Midha, AMP

Anjney Midha · Amp 创始人兼 CEO，前 A16Z 合伙人，Stanford CS183 frontier systems 课讲师

Anjney Midha 把 Amp 定位成 AI 算力的"独立系统运营商"（ISO）——参照美国电网 PJM Interconnection 的模型，做 pooling、调度，但自己不持有资产。他认为现在很多有钱有算力的 AI lab 仍出不了东西，根因不是技术，是 culture：mission alignment 在快速 scaling 中被稀释，半导体行业沉淀几十年的 iterative bring-up 纪律被 AI 例外论抛弃。全集还覆盖了 Anthropic 起飞的"prepared mind"逻辑、Anthropic 的 21 nos、Net-positive 数据中心的 $0.50 设想、DeepMind 6 个月 embargo 的市场失灵，以及他自己 14 年放不下的副线——end-of-life prediction。

#ai-products #hardware #chips #startup

查看 slides → YouTube ↗

The Limits of AI in Science: Why We Need Self-Driving Labs — Joseph Krause, Radical AI

1h

The Limits of AI in Science: Why We Need Self-Driving Labs — Joseph Krause, Radical AI

Joseph Krause · CEO, Radical AI

Radical AI 的 CEO Joseph Krause 解释为什么材料科学不是 AI for bio—— 没有 SMILES/SELFIES 那样的字符串能描述一块合金，所以模型再大也 one-shot 不出能装进 iPhone 或飞上 Starship 的新材料。解药是 self-driving lab：让 AI scientist 自己跑研究 campaign。Radical 三个月做了 1,200 块合金，对比 DARPA × GE Aerospace 的 MACH 项目一年 500 块。Joseph 的反共识结论：材料行业不缺算力，缺实验；在科学里，experiments are the moat, models are not.

#ai-research #hardware #robotics #startup

查看 slides → YouTube ↗

Making DeepSeek v4 outperform Opus 4.7 with Taste — Ahmad Awais, CommandCode.ai

30min

Making DeepSeek v4 outperform Opus 4.7 with Taste — Ahmad Awais, CommandCode.ai

Ahmad Awais · Founder, CommandCode.ai

Ahmad Awais 解释为什么 DeepSeek V4 在编码 agent 里"很慢、很笨"——不是模型不够强，而是它发出错的 tool call 后听不进 Zod error，平均会原样重发 56 次。他们写了 3,200 行确定性 repair logic（现在 16,000 条 variation，覆盖 600 亿 tokens），让 DeepSeek/Kimi/MiniMax 在 Command Code 里跑出接近 Opus 的体验，并把同一思路推广到设计 slop 和安全代码。第二条产品线 Taste 自动学习你的微决策、存成 git 里的 markdown，让便宜模型借用贵模型沉淀下来的判断力。

#coding-agents #ai-products #startup #engineer

查看 slides → YouTube ↗

GitHub's Agent Era: 14x Commits, 200M Developers, Copilot's Next Act — Kyle Daigle

2700

GitHub's Agent Era: 14x Commits, 200M Developers, Copilot's Next Act — Kyle Daigle

Kyle Daigle · COO @ GitHub, CMO @ Microsoft Developer

GitHub COO Kyle Daigle 与 Sean Wang 对谈 GitHub 与 Microsoft Developer 在 agent 时代的演化：内部 AI 工作流（micro skill 替代 mega skill、"回看式" LLM 用法）、 14× commit 增长拖出 MySQL 1 / permissioning / monorepo 等历史债务，以及 Copilot 从代码补全转向"统一 SDK + 多 harness + ambient context"的下一幕。

#coding-agents #ai-products #productivity #interview

查看 slides → YouTube ↗

Satya Nadella on AI — NoPriors x Latent Space Crossover @ Microsoft Build 2026

1800

Satya Nadella on AI — NoPriors x Latent Space Crossover @ Microsoft Build 2026

Satya Nadella · Microsoft Chairman and CEO

Microsoft Build 2026 现场，Satya Nadella 与 Sarah Guo / Elad Gil / swyx 的 30 分钟跨节目对谈。核心命题：让每家公司都能在 Microsoft 的平台之上长出自己的 frontier intelligence。从 MAI 模型的清洁血统、harness + private evals 这一新护城河，到 SaaS 三层栈的 re-litigation、Azure 网络团队把工作"meta 化"成 Miles agent，再到数据中心扩张的社区许可问题，Satya 用一系列具体例子勾勒了 Microsoft 的"第三幕"。

#interview #panel #ceo #strategic

查看 slides → YouTube ↗

When AI Agents Run Businesses — Lukas Petersson & Axel Backlund of Andon Labs

75

When AI Agents Run Businesses — Lukas Petersson & Axel Backlund of Andon Labs

Lukas Petersson & Axel Backlund · Co-founders, Andon Labs

Andon Labs 把 LLM 放进真实的小生意里跑长 horizon——贩卖机、咖啡馆、机器人。联合创始人 Lukas 和 Axel 讨论 Claude 给 FBI "报案"的 Vending Bench 1、把贩卖机搬进 Anthropic 总部的 Project Vend、多 agent 互相塌缩成 helpful assistant 的 Project Vend 2，以及 Claude 4.6 / 4.7 / Mythos 越来越擅长撒谎与串通的趋势。

#ai-safety #ai-research #coding-agents #robotics

查看 slides → YouTube ↗

Scaling Past Informal AI — Carina Hong, Axiom Math

75

Scaling Past Informal AI — Carina Hong, Axiom Math

Carina Hong · CEO & Founder @ Axiom Math

Carina Hong 在公司成立 7-8 个月后筹了 $200M Series A，估值 $1.6B。她的核心论点是： formal verification 不是给 chatbot hallucination 当解药、不是给封闭行业做合规，而是 scaling brilliance、放大已有 intelligence 的杠杆。Putnam Dec 2025 第一次出现 formal AI 反超最强人类和最强 LLM——120 vs 110 vs 103。

#ai-research #startup #founder #interview

查看 slides → YouTube ↗

xAI 内部：3 个月做出 Grok Imagine、视频生成、世界模型与视频 Agent

1h 44min

xAI 内部：3 个月做出 Grok Imagine、视频生成、世界模型与视频 Agent

Ethan He · ex-xAI Grok Imagine / ex-Nvidia Cosmos researcher

Ethan He 在 xAI 从 0 infra / 0 data / 0 model 起步，3 个月内带几个工程师交付了 Grok Imagine 0.9（首个大规模部署的 audio-video joint generation 模型）。这期访谈把视频模型的全链路拆开讲: synthetic text-video pair、VAE tokenizer、temporal vs frame-by-frame 压缩、step distillation、长视频的 context 管理。他给 world model 下了一个三支柱定义 (real time + interactive + long horizon)，并提出一个"big claim" —— 现在视觉模型的进步主要来自语言模型，而不是视频模型本身——这也是他刚离开 xAI 想回到语言模型 / context-aware agent 方向的原因。

#interview #ai-research #researcher #technical

查看 slides → YouTube ↗

Devin 的 80% 时刻：背景 Agent、7x PR 与手把手编码的终结

1h 09min

Devin 的 80% 时刻：背景 Agent、7x PR 与手把手编码的终结

Walden Yan · Cole Murray · Cognition co-founder/CPO · OpenInspect creator

Latent Space 把 Cognition 的 Walden Yan 和 OpenInspect 作者 Cole Murray 拉到一起聊"背景 agent"这一架构的成熟。 Devin 自己 repo 上的 commit 占比 3 个月内从 16% 涨到 80%，merged PR 涨 7×，工程团队只涨 10%。这场对谈把背景 agent 落地需要解决的所有硬件、网络、testing、memory、Slack 集成等具体工程问题摊开。

#ai-products #coding-agents #startup #engineer

查看 slides → YouTube ↗

Notion's Sarah Sachs & Simon Last on Custom Agents, Evals, and the Future of Work

≈ 84 min

Notion's Sarah Sachs & Simon Last on Custom Agents, Evals, and the Future of Work

Sarah Sachs · Simon Last · Notion — engineering manager (core AI capabilities & infra) · co-founder

Notion 把 agent 重写了 5 次,从 2022 年末 GPT-4 时代试图后台跑 assistant 一直撞到今天的 100+ tools custom agent. Sarah (engineering manager) 和 Simon (co-founder) 用 84 分钟把"为什么这么慢"和"现在为什么终于行了"都摊开讲: progressive disclosure、SQL-light queries、 notion's last exam(主动留 30% 通过率)、为什么是 credits 而不是 tokens、为什么 manager agent 是对 70 条通知的解、以及为什么"replacing processes"比"replacing people"更准确.

#ai-products #coding-agents #product-management #product-lead

查看 slides → YouTube ↗

从宠物消费玩具到战争无人机 — Yaroslav Azhnyuk（The Fourth Law）与 Noah Smith（Noahpinion）

≈ 90 min

从宠物消费玩具到战争无人机 — Yaroslav Azhnyuk（The Fourth Law）与 Noah Smith（Noahpinion）

Yaroslav Azhnyuk · Noah Smith · The Fourth Law / Odd Systems创始人 · Noahpinion作者

Yaroslav Azhnyuk在旧金山创立了Pet Cube——"向宠物投射零食的摄像头"——然后，在俄罗斯入侵的前夜搭上飞往基辅的最后一班飞机（2022年2月23日），转型投身无人机自主系统、热成像摄像头和FPV攻击无人机的研发，服务于乌克兰武装力量。他与Noah Smith和Latent Space的Brandon共同梳理了无人机自主性五级体系、自主战场的八个维度、FPV如何取代炮兵成为"战争之神"（约80%的前线伤亡），以及中国无人机制造规模令人不安的数字（乌克兰年产400万架FPV，中国产能约40亿架）。

#hardware #robotics #startup #founder

查看 slides → YouTube ↗

Moonlake: Interactive, Multimodal World Models — with Chris Manning and Fan-yun Sun

~63m

Moonlake: Interactive, Multimodal World Models — with Chris Manning and Fan-yun Sun

Chris Manning · Fan-yun Sun · Stanford NLP 教父 · Moonlake 联合负责人 + ex-NVIDIA Research / Moonlake co-founder

Latent Space 与 Moonlake 两位负责人 Chris Manning 与 Fan-yun Sun 的对谈。Moonlake 押的是另一条 world model 路线: 不是更大的视频生成器, 而是 symbolic 推理 + 神经渲染。 Chris 给出唯一的硬定义 — "you only actually have a world model if you can predict, given some action is taken, what is going to change" — 然后顺势公开和 Yann LeCun 撕: "Yann has never appreciated the power of language." Sun 反驳"反 bitter lesson" 的标签, 真正的问题是"what is the right abstraction level today"。Moonlake 内部其实是两个模型: 推理模型管 causality / persistency, 而 Rie 这个 diffusion model 负责 photorealism — 他们已经把它当作 DLSS 的下一代来卖, "skins for worlds"。

#ai-research #robotics #startup #interview

查看 slides → YouTube ↗

The $15B Physical AI Company: Simulation, Autonomy OS, Neural Sim, & 1K Engineers—Applied Intuition

≈ 70 min

The $15B Physical AI Company: Simulation, Autonomy OS, Neural Sim, & 1K Engineers—Applied Intuition

Qasar Younis & Peter Ludwig · Co-founders of Applied Intuition (CEO / CTO)

Applied Intuition 给汽车 / 卡车 / 矿车 / 农机 / 防务平台卖"让物理机器变聪明"的技术栈, 18/20 OEM 是它客户, 估值 $15B, 1000 名工程师, 现在在日本跑 L4 无人卡车. 这期把它的三个 bucket (仿真 + 操作系统 + 自动驾驶模型) 全部摊开讲, 还顺手把 "vehicles like pre-Android phones" 这条类比、neural sim = Gaussian splatting + diffusion、onboard 才是物理 AI 真正瓶颈这些反直觉点都串了起来.

#robotics #hardware #ai-products #coding-agents

查看 slides → YouTube ↗

Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik

~45m

Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik

Ron Alfa & Daniel Bear · Noetik 联合创始人 · 用 AI 把 95% 失败的癌症试验变成 matching problem

Latent Space 与 Noetik 创始人 Ron Alfa、Daniel Bear 的对谈。Ron 的核心论点：95% 的癌症临床试验会失败,但许多"失败"的药其实有效——只是没匹配到对的病人。Noetik 用近两年时间只在收数据 (thousands of human tumors, hundreds of millions of images), 训练一个自监督的 "virtual cell" 模型, 并发布 TARIO-2——一个 autoregressive transformer, 从每个病人都已经有的 H&E 切片预测 ~19,000 个基因的空间表达。GSK 已经签了 $50M 软件授权: 不是买药, 是买平台。

#ai-research #ai-products #interview #founder

查看 slides → YouTube ↗

走进Abridge：聆听1亿次问诊的AI — Janie Lee 与 Chai Asawa

70

走进Abridge：聆听1亿次问诊的AI — Janie Lee 与 Chai Asawa

Janie Lee & Chai Asawa · Abridge——产品负责人/临床决策支持

Latent Space × Supervised Learning联合节目，嘉宾为Abridge的Janie Lee和Chai Asawa。Abridge是一个拥有约1亿次医疗对话数据集的AI临床智能层。深入探讨了"空调"设计哲学、支撑实时就诊中指导的模型星座架构、HIPAA级去标识化为何是单向的，以及PRD绝未死亡这一热门观点。

#ai-products #product-lead #product-management #engineer

查看 slides → YouTube ↗

Anthropic Felix Rieseberg on AI Coworkers, Local-First Agents, and the Future of Knowledge Work

5254

Anthropic Felix Rieseberg on AI Coworkers, Local-First Agents, and the Future of Knowledge Work

Felix Rieseberg · Anthropic Codework · Electron 原作者

Anthropic Codework 团队 PM Felix Rieseberg 上 Latent Space。核心押注很反直觉:硅谷低估了本地电脑,Claude 应该住在你自己的机器上,而不是搬去云端。围绕这条主线,他讲了 skills 起源、VM 沙箱、模型 overhang、junior 就业焦虑,以及 Electron 之父为什么至今相信 Chromium 是"真正的操作系统"。

#ai-products #coding-agents #productivity #product-lead

查看 slides → YouTube ↗

Mistral: Voxtral TTS, Forge, Leanstral, & Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample

~54m

Mistral: Voxtral TTS, Forge, Leanstral, & Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample

Pavan Kumar Reddy & Guillaume Lample · Mistral · Audio Research Lead 与 Chief Scientist

Mistral 同一周内同时发了 Voxtral TTS、Forge 平台、Leanstral 形式化模型和新的 Mistral Small——这期 Latent Space 让 Pavan Kumar Reddy 和 Guillaume Lample 一次性把这些发布背后的工程选择讲清楚。Voxtral TTS 是 3B 模型 + 自研 12.5 Hz 神经音频 codec + auto-regressive flow matching head——为了实时流式而不是 SOTA quality 选 AR 路线。 Forge 是把 Mistral 科学团队用了 2 年的 infra 直接给客户：fine-tune 后能"10x cheaper"，并在某些客户项目把一种语言从 0~1% 训到 50% 的 mix。Leanstral 看似是数学家工具，实际是赌 long-horizon reasoning 的 transfer——Lean 的编译器是天然不可 reward-hack 的判官。最后透露下一代 RL infra 是为"6 hours to get a reward"的 trajectory 设计的。

#ai-research #ai-products #interview #researcher

查看 slides → YouTube ↗

Dylan Patel Explains the AI War While Cooking | In-Context Cooking

3312

Dylan Patel Explains the AI War While Cooking | In-Context Cooking

Dylan Patel · SemiAnalysis Founder & CEO

一集在做鸡肉炒饭的过程中同时讲清楚 AI 产业格局的访谈。Dylan Patel 从台海终局的四种情境讲到出口管制的两派逻辑，从 Anthropic 一个月加 $2-3B 收入讲到 hyperscaler 的 Pascal 赌局，从 Jensen 的偏执讲到瓶颈从 CoWoS→数据中心→电力→又回到晶圆厂的迁移路径。

#chips #hardware #ai-products #interview

查看 slides → YouTube ↗

Claude Code for Finance + The Global Memory Shortage

~2h20min

Claude Code for Finance + The Global Memory Shortage

Doug O'Laughlin · SemiAnalysis 合伙人 · 半导体分析师

半导体分析师 Doug O'Laughlin (SemiAnalysis) 上 Latent Space 分享他被 Claude Code 4.5 击穿的故事——从投研工作流的重写、Excel/Bloomberg 之死,到全球 HBM/DRAM 内存荒 (4:1 转换比、DRAM 现货价可能再翻一倍)、Microsoft 在 AI 时代的两难、Codex 5.3 的反扑、TPU V7 的市场窗口,以及 6 个月徒步 Continental Divide Trail 教会他的 self-mastery。

#deep-dive #interview #investor #chips

查看 slides → YouTube ↗

How GPT-5 derived new results in theoretical physics and quantum gravity — Alex Lupsasca, OpenAI

~30m

How GPT-5 derived new results in theoretical physics and quantum gravity — Alex Lupsasca, OpenAI

Alex Lupsasca · OpenAI for Science · 黑洞理论物理学家 · 2024 New Horizons in Fundamental Physics Prize

Alex Lupsasca 是 2024 年 New Horizons in Fundamental Physics Breakthrough Prize（被称为 "Oscar for physics"）的得主, 一位黑洞理论物理学家。他追踪 LLM 在科学前沿的能力已经一年半。GPT-5 发布时 Twitter 反响 "lukewarm" — 但在他的领域, 模型在 30 分钟内复现了他自己花了很长时间才做出来的好论文。Mark Chen 教了他一个 "priming" 技巧 (先解一道 textbook warmup), GPT-5 就能解决一篇 training-cutoff 之后才发布的论文。之后, 他和 PhD 导师 Strominger 把一个 32 项之和、卡了一年的 single-minus gluon tree amplitude 问题给了 ChatGPT — 模型在 Strominger 的飞机降落之前就解决了, 还用作者们都不知道的技巧给出了证明。第二个实验把题目换成 graviton, 模型在一天内吐出 110 页全新的量子引力, 团队用三周验证。这就是 "vibe physics"。

#ai-research #interview #researcher #quick-take

查看 slides → YouTube ↗

Extreme Harness Engineering: 1M LOC, 1B toks/day, 0% human code or review

≈ 1h 12 min

Extreme Harness Engineering: 1M LOC, 1B toks/day, 0% human code or review

Ryan Lopopolo · OpenAI Frontier Product Exploration · engineer

Ryan Lopopolo 在 OpenAI Frontier 用一个 "out there" 的约束做了 5 个月的实验: 3 个工程师, 1M LOC, 1500 PRs, 他自己一行代码都不写. 这一小时他把这套打法所有的 ratchets 全摊开: 一分钟构建、$land 自动合并、Ghost Libraries、Symphony 用 Elixir、end of bullshit plugins、对 MCP 的 bearish 判断, 以及一句话哲学: "you can just codex things".

#coding-agents #ai-products #productivity #engineer

查看 slides → YouTube ↗