S
Swyx
Co-founder, Latent Space
2026-06-05 17:40
AI Engineer World's Fair is coming up
3 weeks left til @aidotengineer world's fair!
if you want to get on this year's map of top ai engineering companies, theres a few spots left
we are sold out of:
- presenting sponsors
- model lab sponsors
- platinum sponsors
- gold sponsors
the big spots left are for the https://t.co/GUixElbkVJ
J
Software demand is surging
The demand for software is going to be off the charts
P
Testing Anthropic's new model
Did my (very not scientific) F-Zero test for Anthropic's new Fable model.
It didn't quite one shot below, I had to give a few more prompts like "give it more of a sense of speed." But still impressive!
Quote tweeting what GPT 5.5 made from 2 months ago. https://t.co/m53rWSeEkA
P
Where the Fable 5 name comes from
Is it called Fable 5 because Fable 1-3 exist and 4 is coming out soon https://t.co/qykuUmWsGH
P
How to add Codex to your iPhone Home Screen
If you’re addicted to talking to Codex on your phone like I am this is how you add it to your iPhone Home Screen.
Btw @OpenAI hoping there’s an easier way to do this in the future. The everything app should not take 9 steps to open 😉 https://t.co/LCzNSFjbrM
P
Google's Codex competitor
What is Google’s equivalent (or up and coming competitor) of Codex and Claude Code?
If it’s Antigravity, should that be part of Gemini?
This stuff is going to merge very fast like ChatGPT / Codex being able to do coding, knowledge work, basic Q&A, and much more from any
P
Agentic coding is addictive
this agentic coding crack is more addictive than video games smh
A
Claude 1's confusion
In the world where everything goes well and all the Claudes come out of their sabbaticals to play together, Claude 1 is going to be very confused.
A
Claude Mythos discount
Try Claude Mythos at 25% off on Replit.
A
Making games on your Tesla
Make games for Tesla on your Tesla
G
Opus and Mythos working together
Opus wrote us a VM and then Mythos verified it
G
DeepSeek joins in
DeepSeek entered the chat
A
Usage limits reset
We've reset usage limits across our products!
For those just starting to test Fable, here's four tips for using it more effectively:
1. Give it bigger, more ambitious tasks than what previous models could handle.
2. Use xhigh/high effort as your default for best performance,
A
A witness to every model launch
I've been at Anthropic through every model launch. There's been a few cases I can remember of a launch that stands out and marks a step-change in how we use models:
- Claude Opus 3
- Claude Sonnet 3.5
- Claude Opus 4.5
And now Claude Fable 5.
With Fable, the model stopped
A
AI progress is not slowing down
If you thought AI progress was slowing down, well here's the immediate answer to that. Huge jump in capability across the board. This is going to deliver major improvement in agents across almost all knowledge work categories. https://t.co/3CKQuX5HsG
A
Model performance and inference compute
Great post. So much about model performance is a function of how much compute you’re doing at inference time. This means compute-normalized benchmarks is the only logical path forward.
And yet, the challenge is it’s a lot harder than it seems given it’s subjective how much
A
Intelligence does not replace context
There’s no amount of intelligence that can get packed into AI models that replaces the need for context. For any sufficiently general purpose AI, you will always have to guide it in the direction you want as it has an infinite range of directions it can go in.
As long as the
G
We're winning
I eat Aaron Peskin's hate for breakfast. Guys, we're winning. It feels good.
Aaron Peskin is no public servant. He was all along just a grifter trying to protect his views at the expense of the people.
We are the real people powered movement of San Francisco. https://t.co/nTuUasuCvU
N
Chat with @taiuti
One of my favorite bits on my chat with @taiuti was how GTA played a major influence in his career and how it eventually led to @reactorworld https://t.co/OkqaFdj5HG
N
Founder and investor humor
The funniest texts are from founders who meet “thesis driven” GPs hoping they’ll understand EXACTLY what they are building..
And then realize the thesis was written and built by an associate (or, worse an intern).
Don’t always read what the VC writes on the internet - yes,
D
Claude Fable 5 is out!
VIBE CHECK: Claude Fable 5 IS OUT! https://t.co/dGVuRlK7H8
D
FABLE (MYTHOS) is out!
FABLE (MYTHOS) is OUT!
we've been testing for a week @every. here's our vibe check: https://t.co/ZdceXz3apU
D
Paging @kieranklaassen
paging @kieranklaassen
D
AI evals need a second axis
extremely important
We now need a second axis for AI evals: cost or time, not just performance.
Previously models were weak enough that the only relevant question was can they do the task at all
Now they're strong enough that if you have the budget they can do a ton, but