2026 年 7 月 4 日 星期六
  • 登录
  • 注册
周天财经
广告
  • 首页
  • 24 小时
  • 世界
  • 商业
  • 基金
  • 期货
  • 股票
  • 行业新闻
  • 黄金
没有结果
查看所有结果
  • 首页
  • 24 小时
  • 世界
  • 商业
  • 基金
  • 期货
  • 股票
  • 行业新闻
  • 黄金
没有结果
查看所有结果
周天财经
没有结果
查看所有结果
首页 商业

Fei-Fei Li Lays Out Vision for Spatial Intelligence and Hybrid World Models

2025 年 11 月 26 日
在 商业
阅读时间: 3 mins read
阅读:831
A A


Stanford professor and World Labs founder Fei-Fei Li outlined an expansive vision for "spatial intelligence" and defended the role of explicit 3D world generation in future AI systems, in a wide-ranging interview released on Monday.

Related articles

AI算力爆发,电子玻纤布正成为算力硬件的供给短板

阿莫迪,你比张无忌还厉害

2026 年 7 月 4 日
出现「严重信用风险」,这家银行被监管实施接管

出现 「严重信用风险」,这家银行被监管实施接管

2026 年 7 月 4 日

The discussion spanned foundational debates over world models, the technical architecture behind her startup's first product, and the limits of today's physics-aware AI.

广告

Li, a leading figure in computer vision, argued that the next phase of artificial intelligence will be shaped less by language and more by machines' ability to perceive and reason about the physical world. Human cognition, she said, is fundamentally embodied and multimodal—a process that depends on vision, action, and interaction rather than text alone.

"Language captures only a subset of human knowledge," she said. "Much of what we know comes from interacting with the world, often without using language at all."

The remarks come as major AI labs push to build world models—systems that internalize 3D structure, physical dynamics and causal relationships. Li's approach diverges from that of deep-learning pioneer Yann LeCun, who has emphasized implicit, abstract representations that do not require models to explicitly generate scenes. She rejected the idea of a rivalry, saying the field ultimately needs both.

"We're intellectually on the same continuum," she said. "For a universal world model, implicit and explicit representations will both be indispensable."

World Labs' First Model Targets Explicit, Navigable 3D Worlds

Li's comments centered on Marble, World Labs' inaugural model, built on what her team calls a Real-Time Frame Model (RTFM). Unlike video-generation systems that output sequences of frames, Marble generates persistent, navigable 3D environments with object permanence and consistent geometry across viewpoints. It can take in text, images, video or rough spatial layouts and run in real time on a single Nvidia H100 GPU.

Maintaining internal coherence, Li said, required extensive engineering. "In early frame-based generation models, when you moved the camera, object consistency would collapse," she said. Marble's behavior remains largely statistical, not physics-driven: modern generative models, she noted, still imitate patterns in training data rather than compute formal forces.

"I don't think AI today is yet capable of abstracting the laws of physics," she said. "For Einstein-style abstraction, we haven't seen evidence that Transformers can do that." She nonetheless expects progress in physical reasoning within five years.

The Search for a 'Universal Task Function' in Vision

Li identified the absence of a unifying objective for spatial AI as a major research bottleneck. The success of language models was driven by next-token prediction, where training and inference are perfectly aligned. No equivalent exists for vision.

"Next-frame prediction is powerful, because the world has continuity," she said. "But it collapses a 3D world into 2D frames. And animals don't do perfect 3D reconstruction—yet they navigate extremely well."

A universal objective for spatial learning, she said, remains an open question.

Li's long-term vision is a "Neural Spatial Engine" that merges generative models with traditional physics engines used in game development. Physics engines compute collisions and rigid-body dynamics; generative models excel at producing rich content. She expects the two to converge.

"Ultimately, physics engines and world-generation models will merge," she said. "We're still at the beginning."

Such systems could make the creation of interactive 3D worlds inexpensive and accessible, enabling what she described as a "multiverse" of low-cost digital environments for education, entertainment, simulation, and scientific research.

Li said world models operating in robotics and other embodied settings must move beyond static training regimes. "Continuous learning is essential," she said, pointing to a future mix of context-based memory, online learning and algorithmic advances.

She emphasized that spatial intelligence is central to the broader quest for more general AI. "You can't put out a fire with language alone," she said. "A lot of human intelligence goes beyond symbols."

Li closed on a broadly optimistic note, predicting meaningful advances within the next half-decade, despite persistent uncertainty. "Some advances have surprised me by happening faster, and others slower," she said. "But five years is a reasonable timeframe."

更多精彩内容,关注钛媒体微信号 (ID:taimeiti),或者下载钛媒体 App

相关 文章

AI算力爆发,电子玻纤布正成为算力硬件的供给短板

阿莫迪,你比张无忌还厉害

来自 周天财经
2026 年 7 月 4 日
0

(本文作者为 字母 AI,钛媒体经授权发布...

出现「严重信用风险」,这家银行被监管实施接管

出现 「严重信用风险」,这家银行被监管实施接管

来自 周天财经
2026 年 7 月 4 日
0

(本文作者为 Barrons 巴伦,钛媒体...

可灵AI两部作品同时拿下戛纳金狮,视频大模型闯进商业创意主战场

可灵 AI 两部作品同时拿下戛纳金狮,视频大模型闯进商业创意主战场

来自 周天财经
2026 年 7 月 4 日
0

2026 年夏天的戛纳国际创意节,在节庆宫...

从比亚迪出来创业9年,他们准备要去港股敲钟了

从比亚迪出来创业 9 年,他们准备要去港股敲钟了

来自 周天财经
2026 年 7 月 3 日
0

(本文作者为 华夏能源网,钛媒体经授权发...

真正有价值的AI Agent,必须长在业务流程里

真正有价值的 AI Agent,必须长在业务流程里

来自 周天财经
2026 年 7 月 3 日
0

(本文作者为 OliverWyman 奥纬...

加载更多
广告
  • 热门
  • 评论
  • 最新
神马经典投研: 集资讯、策略、研报一站式期货投研工具

神马经典投研: 集资讯、策略、研报一站式期货投研工具

2025 年 11 月 7 日
「我们也深陷残酷价格战」,德资巨头中国区高管警告

「我们也深陷残酷价格战」,德资巨头中国区高管警告

2025 年 8 月 4 日
一周产业基金|上海市人工智能CVC基金发布;湖北百亿人形机器人母基金来了

一周产业基金|上海市人工智能 CVC 基金发布;湖北百亿人形机器人母基金来了

2025 年 8 月 4 日
「硬科技」指数携手上涨,半导体设备ETF易方达(159558)、芯片ETF易方达(516350)等产品助力布局板块龙头

基民懵了!这个火爆的板块年内涨超 37%,主力却借道 ETF 狂抛逾 400 亿元

2025 年 9 月 20 日
Lesson 1: Basics Of Photography With Natural Lighting

The Single Most Important Thing You Need To Know About Success

4
Lesson 1: Basics Of Photography With Natural Lighting

Lesson 1: Basics Of Photography With Natural Lighting

3
Lesson 1: Basics Of Photography With Natural Lighting

5 Ways Animals Will Help You Get More Business

2
Lesson 1: Basics Of Photography With Natural Lighting

New Cryptocurrency That Will Kill Of Bitcoin

2

包承超入职中信证券,任研究部策略大组长

2026 年 7 月 4 日

科技股持续调整,15 只相关主题基金单日跌逾 10%

2026 年 7 月 4 日

2026 版熊猫金币 30 克今天报价 (2026 年 06 月 23 日)

2026 年 7 月 4 日
货币市场日报:7月3日

货币市场日报:7 月 3 日

2026 年 7 月 4 日
  • 隐私政策
  • 联系我们
  • 关于周天
  • 登录
  • 注册
投诉建议:+86 13326565461

© 2025 广州小舟天传媒有限公司 by 周天财经 - 粤 ICP 备 2025452169 号-1

没有结果
查看所有结果
  • 首页
  • 24 小时
  • 世界
  • 商业
  • 基金
  • 期货
  • 股票
  • 行业新闻
  • 黄金

© 2025 广州小舟天传媒有限公司 by 周天财经 - 粤 ICP 备 2025452169 号-1

欢迎回来!

在下面登录您的帐户

忘记密码? 注册

创建新帐户!

填写以下表格进行注册

所有项目需要填写。 登录

重置您的密码

请输入您的用户名或电子邮件地址以重置密码。

登录

用户登录

还没有账号?立即注册

用户注册

已有账号?立即登录