2026 年 4 月 11 日 星期六
  • 登录
  • 注册
周天财经
广告
  • 首页
  • 24 小时
  • 世界
  • 商业
  • 基金
  • 期货
  • 股票
  • 行业新闻
  • 黄金
没有结果
查看所有结果
  • 首页
  • 24 小时
  • 世界
  • 商业
  • 基金
  • 期货
  • 股票
  • 行业新闻
  • 黄金
没有结果
查看所有结果
周天财经
没有结果
查看所有结果
首页 商业

Fei-Fei Li Lays Out Vision for Spatial Intelligence and Hybrid World Models

2025 年 11 月 26 日
在 商业
阅读时间: 3 mins read
阅读:830
A A


Stanford professor and World Labs founder Fei-Fei Li outlined an expansive vision for "spatial intelligence" and defended the role of explicit 3D world generation in future AI systems, in a wide-ranging interview released on Monday.

Related articles

滴滴自动驾驶张博:深耕AI、硬件、场景三大能力,持续强化创新突破

滴滴自动驾驶张博:深耕 AI、硬件、场景三大能力,持续强化创新突破

2026 年 4 月 11 日
出去过的每个人都在讲,国外的钱也没有那么好挣

出去过的每个人都在讲,国外的钱也没有那么好挣

2026 年 4 月 11 日

The discussion spanned foundational debates over world models, the technical architecture behind her startup's first product, and the limits of today's physics-aware AI.

广告

Li, a leading figure in computer vision, argued that the next phase of artificial intelligence will be shaped less by language and more by machines' ability to perceive and reason about the physical world. Human cognition, she said, is fundamentally embodied and multimodal—a process that depends on vision, action, and interaction rather than text alone.

"Language captures only a subset of human knowledge," she said. "Much of what we know comes from interacting with the world, often without using language at all."

The remarks come as major AI labs push to build world models—systems that internalize 3D structure, physical dynamics and causal relationships. Li's approach diverges from that of deep-learning pioneer Yann LeCun, who has emphasized implicit, abstract representations that do not require models to explicitly generate scenes. She rejected the idea of a rivalry, saying the field ultimately needs both.

"We're intellectually on the same continuum," she said. "For a universal world model, implicit and explicit representations will both be indispensable."

World Labs' First Model Targets Explicit, Navigable 3D Worlds

Li's comments centered on Marble, World Labs' inaugural model, built on what her team calls a Real-Time Frame Model (RTFM). Unlike video-generation systems that output sequences of frames, Marble generates persistent, navigable 3D environments with object permanence and consistent geometry across viewpoints. It can take in text, images, video or rough spatial layouts and run in real time on a single Nvidia H100 GPU.

Maintaining internal coherence, Li said, required extensive engineering. "In early frame-based generation models, when you moved the camera, object consistency would collapse," she said. Marble's behavior remains largely statistical, not physics-driven: modern generative models, she noted, still imitate patterns in training data rather than compute formal forces.

"I don't think AI today is yet capable of abstracting the laws of physics," she said. "For Einstein-style abstraction, we haven't seen evidence that Transformers can do that." She nonetheless expects progress in physical reasoning within five years.

The Search for a 'Universal Task Function' in Vision

Li identified the absence of a unifying objective for spatial AI as a major research bottleneck. The success of language models was driven by next-token prediction, where training and inference are perfectly aligned. No equivalent exists for vision.

"Next-frame prediction is powerful, because the world has continuity," she said. "But it collapses a 3D world into 2D frames. And animals don't do perfect 3D reconstruction—yet they navigate extremely well."

A universal objective for spatial learning, she said, remains an open question.

Li's long-term vision is a "Neural Spatial Engine" that merges generative models with traditional physics engines used in game development. Physics engines compute collisions and rigid-body dynamics; generative models excel at producing rich content. She expects the two to converge.

"Ultimately, physics engines and world-generation models will merge," she said. "We're still at the beginning."

Such systems could make the creation of interactive 3D worlds inexpensive and accessible, enabling what she described as a "multiverse" of low-cost digital environments for education, entertainment, simulation, and scientific research.

Li said world models operating in robotics and other embodied settings must move beyond static training regimes. "Continuous learning is essential," she said, pointing to a future mix of context-based memory, online learning and algorithmic advances.

She emphasized that spatial intelligence is central to the broader quest for more general AI. "You can't put out a fire with language alone," she said. "A lot of human intelligence goes beyond symbols."

Li closed on a broadly optimistic note, predicting meaningful advances within the next half-decade, despite persistent uncertainty. "Some advances have surprised me by happening faster, and others slower," she said. "But five years is a reasonable timeframe."

更多精彩内容,关注钛媒体微信号 (ID:taimeiti),或者下载钛媒体 App

相关 文章

滴滴自动驾驶张博:深耕AI、硬件、场景三大能力,持续强化创新突破

滴滴自动驾驶张博:深耕 AI、硬件、场景三大能力,持续强化创新突破

来自 周天财经
2026 年 4 月 11 日
0

(本文作者为 科技指北,钛媒体经授权发布...

出去过的每个人都在讲,国外的钱也没有那么好挣

出去过的每个人都在讲,国外的钱也没有那么好挣

来自 周天财经
2026 年 4 月 11 日
0

(本文作者为 华商韬略,钛媒体经授权发布...

三家芯片厂的豪赌,到底值不值?

三家芯片厂的豪赌,到底值不值?

来自 周天财经
2026 年 4 月 11 日
0

(本文作者为 半导体产业纵横,钛媒体经授...

【科股一线拆解】「数据资产」领域将举办高规格发布会,这一新应用方向正被市场广泛关注

港股 IPO 乱象深析:谁在 「装睡」?谁在 「放水」?

来自 周天财经
2026 年 4 月 11 日
0

(本文作者为 财华社,钛媒体经授权发布)...

张勇回归后的首份财报,「红石榴计划」能否再造海底捞?

张勇回归后的首份财报,「红石榴计划」 能否再造海底捞?

来自 周天财经
2026 年 4 月 11 日
0

作为火锅行业乃至整个中餐领域的龙头,海底...

加载更多
广告
  • 热门
  • 评论
  • 最新
神马经典投研: 集资讯、策略、研报一站式期货投研工具

神马经典投研: 集资讯、策略、研报一站式期货投研工具

2025 年 11 月 7 日
「我们也深陷残酷价格战」,德资巨头中国区高管警告

「我们也深陷残酷价格战」,德资巨头中国区高管警告

2025 年 8 月 4 日
一周产业基金|上海市人工智能CVC基金发布;湖北百亿人形机器人母基金来了

一周产业基金|上海市人工智能 CVC 基金发布;湖北百亿人形机器人母基金来了

2025 年 8 月 4 日
「硬科技」指数携手上涨,半导体设备ETF易方达(159558)、芯片ETF易方达(516350)等产品助力布局板块龙头

基民懵了!这个火爆的板块年内涨超 37%,主力却借道 ETF 狂抛逾 400 亿元

2025 年 9 月 20 日
Lesson 1: Basics Of Photography With Natural Lighting

The Single Most Important Thing You Need To Know About Success

4
Lesson 1: Basics Of Photography With Natural Lighting

Lesson 1: Basics Of Photography With Natural Lighting

3
Lesson 1: Basics Of Photography With Natural Lighting

5 Ways Animals Will Help You Get More Business

2
Lesson 1: Basics Of Photography With Natural Lighting

New Cryptocurrency That Will Kill Of Bitcoin

2

中泰证券首席策略分析师徐驰:创业板改革精准支持 「优质未盈利创新企业」

2026 年 4 月 11 日
CoreWeave斩获重磅协议股价暴涨 「新云」势力加速挑战云霸权

CoreWeave 斩获重磅协议股价暴涨 「新云」 势力加速挑战云霸权

2026 年 4 月 11 日

特朗普称霍尔木兹海峡将 「很快」 开放

2026 年 4 月 11 日
汪滔VS刘靖康:谁是「取经人」, 谁又是「红孩儿」?

汪滔 VS 刘靖康:谁是 「取经人」, 谁又是 「红孩儿」?

2026 年 4 月 11 日
  • 隐私政策
  • 联系我们
  • 关于周天
  • 登录
  • 注册
投诉建议:+86 13326565461

© 2025 广州小舟天传媒有限公司 by 周天财经 - 粤 ICP 备 2025452169 号-1

没有结果
查看所有结果
  • 首页
  • 24 小时
  • 世界
  • 商业
  • 基金
  • 期货
  • 股票
  • 行业新闻
  • 黄金

© 2025 广州小舟天传媒有限公司 by 周天财经 - 粤 ICP 备 2025452169 号-1

欢迎回来!

在下面登录您的帐户

忘记密码? 注册

创建新帐户!

填写以下表格进行注册

所有项目需要填写。 登录

重置您的密码

请输入您的用户名或电子邮件地址以重置密码。

登录

用户登录

还没有账号?立即注册

用户注册

已有账号?立即登录