周天财经

Nvidia Accelerates Mistral AI Models, Helping Close Gap with OpenAI

December 3, 2025


TMTPOST -- Nvidia Corp. and French artificial intelligence (AI) startup Mistral AI have achieved significant performance breakthroughs through their latest collaboration, delivering up to 10 times faster inference speeds for Mistral's new model family on Nvidia's GB200 NVL72 systems compared to the previous-generation H200 chips.

Mistral AI on Tuesday released its Mistral 3 family of open-weight models, optimized for Nvidia platforms from data centers to edge devices. The release includes Mistral Large 3, a mixture-of-experts model with 675 billion total parameters and multilingual and multimodal capabilities, alongside nine smaller Ministral 3 variants designed for deployment on robots, drones and offline devices.

The partnership positions the two-year-old French company to better compete with leading AI labs including OpenAI and Google, particularly in enterprise deployments where customization and cost efficiency matter. Mistral has raised $2.7 billion at a $13.7 billion valuation, with Nvidia among its investors.

The collaboration delivers practical advantages for enterprise users. On the GB200 NVL72, Mistral Large 3 achieved over 5 million tokens per second per megawatt at 40 tokens per second per user, translating to lower per-token costs and improved energy efficiency for production AI systems.
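The efficiency figure can be restated in more familiar units with some simple arithmetic. The sketch below is a back-of-the-envelope conversion, not a number published by Nvidia or Mistral: it treats "over 5 million" as exactly 5 million tokens per second per megawatt, and the function names are illustrative.

```python
# Back-of-the-envelope conversion of the reported efficiency figure.
# Assumption: "over 5 million" is treated as exactly 5,000,000 (a lower bound).
TOKENS_PER_SECOND_PER_MW = 5_000_000
TOKENS_PER_SECOND_PER_USER = 40

WATTS_PER_MW = 1_000_000       # 1 MW = 1,000,000 joules per second
SECONDS_PER_HOUR = 3_600
KWH_PER_MWH = 1_000


def joules_per_token(tokens_per_second_per_mw: float) -> float:
    """Energy drawn per generated token, in joules."""
    return WATTS_PER_MW / tokens_per_second_per_mw


def tokens_per_kwh(tokens_per_second_per_mw: float) -> float:
    """Tokens generated for each kilowatt-hour consumed."""
    return tokens_per_second_per_mw * SECONDS_PER_HOUR / KWH_PER_MWH


def concurrent_users_per_mw(tokens_per_second_per_mw: float,
                            tokens_per_second_per_user: float) -> float:
    """User streams one megawatt can serve at the stated per-user rate."""
    return tokens_per_second_per_mw / tokens_per_second_per_user


if __name__ == "__main__":
    print("joules per token:", joules_per_token(TOKENS_PER_SECOND_PER_MW))
    print("tokens per kWh:", tokens_per_kwh(TOKENS_PER_SECOND_PER_MW))
    print("user streams per MW:",
          concurrent_users_per_mw(TOKENS_PER_SECOND_PER_MW,
                                  TOKENS_PER_SECOND_PER_USER))
```

Under these assumptions, the reported rate works out to roughly a fifth of a joule per token. The calculation covers only the power the figure is normalized against and says nothing about hardware or facility costs.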

GB200 Systems Drive Performance Gains

Mistral Large 3's architecture leverages Nvidia's hardware optimizations to unlock substantial efficiency improvements. The model's mixture-of-experts design activates only the most relevant experts for each input token rather than engaging all 675 billion parameters, reducing computational waste while maintaining accuracy.
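For readers unfamiliar with the idea, sparse activation in a mixture-of-experts layer can be illustrated with a toy top-k router. This is a generic sketch of the technique, not Mistral Large 3's actual router: the expert count, the value of k, the scores and the scalar "experts" are all made up for illustration.

```python
import math


def softmax(scores):
    """Convert raw router scores into probabilities."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Pick the k highest-scoring experts; renormalize their weights to sum to 1."""
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, router_scores, k):
    """Run only the selected experts and combine their outputs by weight."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_scores, k))


if __name__ == "__main__":
    # Hypothetical toy layer: 8 experts, each just scales its input.
    experts = [lambda x, scale=s: scale * x for s in range(1, 9)]
    # Hypothetical router scores for one token.
    router_scores = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0]
    chosen = route_top_k(router_scores, k=2)
    print("experts used:", [i for i, _ in chosen], "of", len(experts))
    print("output:", moe_forward(1.0, experts, router_scores, k=2))
```

In this toy layer only two of the eight experts run for the token, so most of the layer's parameters are never touched, which is the source of the compute savings the design aims for.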

The performance leap stems from several technical advances. Nvidia's TensorRT-LLM Wide Expert Parallelism exploits the GB200 NVL72's coherent memory domain through NVLink fabric, enabling optimized expert distribution and load balancing. The system also employs NVFP4 low-precision inference and Dynamo disaggregated inference optimizations to deliver peak performance for large-scale training and deployment.

These optimizations work across inference frameworks including Nvidia's TensorRT-LLM and the open-source SGLang and vLLM. The models are available through leading open-source platforms and cloud service providers, with deployment as Nvidia NIM microservices expected soon.

Ministral 3 Targets Edge Deployment

The compact Ministral 3 suite brings AI capabilities to devices operating without network connectivity. Available in 3 billion, 8 billion and 14 billion parameter configurations, each size offers Base, Instruct and Reasoning variants to match specific use cases.

Performance on edge platforms demonstrates practical viability. The Ministral-3B variants achieve up to 385 tokens per second on Nvidia's RTX 5090 GPU. On Nvidia Jetson Thor, the models deliver 52 tokens per second for single concurrency, scaling to 273 tokens per second with eight concurrent requests.
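The Jetson Thor figures imply a trade-off between total throughput and per-request speed, which a short calculation makes concrete. The sketch assumes the 273 tokens per second is the aggregate across all eight requests, which the source does not state explicitly; the function names are illustrative.

```python
# Reported Jetson Thor figures.
SINGLE_REQUEST_RATE = 52    # tokens per second, one request
AGGREGATE_RATE = 273        # tokens per second, assumed total across requests
CONCURRENT_REQUESTS = 8


def per_request_rate(aggregate_rate: float, requests: int) -> float:
    """Average tokens per second each request sees under concurrency."""
    return aggregate_rate / requests


def throughput_gain(single_rate: float, aggregate_rate: float) -> float:
    """How much total throughput grows relative to a single request."""
    return aggregate_rate / single_rate


def scaling_efficiency(single_rate: float, aggregate_rate: float,
                       requests: int) -> float:
    """Fraction of ideal linear scaling actually achieved."""
    return aggregate_rate / (single_rate * requests)


if __name__ == "__main__":
    print("per-request rate:",
          per_request_rate(AGGREGATE_RATE, CONCURRENT_REQUESTS))
    print("throughput gain:",
          throughput_gain(SINGLE_REQUEST_RATE, AGGREGATE_RATE))
    print("scaling efficiency:",
          scaling_efficiency(SINGLE_REQUEST_RATE, AGGREGATE_RATE,
                             CONCURRENT_REQUESTS))
```

If the aggregate reading is right, batching eight requests raises total output about fivefold while each individual request runs at roughly two-thirds of its single-stream speed.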

Guillaume Lample, Mistral co-founder and chief scientist, emphasized the efficiency advantage: "The huge majority of enterprise use cases are things that can be tackled by small models, especially if you fine-tune them." All Ministral 3 variants support vision, handle context windows of 128,000 to 256,000 tokens, and run on a single GPU, reducing deployment costs and latency.

Commercial Push Intensifies Competition

The release comes as Mistral accelerates commercial activity following a 1.7 billion euro funding round in September that valued the company at 11.7 billion euros. Dutch chip equipment maker ASML contributed 1.3 billion euros, with Nvidia also participating.

Mistral has secured contracts worth hundreds of millions of dollars with corporate clients and announced a deal Monday with HSBC for financial analysis and translation tasks. The company is also expanding through acquisitions to compete with U.S. rivals establishing European operations, including Anthropic and OpenAI, which both opened European offices this year.

The startup's open-weight approach contrasts with closed-source competitors. While OpenAI and Anthropic maintain proprietary models accessible only through APIs, Mistral releases model weights publicly for download and customization. Lample argues this delivers superior results for specific enterprise deployments: "In many cases, you can actually match or even out-perform closed-source models" through fine-tuning.


© 2025 广州小舟天传媒有限公司 by 周天财经 - 粤 ICP 备 2025452169 号-1
