如题
从 ARC-AGI-1 Leaderboard上来看
Gemini 3 Flash 跟GPT 5.2算是第一梯队
但是成本省了对手一大截
一个任务只有0.174刀而已
ARC Prize (https://arcprize.org/leaderboard)
ARC Prize - Leaderboard (https://arcprize.org/leaderboard)
The ARC-AGI Leaderboard.
Official benchmark
Google – 17 Dec 25 (https://blog.google/products/gemini/gemini-3-flash/)
Gemini 3 Flash: frontier intelligence built for speed (https://blog.google/products/gemini/gemini-3-flash/)
Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.