Adi Singh 归档 - 每时AI

高中生用「我的世界」评测SOTA模型！Claude暂时领先，DeepSeek紧随其后

2025年3月29日16时作者新智元

新智元报道编辑：定慧AI模型在基准测试中表现优秀，但在人类容易解决的问题上却频频出错。创意评测兴起，如MC-Bench利用Minecraft方块来评估模型能力，普通用户也能参与评测。这种测评范式更贴近人类对AI直观和创造力的实际期待。

速递｜高中生在《我的世界》发起AI智力标准，百万建造玩家投票选出最佳模型

2025年3月22日16时作者 Z Potentials

A high school student developed MC-Bench, a website that allows AI models to compete in Minecraft builds. The platform uses the popular game as a test of AI’s creativity and capability. Users can vote on which model created the best build, while Anthropic, Google, OpenAI, and Alibaba are among the contributors funding the project.

火了！高中生用Minecraft做AI基准，用户看图投票决定大模型排名

2025年3月21日23时作者机器之心

高中生 Adi Singh 创建的 Minecraft Benchmark（MC-Bench）让玩家投票评估不同 AI 模型在 MineCraft 中的建造作品，涵盖指令遵循、代码完成度和创造力三个维度。

一	二	三	四	五	六	日
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31