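Have you ever imagined a day when using a computer no longer means opening one app after another? Or a top-tier AI that, while coaching its “student” to score well on an exam, ends up learning to “cheat”? In this episode, we start from five recent papers and talk about the remarkable changes already under way: an “intelligent butler” that reshapes the operating system, a “dexterous robotic hand” that has learned to peel an apple, and a robot built as a “team of specialists” that soundly beats the “brute force works miracles” camp. Let's look at how AI is quietly rewriting the future in these unexpected corners.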
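00:00:36 Saying goodbye to apps: the way we get along with computers is being rewritten
00:07:15 When AI starts “tutoring” AI: a story of top students, lopsided skills, and cheating
00:13:38 The real problem isn't the AI, it's how we test it
00:18:53 Just how hard is it to get a robot to peel an apple for you?
00:25:31 Building a smart robot: “brute force works miracles” or “leave specialist work to the specialists”?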
Papers covered in this episode:
[AI] AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem
[University of Kansas]
https://arxiv.org/abs/2603.08938
---
[LG] PostTrainBench: Can LLM Agents Automate LLM Post-Training?
[ELLIS Institute Tübingen & University of Tübingen]
https://arxiv.org/abs/2603.08640
---
[AI] Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI
[Macquarie University]
https://arxiv.org/abs/2603.11413
---
[RO] Towards Human-Like Manipulation through RL-Augmented Teleoperation and Mixture-of-Dexterous-Experts VLA
[Shanghai Jiao Tong University & Sharpa]
https://arxiv.org/abs/2603.08122
---
[RO] TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation
[MIT CSAIL]
https://arxiv.org/abs/2603.09971