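AI Daily – 2025-10-20 (Morning)
Tags: AHPO adaptive hybrid policy optimization algorithm, AI Agent multi-agent collaboration trends, AI reasoning capability, GPT-5, IWR-Bench, LLM mathematical reasoning performance bottleneck, MM-HELIX, long-chain reflective reasoning, Qwen2.5-VL-7B, Video-to-Code interactive web reconstruction benchmark, multimodal large models, LeRobot general-purpose robot policy framework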