But the weaknesses are just as clear. The ARC AGI 2 abstract-reasoning score was only 42.5, far behind Gemini 3.1 Pro's 76.5 and GPT-5.4's 76.1; the Terminal-Bench 2.0 score for terminal-based coding tasks was 59.0, also trailing GPT-5.4's 75.1 and Gemini 3.1 Pro's 68.5.
# Caller must use the right opcode: 45/85 for mov, 45/85 for lea (ModRM byte differs).
Court of Appeal