Reasoning models struggle to control their chains of thought, and that’s good

02:19 · March 6, 2026 · Friday

via OpenAI News

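As AI agents take on increasingly complex and autonomous tasks, reliable oversight of their behavior becomes more important. Following our principle of "iterative deployment," we observe how systems perform in real-world settings and keep refining safeguards as capabilities improve. To that end, our safety strategy takes a "defense-in-depth" approach built from multiple complementary layers, including safety training, behavioral testing, agent-based code review, and chain-of-thought (CoT) monitoring. CoT monitoring means analyzing the reasoning steps an agent generates while completing a task. These reasoning traces provide important signals during both training and deployment, helping monitoring systems identify whether an agent's behavior is unsafe or inconsistent with the user's intent. Currently, we…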
