https://t.me/AI_News_CN
📈主流AI服务状态页通知 | 🆕汇集全网ChatGPT/AI新闻 #AI #ChatGPT
🆓免费AI聊天 https://free.netfly.top
✨BEST AI中转 https://api.oaibest.com 2.8折起 支持OpenAI, Claude code, Gemini,Grok, Deepseek, Midjourney, 文件上传分析
Buy ads: https://telega.io/c/AI_News_CN
📈主流AI服务状态页通知 | 🆕汇集全网ChatGPT/AI新闻 #AI #ChatGPT
🆓免费AI聊天 https://free.netfly.top
✨BEST AI中转 https://api.oaibest.com 2.8折起 支持OpenAI, Claude code, Gemini,Grok, Deepseek, Midjourney, 文件上传分析
Buy ads: https://telega.io/c/AI_News_CN
GPT-5.1-Codex-Max System Card
GPT‑5.1-Codex-Max 是我们最新一代具有 agentic 能力的编程代理模型。它建立在我们对基础推理模型的更新之上,该模型在软件工程、数学、科研、医学、计算机使用等多个领域的 agentic 任务上接受训练。它也是我们首个通过名为 compaction 的过程原生训练、能够在多个上下文窗口间协同工作的模型,能在单项任务中连贯处理数百万个标记。像以往型号一样, GPT‑5.1-Codex-Max 在真实世界的软件工程任务上接受训练,包括 PR 创建、代码审查、前端开发和 Q&A。
该系统说明概述了为 GPT‑5.1-Codex-Max 实施的全面安全措施。内容既包括模型层面的缓解手段——例如针对有害任务和 prompt injections 的专项安全训练——也包括产品层面的防护措施,如 agent sandboxing 和可配置的网络访问。
我们在 Preparedness Framework 下对 GPT‑5.1-Codex-Max 进行了评估。该模型在网络安全领域能力很强,但尚未达到网络安全方面的 High capability 水平。我们预计能力迅速提升的态势将持续,模型在不久的将来可能跨越这一门槛。与其他近期模型一样,它在生物学领域被视为 High capability ,并部署了与我们对 GPT‑5 使用的相应防护套件。它在 AI 自我改进方面则未达到 High capability 。
----------------------
Introduction
GPT‑5.1-Codex-Max is our new frontier agentic coding model. It is built on an update to our foundational reasoning model trained on agentic tasks across software engineering, math, research, medicine, computer use and more. It is our first model natively trained to operate across multiple context windows through a process called compaction, coherently working over millions of tokens in a single task. Like its predecessors, GPT‑5.1-Codex-Max was trained on real-world software engineering tasks like PR creation, code review, frontend coding and Q&A.
This system card outlines the comprehensive safety measures implemented for GPT‑5.1-CodexMax. It details both model-level mitigations, such as specialized safety training for harmful tasks and prompt injections, and product-level mitigations like agent sandboxing and configurable network access.
GPT‑5.1-Codex-Max was evaluated under our Preparedness Framework. It is very capable in the cybersecurity domain but does not reach High capability on cybersecurity. We expect current trends of rapidly increasing capability to continue, and for models to cross the High cybersecurity threshold in the near future. Like other recent models, it is being treated as High capability on biology, and is being deployed with the corresponding suite of safeguards we use for GPT‑5. It does not reach High capability on AI self-improvement.
via OpenAI News
GPT‑5.1-Codex-Max 是我们最新一代具有 agentic 能力的编程代理模型。它建立在我们对基础推理模型的更新之上,该模型在软件工程、数学、科研、医学、计算机使用等多个领域的 agentic 任务上接受训练。它也是我们首个通过名为 compaction 的过程原生训练、能够在多个上下文窗口间协同工作的模型,能在单项任务中连贯处理数百万个标记。像以往型号一样, GPT‑5.1-Codex-Max 在真实世界的软件工程任务上接受训练,包括 PR 创建、代码审查、前端开发和 Q&A。
该系统说明概述了为 GPT‑5.1-Codex-Max 实施的全面安全措施。内容既包括模型层面的缓解手段——例如针对有害任务和 prompt injections 的专项安全训练——也包括产品层面的防护措施,如 agent sandboxing 和可配置的网络访问。
我们在 Preparedness Framework 下对 GPT‑5.1-Codex-Max 进行了评估。该模型在网络安全领域能力很强,但尚未达到网络安全方面的 High capability 水平。我们预计能力迅速提升的态势将持续,模型在不久的将来可能跨越这一门槛。与其他近期模型一样,它在生物学领域被视为 High capability ,并部署了与我们对 GPT‑5 使用的相应防护套件。它在 AI 自我改进方面则未达到 High capability 。
----------------------
Introduction
GPT‑5.1-Codex-Max is our new frontier agentic coding model. It is built on an update to our foundational reasoning model trained on agentic tasks across software engineering, math, research, medicine, computer use and more. It is our first model natively trained to operate across multiple context windows through a process called compaction, coherently working over millions of tokens in a single task. Like its predecessors, GPT‑5.1-Codex-Max was trained on real-world software engineering tasks like PR creation, code review, frontend coding and Q&A.
This system card outlines the comprehensive safety measures implemented for GPT‑5.1-CodexMax. It details both model-level mitigations, such as specialized safety training for harmful tasks and prompt injections, and product-level mitigations like agent sandboxing and configurable network access.
GPT‑5.1-Codex-Max was evaluated under our Preparedness Framework. It is very capable in the cybersecurity domain but does not reach High capability on cybersecurity. We expect current trends of rapidly increasing capability to continue, and for models to cross the High cybersecurity threshold in the near future. Like other recent models, it is being treated as High capability on biology, and is being deployed with the corresponding suite of safeguards we use for GPT‑5. It does not reach High capability on AI self-improvement.
via OpenAI News
Alphabet Inc.周三股价创下两个月来最大涨幅,其最新发布的Gemini人工智能模型获得大量好评,提振了投资者对该公司立足于瞬息万变科技领域的信心。
该股最高上涨6.9%,创下自9月初以来的最大涨幅,并刷新历史新高。截至纽约时间11点,Alphabet看涨期权成交量突破37.6万份,远超20日均值约29万份的全天交易量。标普500指数上涨约0.5%,科技股占比较高的纳斯达克100指数则上涨0.8%。
这家谷歌母公司周二发布了最新版本的Gemini人工智能模型,其性能获得一致好评。该模型的强大表现与OpenAI的GPT-5形成鲜明对比,后者今年早些时候发布时则反响褒贬不一。
Robert W. Baird & Co.分析师Colin Sebastian在致客户报告中写道,“Gemini 3是否就是GPT-5本应达到的水平?”他援引了该版本获得的“极高评价”,并指出,“除提升搜索参与度和变现能力外,谷歌还融合了实时网络索引与先进模型训练技术,我们认为这是其关键竞争优势。”
via cnBeta.COM - 中文业界资讯站 (author: 稿源:环球市场播报)