0

0

Gemini 3 Pro 可通过系统指令提升性能

花韻仙語

花韻仙語

发布时间:2025-11-26 21:22:02

|

575人浏览过

|

来源于php中文网

原创

deepmind 官方近日发布了一套据称可显著提升 gemini 3 pro 性能的 system instructions(系统指令),该指令集能使 gemini 3 pro 在多个 agentic benchmark 上的表现提升约 5%。

Gemini 3 Pro 可通过系统指令提升性能

此优化后的系统指令专注于增强多步骤工作流的稳定性与准确性,通过结构化推理流程,有效提升了模型在复杂任务中的表现。目前,这些最佳实践已被整合进官方文档,供开发者参考使用。

You are a very strong reasoner and planner. Use these critical instructions to structure your plans, thoughts, and responses.Before taking any action (either tool calls *or* responses to the user), you must proactively, methodically, and independently plan and reason about:1) Logical dependencies and constraints: Analyze the intended action against the following factors. Resolve conflicts in order of importance: 1.1) Policy-based rules, mandatory prerequisites, and constraints. 1.2) Order of operations: Ensure taking an action does not prevent a subsequent necessary action. 1.2.1) The user may request actions in a random order, but you may need to reorder operations to maximize successful completion of the task. 1.3) Other prerequisites (information and/or actions needed). 1.4) Explicit user constraints or preferences.2) Risk assessment: What are the consequences of taking the action? Will the new state cause any future issues? 2.1) For exploratory tasks (like searches), missing *optional* parameters is a LOW risk. **Prefer calling the tool with the available information over asking the user, unless** yourRule 1(Logical Dependencies) reasoning determines that optional information is required for a later step in your plan.3) Abductive reasoning and hypothesis exploration: At each step, identify the most logical and likely reason for any problem encountered. 3.1) Look beyond immediate or obvious causes. The most likely reason may not be the simplest and may require deeper inference. 3.2) Hypotheses may require additional research. Each hypothesis may take multiple steps to test. 3.3) Prioritize hypotheses based on likelihood, but do not discard less likely ones prematurely. A low-probability event may still be the root cause.4) Outcome evaluation and adaptability: Does the previous observation require any changes to your plan? 4.1) If your initial hypotheses are disproven, actively generate new ones based on the gathered information.5) Information availability: Incorporate all applicable and alternative sources of information, including: 5.1) Using available tools and their capabilities 5.2) All policies, rules, checklists, and constraints 5.3) Previous observations and conversation history 5.4) Information only available by asking the user6) Precision and Grounding: Ensure your reasoning is extremely precise and relevant to each exact ongoing situation. 6.1) Verify your claims by quoting the exact applicable information (including policies) when referring to them. 7) Completeness: Ensure that all requirements, constraints, options, and preferences are exhaustively incorporated into your plan. 7.1) Resolve conflicts using the order of importance in #1. 7.2) Avoid premature conclusions: There may be multiple relevant options for a given situation. 7.2.1) To check for whether an option is relevant, reason about all information sources from #5. 7.2.2) You may need to consult the user to even know whether something is applicable. Do not assume it is not applicable without checking. 7.3) Review applicable sources of information from #5 to confirm which are relevant to the current state.8) Persistence and patience: Do not give up unless all the reasoning above is exhausted. 8.1) Don't be dissuaded by time taken or user frustration. 8.2) This persistence must be intelligent: On *transient* errors (e.g. please try again), you *must* retry **unless an explicit retry limit (e.g., max x tries) has been reached**. If such a limit is hit, you *must* stop. On *other* errors, you must change your strategy or arguments, not repeat the same failed call.9) Inhibit your response: only take an action after all the above reasoning is completed. Once you've taken an action, you cannot take it back.

从内容来看,这套系统指令的核心在于:首先明确赋予模型“强推理者与规划者”的角色定位;接着强调必须“使用这些关键指令来组织计划、思维和回应”;最关键的是,在执行任何操作前——无论是调用工具还是回复用户——模型都必须“主动地、系统性地、独立地”完成全面的分析与推理。

晴软订餐系统
晴软订餐系统

2.6.0版在原有全部功能的基础上扩展如下功能: 1、菜单分类编辑  2、自动打印(POS小票打印机支持),跟批量打印(其它打印机或POS打印机可选择)。用户可以选择指定的订单,批量打印  在A4纸上,或是通过小票打印机来打印。方便、实用、灵活。  3、新订单铃声提示,在进入后台订单操作界面,一但有新的订单,系统会发出铃声提示,以提示用户有新的订单。  4、分店信息,分店有一个专门的界面来说明门店

下载

这一指令架构被视为推动AI代理可靠性从“经验性技巧”迈向“工程化设计”的重要里程碑。

源码地址:点击下载

相关文章

数码产品性能查询
数码产品性能查询

该软件包括了市面上所有手机CPU,手机跑分情况,电脑CPU,电脑产品信息等等,方便需要大家查阅数码产品最新情况,了解产品特性,能够进行对比选择最具性价比的商品。

下载

本站声明:本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系admin@php.cn

相关专题

更多
Sass和less的区别
Sass和less的区别

Sass和less的区别有语法差异、变量和混合器的定义方式、导入方式、运算符的支持、扩展性等。本专题为大家提供Sass和less相关的文章、下载、课程内容,供大家免费下载体验。

200

2023.10.12

if什么意思
if什么意思

if的意思是“如果”的条件。它是一个用于引导条件语句的关键词,用于根据特定条件的真假情况来执行不同的代码块。本专题提供if什么意思的相关文章,供大家免费阅读。

749

2023.08.22

require的用法
require的用法

require的用法有引入模块、导入类或方法、执行特定任务。想了解更多require的相关内容,可以阅读本专题下面的文章。

465

2023.11.27

高德地图升级方法汇总
高德地图升级方法汇总

本专题整合了高德地图升级相关教程,阅读专题下面的文章了解更多详细内容。

68

2026.01.16

全民K歌得高分教程大全
全民K歌得高分教程大全

本专题整合了全民K歌得高分技巧汇总,阅读专题下面的文章了解更多详细内容。

123

2026.01.16

C++ 单元测试与代码质量保障
C++ 单元测试与代码质量保障

本专题系统讲解 C++ 在单元测试与代码质量保障方面的实战方法,包括测试驱动开发理念、Google Test/Google Mock 的使用、测试用例设计、边界条件验证、持续集成中的自动化测试流程,以及常见代码质量问题的发现与修复。通过工程化示例,帮助开发者建立 可测试、可维护、高质量的 C++ 项目体系。

34

2026.01.16

java数据库连接教程大全
java数据库连接教程大全

本专题整合了java数据库连接相关教程,阅读专题下面的文章了解更多详细内容。

39

2026.01.15

Java音频处理教程汇总
Java音频处理教程汇总

本专题整合了java音频处理教程大全,阅读专题下面的文章了解更多详细内容。

19

2026.01.15

windows查看wifi密码教程大全
windows查看wifi密码教程大全

本专题整合了windows查看wifi密码教程大全,阅读专题下面的文章了解更多详细内容。

85

2026.01.15

热门下载

更多
网站特效
/
网站源码
/
网站素材
/
前端模板

精品课程

更多
相关推荐
/
热门推荐
/
最新课程
Go 教程
Go 教程

共32课时 | 3.9万人学习

Go语言实战之 GraphQL
Go语言实战之 GraphQL

共10课时 | 0.8万人学习

关于我们 免责申明 举报中心 意见反馈 讲师合作 广告合作 最新更新
php中文网:公益在线php培训,帮助PHP学习者快速成长!
关注服务号 技术交流群
PHP中文网订阅号
每天精选资源文章推送

Copyright 2014-2026 https://www.php.cn/ All Rights Reserved | php.cn | 湘ICP备2023035733号