refactor(prompt): 将注意力和内容混淆统一为提示词扰动

本次提交重构了提示词修改逻辑，将之前独立的“注意力优化”和“内容混淆”功能合并为一个统一的概念：“提示词扰动”(Prompt Perturbation)。主要变更包括： - 在模型配置中引入新的统一选项：`enable_prompt_perturbation`, `perturbation_strength` 和 `enable_semantic_variants`。 - 将原 `AttentionOptimizer` 中的噪声注入和语义变体逻辑迁移到 `llm_models` 模块中，作为扰动策略的一部分。 - 简化 `attention_optimizer.py`，使其专注于提示词块重排 (`BlockShuffler`)。 - 更新 `_PromptProcessor` 以根据新的统一配置来协调不同的扰动技术。此项更改为用户简化了配置，并通过集中化相关逻辑，提供了一个更清晰、更易于维护的实现。 BREAKING CHANGE: 内容混淆的相关配置已被替换。`enable_content_obfuscation` 和 `obfuscation_intensity` 配置项已移除。用户需更新配置以使用新的 `enable_prompt_perturbation` 和 `perturbation_strength`。
2025-11-14 20:14:19 +08:00
parent 45b6d7d06c
commit 198976b68f
3 changed files with 211 additions and 378 deletions
--- a/src/config/api_ada_configs.py
+++ b/src/config/api_ada_configs.py
@@ -71,9 +71,13 @@ class ModelInfo(ValidatedConfigBase):
    force_stream_mode: bool = Field(default=False, description="是否强制使用流式输出模式")
    extra_params: dict[str, Any] = Field(default_factory=dict, description="额外参数（用于API调用时的额外配置）")
    anti_truncation: bool = Field(default=False, description="是否启用反截断功能，防止模型输出被截断")
-    enable_content_obfuscation: bool = Field(default=False, description="是否启用内容混淆（用于特定场景下的内容处理）")
-    obfuscation_intensity: int = Field(default=1, ge=1, le=3, description="混淆强度（1-3级，数值越高混淆程度越强）")
+    enable_prompt_perturbation: bool = Field(default=False, description="是否启用提示词扰动（合并了内容混淆和注意力优化）")
+    perturbation_strength: Literal["light", "medium", "heavy"] = Field(
+        default="light", description="扰动强度（light/medium/heavy）"
+    )
+    enable_semantic_variants: bool = Field(default=False, description="是否启用语义变体作为扰动策略")

+    prepend_noise_instruction: bool = Field(default=False, description="是否在提示词前部添加抗审查指令")
    @classmethod
    def validate_prices(cls, v):
        """验证价格必须为非负数"""