【Dify精讲】第16章：Provider接入开发

最新推荐文章于 2025-07-13 09:45:10 发布

1989

最新推荐文章于 2025-07-13 09:45:10 发布

阅读量1.1k

点赞数 34

CC 4.0 BY-SA版权

分类专栏： Dify源码深度分析与开发实战指南文章标签：网络服务器运维性能优化 python flask AI编程

本文链接：https://blog.csdn.net/wiyi9891/article/details/148796718

Dify源码深度分析与开发实战指南专栏收录该内容

34 篇文章

订阅专栏

引言

还记得当年第一次看到 Dify 支持几十种模型提供商时的震撼吗？从 OpenAI、Anthropic 这样的头部厂商，到 Hugging Face、Ollama 这样的开源方案，再到国内的通义千问、智谱 ChatGLM，几乎涵盖了市面上所有主流的大语言模型。

但这背后的秘密是什么？为什么 Dify 能如此轻松地支持这么多不同的模型？答案就在 Provider 系统的精妙设计上。

作为一个在 AI 生态摸爬滚打多年的老兵，我深知模型接入的复杂性：不同的 API 格式、各异的认证方式、千差万别的参数结构。但 Dify 通过抽象层设计，让这一切变得井然有序。

今天，我们就来深入探索 Dify 的 Provider 接入开发，看看如何为 Dify 添加一个全新的模型提供商。

一、Provider接口规范深度解析

1.1 架构设计的哲学思考

打开 Dify 的插件源码仓库，你会发现一个有趣的现象：从 Dify v1.0.0（2025年2月）开始，所有模型和工具都已迁移到插件中，现在存储在专门的插件仓库中。

这不是简单的代码重构，而是架构进化的必然结果。让我们看看这套系统的核心设计：

# provider配置的核心结构
provider: anthropic              # 提供商标识符
label:                          # 显示名称
  en_US: Anthropic
  zh_Hans: Anthropic
supported_model_types:          # 支持的模型类型
  - llm
  - text-embedding
configurate_methods:            # 配置方法
  - predefined-model           # 预定义模型
  - customizable-model         # 可定制模型

这种设计的精妙之处在于职责分离：

Provider 层：负责统一的认证和基础配置
Model 层：负责具体的模型调用逻辑
Schema 层：负责参数验证和类型约束

1.2 三种配置模式的智慧

Dify 支持三种 Provider 配置模式，每种都有其适用场景：

1. predefined-model（预定义模型）
这是最常见的模式，适用于像 OpenAI、Anthropic 这样有固定模型列表的提供商：

# 只需要配置统一的 Provider 凭据
provider_credentials = {
    "api_key": "sk-xxxxxxx"
}
# 就可以使用所有预定义的模型
models = ["gpt-4", "gpt-3.5-turbo", "claude-3-opus"]

2. customizable-model（可定制模型）
适用于每个模型都需要独特配置的场景，比如 Xinference：

# 每个模型都需要独立的配置
model_config = {
    "model_uid": "unique-model-id",
    "server_url": "http://localhost:9997"
}

3. fetch-from-remote（远程获取）
动态从提供商获取可用模型列表，适用于像 Hugging Face 这样模型数量巨大的平台。

坑点提醒：这三种模式可以共存！一个 Provider 可以同时支持预定义模型和自定义模型，这给了开发者极大的灵活性。

1.3 Provider 基类接口剖析

每个 Provider 都需要继承 ModelProvider 基类：

from dify_plugin import ModelProvider
from dify_plugin.entities.model import ModelType
from dify_plugin.errors.model import CredentialsValidateFailedError

class CustomProvider(ModelProvider):
    def validate_provider_credentials(self, credentials: dict) -> None:
        """
        验证 Provider 级别的凭据
        这是唯一必须实现的方法
        """
        try:
            # 尝试使用凭据调用一个轻量级的 API
            model_instance = self.get_model_instance(ModelType.LLM)
            model_instance.validate_credentials(
                model="default-model", 
                credentials=credentials
            )
        except Exception as ex:
            raise CredentialsValidateFailedError(f"凭据验证失败: {str(ex)}")

设计巧思：注意这里的验证策略，Dify 并不直接验证 Provider 凭据，而是委托给具体的模型实例进行验证。这种设计避免了重复的验证逻辑。

二、自定义LLM接入实战

2.1 实际案例：接入一个虚拟的"智能助手API"

假设我们要接入一个名为 “SmartAI” 的模型提供商，它提供类似 OpenAI 的 API 接口。让我们一步步完成整个接入过程。

第一步：创建项目结构

使用 Dify 提供的脚手架工具：

# 创建新的插件项目
dify plugin init my-smartai-provider --type models

# 项目结构
my-smartai-provider/
├── providers/
│   ├── smartai.yaml          # Provider 配置
│   └── smartai.py           # Provider 实现
├── models/
│   └── llm/
│       ├── llm.py           # LLM 实现
│       ├── smart-chat-v1.yaml  # 模型定义
│       └── _position.yaml   # 模型排序
└── _assets/
    ├── icon_s_en.svg        # 小图标
    └── icon_l_en.svg        # 大图标

第二步：编写 Provider 配置

# providers/smartai.yaml
provider: smartai
label:
  en_US: SmartAI
  zh_Hans: 智能助手
description:
  en_US: Advanced AI models for chat and reasoning
icon_small:
  en_US: icon_s_en.svg
icon_large:
  en_US: icon_l_en.svg
background: "#1E40AF"
help:
  title:
    en_US: Get your API Key from SmartAI
  url:
    en_US: https://console.smartai.com/api-keys
supported_model_types:
  - llm
configurate_methods:
  - predefined-model
provider_credential_schema:
  credential_form_schemas:
    - variable: smartai_api_key
      label:
        en_US: API Key
      type: secret-input
      required: true
      placeholder:
        en_US: Enter your SmartAI API Key
    - variable: smartai_base_url
      label:
        en_US: Base URL
      type: text-input
      required: false
      default: "https://api.smartai.com/v1"
      placeholder:
        en_US: Enter custom base URL (optional)

第三步：实现 Provider 类

# providers/smartai.py
import logging
from dify_plugin import ModelProvider
from dify_plugin.entities.model import ModelType
from dify_plugin.errors.model import CredentialsValidateFailedError

logger = logging.getLogger(__name__)

class SmartAIProvider(ModelProvider):
    def validate_provider_credentials(self, credentials: dict) -> None:
        """
        验证 SmartAI 的 API 凭据
        """
        try:
            # 获取 LLM 模型实例进行验证
            model_instance = self.get_model_instance(ModelType.LLM)
            model_instance.validate_credentials(
                model="smart-chat-v1",  # 使用默认模型进行验证
                credentials=credentials
            )
        except CredentialsValidateFailedError as ex:
            raise ex
        except Exception as ex:
            logger.exception(f"SmartAI credentials validation failed")
            raise CredentialsValidateFailedError(
                f"Invalid API credentials: {str(ex)}"
            )

2.2 LLM 模型实现的核心逻辑

现在来到最关键的部分——LLM 模型的具体实现：

# models/llm/llm.py
import json
import requests
from typing import Optional, Generator, Union, List
from dify_plugin.entities.model import ModelType
from dify_plugin.entities.model.llm import LLMResult, LLMResultChunk, LLMUsage
from dify_plugin.entities.model.message import PromptMessage, UserPromptMessage
from dify_plugin.errors.model import CredentialsValidateFailedError, InvokeError
from dify_plugin.interfaces.model.large_language_model import LargeLanguageModel

class SmartAILargeLanguageModel(LargeLanguageModel):
    """
    SmartAI 大语言模型实现
    """
    
    def _invoke(self, 
                model: str,
                credentials: dict,
                prompt_messages: List[PromptMessage],
                model_parameters: Optional[dict] = None,
                tools: Optional[List[dict]] = None,
                stop: Optional[Union[str, List[str]]] = None,
                stream: bool = True,
                user: Optional[str] = None) -> Union[LLMResult, Generator]:
        """
        调用 SmartAI 模型
        这是核心方法，需要处理流式和非流式两种调用
        """
        
        # 1. 准备请求参数
        api_key = credentials.get('smartai_api_key')
        base_url = credentials.get('smartai_base_url', 'https://api.smartai.com/v1')
        
        if not api_key:
            raise CredentialsValidateFailedError("API Key is required")
        
        # 2. 转换消息格式
        messages = self._convert_prompt_messages_to_dict(prompt_messages)
        
        # 3. 构建请求体
        request_data = {
            'model': model,
            'messages': messages,
            'stream': stream,
            **(model_parameters or {})
        }
        
        if tools:
            request_data['tools'] = tools
        if stop:
            request_data['stop'] = stop
            
        # 4. 发起请求
        headers = {
            'Authorization': f'Bearer {api_key}',
            'Content-Type': 'application/json'
        }
        
        try:
            if stream:
                return self._handle_stream_response(
                    base_url, headers, request_data
                )
            else:
                return self._handle_sync_response(
                    base_url, headers, request_data
                )
        except Exception as e:
            raise InvokeError(f"Model invocation failed: {str(e)}")
    
    def _handle_stream_response(self, 
                              base_url: str, 
                              headers: dict, 
                              request_data: dict) -> Generator:
        """
        处理流式响应
        """
        url = f"{base_url}/chat/completions"
        
        with requests.post(url, headers=headers, 
                         json=request_data, stream=True) as response:
            response.raise_for_status()
            
            for line in response.iter_lines():
                if line:
                    line = line.decode('utf-8')
                    if line.startswith('data: '):
                        data = line[6:]  # 去掉 'data: ' 前缀
                        if data == '[DONE]':
                            break
                        
                        try:
                            chunk_data = json.loads(data)
                            chunk = self._extract_response_chunk(chunk_data)
                            if chunk:
                                yield chunk
                        except json.JSONDecodeError:
                            continue
    
    def _handle_sync_response(self, 
                            base_url: str, 
                            headers: dict, 
                            request_data: dict) -> LLMResult:
        """
        处理同步响应
        """
        url = f"{base_url}/chat/completions"
        request_data['stream'] = False
        
        response = requests.post(url, headers=headers, json=request_data)
        response.raise_for_status()
        
        response_data = response.json()
        return self._extract_response_result(response_data)
    
    def _convert_prompt_messages_to_dict(self, 
                                       messages: List[PromptMessage]) -> List[dict]:
        """
        将 Dify 的消息格式转换为 SmartAI API 格式
        """
        result = []
        for message in messages:
            if isinstance(message, UserPromptMessage):
                result.append({
                    "role": "user",
                    "content": message.content
                })
            # 处理其他消息类型...
        return result
    
    def _extract_response_chunk(self, chunk_data: dict) -> Optional[LLMResultChunk]:
        """
        从流式响应中提取数据块
        """
        if 'choices' not in chunk_data or not chunk_data['choices']:
            return None
            
        choice = chunk_data['choices'][0]
        delta = choice.get('delta', {})
        
        # 提取文本内容
        content = delta.get('content', '')
        finish_reason = choice.get('finish_reason')
        
        # 构建使用量信息（如果可用）
        usage = None
        if 'usage' in chunk_data:
            usage_data = chunk_data['usage']
            usage = LLMUsage(
                prompt_tokens=usage_data.get('prompt_tokens', 0),
                completion_tokens=usage_data.get('completion_tokens', 0),
                total_tokens=usage_data.get('total_tokens', 0)
            )
        
        return LLMResultChunk(
            model=chunk_data.get('model', ''),
            prompt_messages=[],  # 流式响应中通常不包含原始消息
            delta=LLMResultChunkDelta(
                index=choice.get('index', 0),
                message=AssistantPromptMessage(content=content),
                finish_reason=finish_reason,
                usage=usage
            )
        )
    
    def validate_credentials(self, model: str, credentials: dict) -> None:
        """
        验证模型凭据
        """
        try:
            # 使用一个简单的测试请求验证凭据
            test_messages = [
                UserPromptMessage(content="Hello")
            ]
            
            # 发起一个非流式的简单调用
            result = self._invoke(
                model=model,
                credentials=credentials,
                prompt_messages=test_messages,
                stream=False
            )
            
            if not isinstance(result, LLMResult):
                raise CredentialsValidateFailedError("Invalid response format")
                
        except Exception as e:
            raise CredentialsValidateFailedError(f"Credentials validation failed: {str(e)}")
    
    def get_num_tokens(self, 
                      model: str, 
                      credentials: dict, 
                      prompt_messages: List[PromptMessage],
                      tools: Optional[List[dict]] = None) -> int:
        """
        计算 Token 数量
        如果提供商没有专门的 Token 计算 API，可以使用近似估算
        """
        # 简单估算：大约 4 个字符 = 1 个 token
        total_content = ""
        for message in prompt_messages:
            if hasattr(message, 'content'):
                total_content += message.content
        
        return len(total_content) // 4

2.3 模型定义文件

最后，我们需要定义具体的模型：

# models/llm/smart-chat-v1.yaml
model: smart-chat-v1
label:
  en_US: SmartChat V1
  zh_Hans: 智能对话 V1
model_type: llm
features:
  - function-call  # 支持函数调用
  - vision        # 支持视觉输入
model_properties:
  context_size: 32768
  max_chunks: 1
parameter_rules:
  - name: temperature
    use_template: temperature
    min: 0.0
    max: 2.0
    default: 0.7
  - name: top_p  
    use_template: top_p
    min: 0.0
    max: 1.0
    default: 1.0
  - name: max_tokens
    use_template: max_tokens
    min: 1
    max: 4096
    default: 1024

三、向量库适配开发进阶

3.1 向量模型的特殊性

向量模型与 LLM 的主要区别在于输入输出格式：

# models/text_embedding/text_embedding.py
from dify_plugin.interfaces.model.text_embedding_model import TextEmbeddingModel
from dify_plugin.entities.model.text_embedding import TextEmbeddingResult

class SmartAITextEmbeddingModel(TextEmbeddingModel):
    def _invoke(self,
                model: str,
                credentials: dict,
                texts: List[str],
                user: Optional[str] = None) -> TextEmbeddingResult:
        """
        调用向量化模型
        """
        api_key = credentials.get('smartai_api_key')
        base_url = credentials.get('smartai_base_url', 'https://api.smartai.com/v1')
        
        # 构建请求
        request_data = {
            'model': model,
            'input': texts
        }
        
        headers = {
            'Authorization': f'Bearer {api_key}',
            'Content-Type': 'application/json'
        }
        
        response = requests.post(
            f"{base_url}/embeddings",
            headers=headers,
            json=request_data
        )
        response.raise_for_status()
        
        result = response.json()
        
        # 提取向量和使用量信息
        embeddings = []
        for item in result['data']:
            embeddings.append(item['embedding'])
        
        usage = TextEmbeddingUsage(
            tokens=result.get('usage', {}).get('total_tokens', 0),
            total_tokens=result.get('usage', {}).get('total_tokens', 0)
        )
        
        return TextEmbeddingResult(
            model=model,
            embeddings=embeddings,
            usage=usage
        )

3.2 Rerank 模型实现

Rerank 模型用于对检索结果进行重排序：

# models/rerank/rerank.py  
from dify_plugin.interfaces.model.rerank_model import RerankModel
from dify_plugin.entities.model.rerank import RerankResult, RerankDocument

class SmartAIRerankModel(RerankModel):
    def _invoke(self,
                model: str,
                credentials: dict,
                query: str,
                docs: List[str],
                score_threshold: Optional[float] = None,
                top_n: Optional[int] = None,
                user: Optional[str] = None) -> RerankResult:
        """
        对文档进行重排序
        """
        # 实现 Rerank 逻辑
        # 返回排序后的文档和分数
        pass

四、Provider测试验证

4.1 单元测试编写

测试是确保 Provider 质量的关键：

# tests/smartai/test_llm.py
import os
import pytest
from dify_plugin.entities.model.message import UserPromptMessage
from models.llm.llm import SmartAILargeLanguageModel

class TestSmartAILLM:
    def setup_method(self):
        self.model = SmartAILargeLanguageModel()
        self.credentials = {
            'smartai_api_key': os.environ.get('SMARTAI_API_KEY'),
            'smartai_base_url': 'https://api.smartai.com/v1'
        }
    
    def test_validate_credentials(self):
        """测试凭据验证"""
        self.model.validate_credentials(
            model='smart-chat-v1',
            credentials=self.credentials
        )
    
    def test_invoke_sync(self):
        """测试同步调用"""
        result = self.model.invoke(
            model='smart-chat-v1',
            credentials=self.credentials,
            prompt_messages=[
                UserPromptMessage(content="Say hello")
            ],
            stream=False
        )
        
        assert result.message.content
        assert result.usage.total_tokens > 0
    
    def test_invoke_stream(self):
        """测试流式调用"""
        chunks = list(self.model.invoke(
            model='smart-chat-v1',
            credentials=self.credentials,
            prompt_messages=[
                UserPromptMessage(content="Count from 1 to 5")
            ],
            stream=True
        ))
        
        assert len(chunks) > 0
        # 验证最后一个 chunk 包含使用量信息
        last_chunk = chunks[-1]
        assert last_chunk.delta.usage is not None

4.2 集成测试策略

# tests/smartai/test_provider.py
import pytest
from providers.smartai import SmartAIProvider

class TestSmartAIProvider:
    def setup_method(self):
        self.provider = SmartAIProvider()
    
    def test_validate_provider_credentials(self):
        """测试 Provider 级别的凭据验证"""
        credentials = {
            'smartai_api_key': os.environ.get('SMARTAI_API_KEY')
        }
        
        # 应该不抛出异常
        self.provider.validate_provider_credentials(credentials)
    
    def test_invalid_credentials(self):
        """测试无效凭据"""
        with pytest.raises(CredentialsValidateFailedError):
            self.provider.validate_provider_credentials({
                'smartai_api_key': 'invalid-key'
            })

4.3 本地调试技巧

在开发过程中，这些调试技巧能节省大量时间：

1. 使用环境变量管理测试凭据

# .env.example 添加测试变量
SMARTAI_API_KEY=your_test_key_here
SMARTAI_BASE_URL=https://api.smartai.com/v1

2. 启用详细日志

import logging
logging.basicConfig(level=logging.DEBUG)

3. 使用 Mock 进行离线测试

from unittest.mock import patch, MagicMock

@patch('requests.post')
def test_llm_invoke_mocked(mock_post):
    # 模拟 API 响应
    mock_response = MagicMock()
    mock_response.json.return_value = {
        'choices': [{
            'message': {'content': 'Hello!'},
            'finish_reason': 'stop'
        }],
        'usage': {'total_tokens': 10}
    }
    mock_post.return_value = mock_response
    
    # 运行测试
    result = model.invoke(...)
    assert result.message.content == 'Hello!'

五、生产环境最佳实践

5.1 错误处理和重试机制

在生产环境中，网络问题不可避免，完善的错误处理至关重要：

import time
import random
from functools import wraps

def retry_with_exponential_backoff(max_retries=3, base_delay=1):
    """
    指数退避重试装饰器
    """
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            last_exception = None
            
            for attempt in range(max_retries + 1):
                try:
                    return func(*args, **kwargs)
                except (requests.RequestException, InvokeError) as e:
                    last_exception = e
                    if attempt == max_retries:
                        break
                    
                    # 计算延迟时间（指数退避 + 随机抖动）
                    delay = base_delay * (2 ** attempt) + random.uniform(0, 1)
                    time.sleep(delay)
            
            raise InvokeError(f"Max retries exceeded: {str(last_exception)}")
        return wrapper
    return decorator

class RobustSmartAILargeLanguageModel(SmartAILargeLanguageModel):
    @retry_with_exponential_backoff(max_retries=3)
    def _invoke(self, *args, **kwargs):
        return super()._invoke(*args, **kwargs)

5.2 性能优化策略

1. 连接池复用

import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

class OptimizedSmartAIModel:
    def __init__(self):
        self.session = requests.Session()
        
        # 配置重试策略
        retry_strategy = Retry(
            total=3,
            backoff_factor=1,
            status_forcelist=[429, 500, 502, 503, 504],
        )
        
        adapter = HTTPAdapter(max_retries=retry_strategy)
        self.session.mount("http://", adapter)
        self.session.mount("https://", adapter)
    
    def _make_request(self, url, headers, data):
        return self.session.post(url, headers=headers, json=data)

2. 流式响应优化

def _handle_stream_response_optimized(self, url, headers, request_data):
    """
    优化的流式响应处理
    """
    try:
        with self.session.post(
            url, 
            headers=headers, 
            json=request_data, 
            stream=True,
            timeout=(30, 60)  # 连接超时30秒，读取超时60秒
        ) as response:
            response.raise_for_status()
            
            buffer = ""
            for chunk in response.iter_content(chunk_size=1024, decode_unicode=True):
                if chunk:
                    buffer += chunk
                    lines = buffer.split('\n')
                    buffer = lines[-1]  # 保留最后一个可能不完整的行
                    
                    for line in lines[:-1]:
                        if line.startswith('data: '):
                            yield self._process_line(line)
                            
    except requests.exceptions.Timeout:
        raise InvokeError("Request timeout")
    except requests.exceptions.RequestException as e:
        raise InvokeError(f"Request failed: {str(e)}")

5.3 监控和可观测性

import time
from contextlib import contextmanager

@contextmanager
def model_invoke_metrics(model_name: str, operation: str):
    """
    模型调用指标收集
    """
    start_time = time.time()
    try:
        yield
        # 记录成功指标
        duration = time.time() - start_time
        logger.info(f"Model {model_name} {operation} succeeded in {duration:.2f}s")
    except Exception as e:
        # 记录失败指标
        duration = time.time() - start_time
        logger.error(f"Model {model_name} {operation} failed in {duration:.2f}s: {str(e)}")
        raise

class InstrumentedSmartAIModel:
    def _invoke(self, model, *args, **kwargs):
        with model_invoke_metrics(model, "llm_invoke"):
            return super()._invoke(model, *args, **kwargs)

六、Provider 发布和维护

6.1 打包和发布流程

当你完成了 Provider 的开发和测试，是时候将其贡献给 Dify 社区了。让我们看看完整的发布流程：

第一步：代码质量检查

# 运行代码格式检查
black providers/ models/ tests/
flake8 providers/ models/ tests/

# 运行类型检查
mypy providers/ models/

# 运行全部测试
pytest tests/ -v --cov=providers --cov=models

第二步：打包插件

# 在插件项目根目录执行
dify plugin package providers/smartai

# 这将生成 smartai.difypkg 文件

第三步：本地验证

# 在 Dify 开发环境中安装插件进行最终验证
dify plugin install smartai.difypkg

# 启动 Dify 服务
cd dify && docker-compose up -d

第四步：提交到社区

# Fork 官方插件仓库
git clone https://github.com/your-username/dify-official-plugins.git
cd dify-official-plugins

# 创建新分支
git checkout -b add-smartai-provider

# 复制你的 Provider 代码
cp -r /path/to/your/smartai-provider models/

# 提交代码
git add .
git commit -m "feat: add SmartAI provider support

- Support LLM models: smart-chat-v1, smart-reasoning-v2
- Support Text Embedding models: smart-embed-v1  
- Include comprehensive test coverage
- Add proper error handling and retry mechanisms"

# 推送并创建 Pull Request
git push origin add-smartai-provider

6.2 版本管理策略

语义版本控制

# 在 provider 配置中添加版本信息
provider: smartai
version: "1.0.0"  # 主版本.次版本.修订版本
label:
  en_US: SmartAI v1.0.0

版本更新规则：

主版本（1.x.x）：不兼容的 API 更改
次版本（x.1.x）：向后兼容的功能新增
修订版本（x.x.1）：向后兼容的问题修复

迁移指南编写

当需要进行破坏性更改时，一定要提供详细的迁移指南：

# SmartAI Provider v2.0.0 迁移指南

## 破坏性更改

### 1. 凭据字段更名
- `smartai_api_key` → `api_key`
- `smartai_base_url` → `base_url`

### 2. 模型名称更新
- `smart-chat-v1` → `smartai-chat-v1`

## 迁移步骤

1. 更新插件到 v2.0.0
2. 重新配置 Provider 凭据
3. 更新应用中的模型选择

## 兼容性说明

v2.0.0 向前兼容 6 个月，建议在 2025年8月前完成迁移。

6.3 社区协作和维护

Issue 处理流程

# 在代码中添加详细的错误信息，方便用户报告问题
class SmartAIError(Exception):
    def __init__(self, message: str, error_code: str = None, details: dict = None):
        self.message = message  
        self.error_code = error_code
        self.details = details or {}
        
        # 构建详细的错误信息
        error_info = [f"SmartAI Error: {message}"]
        if error_code:
            error_info.append(f"Error Code: {error_code}")
        if details:
            error_info.append(f"Details: {json.dumps(details, indent=2)}")
            
        super().__init__("\n".join(error_info))

文档维护

为你的 Provider 创建完整的文档：

# SmartAI Provider 使用指南

## 快速开始

1. 获取 API Key
   访问 [SmartAI Console](https://console.smartai.com) 获取 API Key

2. 配置 Provider
   在 Dify 中进入 Settings -> Model Providers -> SmartAI

3. 选择模型
   支持的模型列表：
   - `smart-chat-v1`: 通用对话模型，适合聊天应用
   - `smart-reasoning-v2`: 推理增强模型，适合复杂问题解决

## 高级配置

### 自定义 Base URL
如果你使用的是私有部署的 SmartAI 服务：
```yaml
base_url: "https://your-private-smartai.com/v1"

参数调优建议
● temperature: 0.7 (平衡创造性和准确性)
● max_tokens: 1024 (根据应用需求调整)
● top_p: 0.9 (控制输出的多样性)
故障排除
常见错误
错误: "Invalid API Key"
解决: 检查 API Key 是否正确，确认账户余额充足
错误: "Rate limit exceeded"
解决: 降低请求频率或升级账户套餐
错误: "Model not found"
解决: 确认模型名称正确，检查模型是否在你的账户权限范围内
支持渠道
● GitHub Issues: 报告 Bug 或功能请求
● 社区论坛: Dify Community
● 邮件支持: smartai-support@example.com

七、进阶技巧和优化

7.1 Function Calling 实现

现代 LLM 的一个重要特性是 Function Calling，让我们看看如何在 Provider 中支持这个功能：

def _invoke(self, 
            model: str,
            credentials: dict,
            prompt_messages: List[PromptMessage],
            model_parameters: Optional[dict] = None,
            tools: Optional[List[dict]] = None,
            stop: Optional[Union[str, List[str]]] = None,
            stream: bool = True,
            user: Optional[str] = None) -> Union[LLMResult, Generator]:

    # 处理工具调用
    request_data = {
        'model': model,
        'messages': self._convert_prompt_messages_to_dict(prompt_messages),
        'stream': stream,
        **(model_parameters or {})
    }

    # 如果提供了工具，添加到请求中
    if tools:
        request_data['tools'] = self._convert_tools_to_smartai_format(tools)
        request_data['tool_choice'] = 'auto'  # 让模型自动决定是否调用工具

    # ... 发起请求的逻辑 ...

def _convert_tools_to_smartai_format(self, tools: List[dict]) -> List[dict]:
    """
    将 Dify 的工具格式转换为 SmartAI 的格式
    """
    smartai_tools = []
    for tool in tools:
        smartai_tool = {
            'type': 'function',
            'function': {
                'name': tool['name'],
                'description': tool['description'],
                'parameters': tool['parameters']
            }
        }
        smartai_tools.append(smartai_tool)
    return smartai_tools

def _extract_tool_calls(self, response_data: dict) -> List[dict]:
    """
    从响应中提取工具调用信息
    """
    tool_calls = []
    if 'choices' in response_data:
        choice = response_data['choices'][0]
        message = choice.get('message', {})

        if 'tool_calls' in message:
            for tool_call in message['tool_calls']:
                tool_calls.append({
                    'id': tool_call['id'],
                    'type': tool_call['type'],
                    'function': {
                        'name': tool_call['function']['name'],
                        'arguments': tool_call['function']['arguments']
                    }
                })

    return tool_calls

7.2 Vision 能力支持

对于支持图像输入的模型，需要处理多模态消息：

from dify_plugin.entities.model.message import ImagePromptMessageContent

def _convert_prompt_messages_to_dict(self, messages: List[PromptMessage]) -> List[dict]:
    """
    转换消息格式，支持图像输入
    """
    result = []
    for message in messages:
        if isinstance(message, UserPromptMessage):
            # 检查是否包含图像内容
            if hasattr(message, 'content') and isinstance(message.content, list):
                content = []
                for item in message.content:
                    if isinstance(item, ImagePromptMessageContent):
                        content.append({
                            "type": "image_url",
                            "image_url": {
                                "url": item.data,  # base64 或 URL
                                "detail": item.detail or "auto"
                            }
                        })
                    else:
                        content.append({
                            "type": "text", 
                            "text": str(item)
                        })
                
                result.append({
                    "role": "user",
                    "content": content
                })
            else:
                # 纯文本消息
                result.append({
                    "role": "user",
                    "content": message.content
                })
    
    return result

7.3 性能监控和指标收集

在生产环境中，我们需要收集详细的性能指标：

import time
from collections import defaultdict
from threading import Lock

class ProviderMetrics:
    """
    Provider 性能指标收集器
    """
    def __init__(self):
        self.metrics = defaultdict(list)
        self.lock = Lock()
    
    def record_invoke(self, model: str, duration: float, tokens: int, success: bool):
        with self.lock:
            self.metrics['invocations'].append({
                'model': model,
                'duration': duration,
                'tokens': tokens,
                'success': success,
                'timestamp': time.time()
            })
    
    def get_stats(self, window_minutes: int = 60):
        """
        获取指定时间窗口内的统计信息
        """
        cutoff = time.time() - (window_minutes * 60)
        recent_invocations = [
            inv for inv in self.metrics['invocations'] 
            if inv['timestamp'] > cutoff
        ]
        
        if not recent_invocations:
            return {}
        
        total_invocations = len(recent_invocations)
        successful_invocations = sum(1 for inv in recent_invocations if inv['success'])
        total_tokens = sum(inv['tokens'] for inv in recent_invocations)
        avg_duration = sum(inv['duration'] for inv in recent_invocations) / total_invocations
        
        return {
            'total_invocations': total_invocations,
            'success_rate': successful_invocations / total_invocations,
            'total_tokens': total_tokens,
            'avg_duration': avg_duration,
            'tokens_per_second': total_tokens / sum(inv['duration'] for inv in recent_invocations)
        }

# 在模型类中集成指标收集
class MetricsEnabledSmartAIModel(SmartAILargeLanguageModel):
    def __init__(self):
        super().__init__()
        self.metrics = ProviderMetrics()
    
    def _invoke(self, *args, **kwargs):
        start_time = time.time()
        success = False
        tokens = 0
        
        try:
            result = super()._invoke(*args, **kwargs)
            success = True
            
            if hasattr(result, 'usage') and result.usage:
                tokens = result.usage.total_tokens
            
            return result
        finally:
            duration = time.time() - start_time
            self.metrics.record_invoke(
                model=kwargs.get('model', 'unknown'),
                duration=duration,
                tokens=tokens,
                success=success
            )

7.4 缓存策略优化

对于一些昂贵的操作，实现缓存可以显著提升性能：

import hashlib
import json
from functools import wraps
from typing import Any, Dict, Optional

class LRUCache:
    """
    简单的 LRU 缓存实现
    """
    def __init__(self, capacity: int = 1000):
        self.capacity = capacity
        self.cache = {}
        self.order = []
    
    def get(self, key: str) -> Optional[Any]:
        if key in self.cache:
            # 移动到最前面
            self.order.remove(key)
            self.order.append(key)
            return self.cache[key]
        return None
    
    def put(self, key: str, value: Any):
        if key in self.cache:
            self.order.remove(key)
        elif len(self.cache) >= self.capacity:
            # 移除最久未使用的项
            oldest = self.order.pop(0)
            del self.cache[oldest]
        
        self.cache[key] = value
        self.order.append(key)

def cache_embeddings(cache_ttl: int = 3600):
    """
    向量化结果缓存装饰器
    """
    cache = LRUCache(capacity=10000)
    
    def decorator(func):
        @wraps(func)
        def wrapper(self, model: str, credentials: dict, texts: List[str], **kwargs):
            # 生成缓存键
            cache_key = hashlib.md5(
                json.dumps({
                    'model': model,
                    'texts': sorted(texts),  # 排序确保一致性
                    'provider': self.__class__.__name__
                }, sort_keys=True).encode()
            ).hexdigest()
            
            # 尝试从缓存中获取
            cached_result = cache.get(cache_key)
            if cached_result:
                cache_time, result = cached_result
                if time.time() - cache_time < cache_ttl:
                    return result
            
            # 缓存未命中，执行实际调用
            result = func(self, model, credentials, texts, **kwargs)
            
            # 存入缓存
            cache.put(cache_key, (time.time(), result))
            
            return result
        return wrapper
    return decorator

# 在向量模型中使用缓存
class CachedSmartAITextEmbeddingModel(SmartAITextEmbeddingModel):
    @cache_embeddings(cache_ttl=3600)  # 缓存1小时
    def _invoke(self, *args, **kwargs):
        return super()._invoke(*args, **kwargs)

八、常见问题和解决方案

8.1 调试技巧汇总

在 Provider 开发过程中，这些调试技巧能帮你快速定位问题：

1. 请求响应日志记录

import logging
import json

logger = logging.getLogger(__name__)

def log_api_call(func):
    """
    API 调用日志装饰器
    """
    @wraps(func)
    def wrapper(*args, **kwargs):
        # 记录请求信息（注意不要记录敏感信息）
        request_info = {
            'method': func.__name__,
            'model': kwargs.get('model'),
            'stream': kwargs.get('stream', False)
        }
        logger.info(f"API Request: {json.dumps(request_info)}")
        
        try:
            result = func(*args, **kwargs)
            logger.info(f"API Success: {func.__name__}")
            return result
        except Exception as e:
            logger.error(f"API Error: {func.__name__} - {str(e)}")
            raise
    return wrapper

2. 响应数据结构验证

from pydantic import BaseModel, ValidationError
from typing import List, Optional

class SmartAIResponse(BaseModel):
    """
    SmartAI API 响应结构验证
    """
    choices: List[dict]
    usage: Optional[dict] = None
    model: str
    
    class Config:
        extra = "allow"  # 允许额外字段

def validate_response(response_data: dict) -> SmartAIResponse:
    """
    验证 API 响应格式
    """
    try:
        return SmartAIResponse(**response_data)
    except ValidationError as e:
        logger.error(f"Invalid response format: {e}")
        raise InvokeError(f"Invalid API response: {e}")

8.2 性能问题排查

1. 网络延迟分析

import time
import requests
from urllib3.util import parse_url

def diagnose_network_latency(base_url: str, api_key: str):
    """
    网络延迟诊断工具
    """
    parsed = parse_url(base_url)
    host = parsed.host
    
    # DNS 解析时间
    dns_start = time.time()
    socket.gethostbyname(host)
    dns_time = time.time() - dns_start
    
    # TCP 连接时间
    tcp_start = time.time()
    with socket.create_connection((host, parsed.port or 443), timeout=10):
        pass
    tcp_time = time.time() - tcp_start
    
    # HTTP 请求时间
    http_start = time.time()
    response = requests.get(
        f"{base_url}/models",  # 假设有这个轻量级端点
        headers={'Authorization': f'Bearer {api_key}'},
        timeout=30
    )
    http_time = time.time() - http_start
    
    return {
        'dns_resolution_ms': dns_time * 1000,
        'tcp_connection_ms': tcp_time * 1000,
        'http_request_ms': http_time * 1000,
        'total_latency_ms': (dns_time + tcp_time + http_time) * 1000
    }

2. 内存使用监控

import psutil
import tracemalloc
from contextlib import contextmanager

@contextmanager
def memory_profiler():
    """
    内存使用分析上下文管理器
    """
    tracemalloc.start()
    process = psutil.Process()
    initial_memory = process.memory_info().rss / 1024 / 1024  # MB
    
    try:
        yield
    finally:
        current, peak = tracemalloc.get_traced_memory()
        final_memory = process.memory_info().rss / 1024 / 1024
        tracemalloc.stop()
        
        logger.info(f"Memory usage - Initial: {initial_memory:.2f}MB, "
                   f"Final: {final_memory:.2f}MB, "
                   f"Peak traced: {peak / 1024 / 1024:.2f}MB")

# 使用示例
def _invoke_with_profiling(self, *args, **kwargs):
    with memory_profiler():
        return super()._invoke(*args, **kwargs)

8.3 错误处理最佳实践

1. 分层错误处理

class SmartAIException(Exception):
    """SmartAI 异常基类"""
    pass

class SmartAIAuthenticationError(SmartAIException):
    """认证错误"""
    pass

class SmartAIRateLimitError(SmartAIException):
    """速率限制错误"""
    def __init__(self, retry_after: int = None):
        self.retry_after = retry_after
        super().__init__(f"Rate limit exceeded. Retry after {retry_after} seconds")

class SmartAIModelError(SmartAIException):
    """模型相关错误"""
    pass

def handle_api_error(response: requests.Response):
    """
    统一的 API 错误处理
    """
    if response.status_code == 401:
        raise SmartAIAuthenticationError("Invalid API key")
    elif response.status_code == 429:
        retry_after = response.headers.get('Retry-After')
        raise SmartAIRateLimitError(int(retry_after) if retry_after else None)
    elif response.status_code >= 400:
        error_data = response.json() if response.content else {}
        error_message = error_data.get('error', {}).get('message', 'Unknown error')
        raise SmartAIException(f"API error {response.status_code}: {error_message}")

2. 优雅降级策略

def _invoke_with_fallback(self, 
                         model: str,
                         credentials: dict,
                         prompt_messages: List[PromptMessage],
                         **kwargs):
    """
    带降级策略的模型调用
    """
    fallback_models = {
        'smart-chat-v2': 'smart-chat-v1',
        'smart-reasoning-v2': 'smart-chat-v1'
    }
    
    try:
        return self._invoke(model, credentials, prompt_messages, **kwargs)
    except SmartAIModelError as e:
        if model in fallback_models:
            logger.warning(f"Model {model} failed, falling back to {fallback_models[model]}")
            return self._invoke(
                fallback_models[model], 
                credentials, 
                prompt_messages, 
                **kwargs
            )
        else:
            raise e