此页面由 Cloud Translation API 翻译。

使用 LangChain 代理

除了使用代理的一般说明之外，本页还介绍了特定于 LangchainAgent 的功能。

准备工作

本教程假定您已阅读并遵循以下说明：

开发 LangChain 代理：将 agent 作为 LangchainAgent 的实例进行开发。
用户身份验证，以用户身份进行身份验证，以便查询代理。

支持的操作

LangchainAgent 支持以下操作：

query：用于同步获取对查询的响应。
stream_query：用于对查询的响应进行流式传输。

query 和 stream_query 方法都支持相同类型的参数：

input：要发送给智能体的邮件。
config：查询上下文的配置（如果适用）。

查询代理

命令：

agent.query(input="What is the exchange rate from US dollars to Swedish currency?")

等效于以下（完整形式）：

agent.query(input={
    "input": [ # The input is represented as a list of messages (each message as a dict)
        {
            # The role (e.g. "system", "user", "assistant", "tool")
            "role": "user",
            # The type (e.g. "text", "tool_use", "image_url", "media")
            "type": "text",
            # The rest of the message (this varies based on the type)
            "text": "What is the exchange rate from US dollars to Swedish currency?",
        },
    ]
})

角色用于帮助模型在响应时区分不同类型的消息。如果输入中省略了 role，则默认为 "user"。

角色	说明
`system`	用于告知聊天模型如何表现并提供其他上下文。并非所有聊天模型提供商都支持此功能。
`user`	表示与模型互动的用户的输入，通常采用文本或其他互动输入的形式。
`assistant`	表示模型的回答，其中可能包含文本或调用工具的请求。
`tool`	用于在检索外部数据或处理后将工具调用的结果传回模型的消息。

消息的 type 还将决定如何解读消息的其余部分（请参阅处理多模态内容）。

使用多模态内容查询客服人员

我们将使用以下代理（该代理会将输入转发给模型，并且不使用任何工具）来演示如何向代理传入多模态输入：

agent = agent_engines.LangchainAgent(
    model="gemini-1.5-pro-preview-0409",
    runnable_builder=lambda model, **kwargs: model,
)

多模态消息通过指定 type 和相应数据的内容块表示。通常，对于多模式内容，您应将 type 指定为 "media"，将 file_uri 指定为指向 Cloud Storage URI，并将 mime_type 用于解读文件。

图片

agent.query(input={"input": [
    {"type": "text", "text": "Describe the attached media in 5 words!"},
    {"type": "media", "mime_type": "image/jpeg", "file_uri": "gs://cloud-samples-data/generative-ai/image/cricket.jpeg"},
]})

视频

agent.query(input={"input": [
    {"type": "text", "text": "Describe the attached media in 5 words!"},
    {"type": "media", "mime_type": "video/mp4", "file_uri": "gs://cloud-samples-data/generative-ai/video/pixel8.mp4"},
]})

音频

agent.query(input={"input": [
    {"type": "text", "text": "Describe the attached media in 5 words!"},
    {"type": "media", "mime_type": "audio/mp3", "file_uri": "gs://cloud-samples-data/generative-ai/audio/pixel.mp3"},
]})

如需查看 Gemini 支持的 MIME 类型列表，请参阅以下文档：

使用可运行的配置查询代理

在查询代理时，您还可以为代理指定 config（遵循 RunnableConfig 的架构）。下面列出了两种常见场景：

默认配置参数：
- run_id / run_name：运行作业的标识符。
- tags / metadata：使用 OpenTelemetry 进行跟踪时运行的类别分类器。
自定义配置参数（通过 configurable）：
- session_id：运行时所处的会话（请参阅存储聊天记录）。
- thread_id：运行所在的线程（请参阅存储检查点）。

例如：

import uuid

run_id = uuid.uuid4()  # Generate an ID for tracking the run later.

response = agent.query(
    input="What is the exchange rate from US dollars to Swedish currency?",
    config={  # Specify the RunnableConfig here.
        "run_id": run_id                               # Optional.
        "tags": ["config-tag"],                        # Optional.
        "metadata": {"config-key": "config-value"},    # Optional.
        "configurable": {"session_id": "SESSION_ID"}   # Optional.
    },
)

print(response)

后续步骤

管理已部署的代理。