此页面由 Cloud Translation API 翻译。

设置 Speech-to-Text 模型自适应

客服助手使用 Speech-to-Text 模型自适应技术，通过更频繁地识别某些短语来提高转写质量。本页面提供了有关为 Speech-to-Text 转写设置模型自适应的指南。

使用 Speech-to-Text 控制台

您只能使用 Speech-to-Text 控制台创建全局短语集。必须使用 Speech-to-Text API 创建区域短语集。

在 Google Cloud 控制台中，前往 Speech-to-Text 页面。前往“Speech-to-Text”
点击模型自适应。
点击 add_box 新建资源。
选择词组集资源和 API 版本 V1，然后填写词组和权重值，最后复制词组集名称。
点击保存。
前往 Agent Assist 控制台。
点击对话资料，然后选择要修改的对话资料。
前往词组集部分，然后粘贴词组集名称。

使用 Speech-to-Text API

按照语音识别说明创建短语集脚本。

运行以下 Python 脚本以更新对话个人资料：

# Conversation Profile to update
PROJECT_ID = "sample-project"
LOCATION = "global"
CONVERSATION_PROFILE_ID = "sample-conversation-profile"
# Speech model adaptation resource names
SPEECH_ADAPTATION_PHRASES = ["projects/sample-project/locations/global/phraseSets/sample-phrase-sets"]

import google.auth
from google.auth.transport.requests import AuthorizedSession

scopes=['https://www.googleapis.com/auth/cloud-platform']
credentials, project = google.auth.default(
  scopes=scopes,
  quota_project_id=PROJECT_ID,
)
session = AuthorizedSession(credentials)

profile_url = f"https://dialogflow.googleapis.com/v2beta1/projects/{PROJECT_ID}/locations/{LOCATION}/conversationProfiles/{CONVERSATION_PROFILE_ID}"
get_response = session.get(profile_url)
print("Checking for existing ConversationProfile...")
print(get_response.status_code)
print(get_response.json())
if get_response.status_code == 200:
  patch_response = session.patch(
     profile_url,
     params={
        "updateMask": "sttConfig.phraseSets"
     },
     json={
        "sttConfig": {
           "phraseSets": SPEECH_ADAPTATION_PHRASES
        }
     }
  )
  print("Updating ConversationProfile...")
  print(patch_response.status_code)
  print(patch_response.json())

区域短语集

虽然语音转文字模型自适应功能仅支持英语 (en-US)，但您可以使用 Speech-to-Text API 为其他语言区域配置短语集。这在转写这些区域进行的英语对话时特别有用。

使用以下示例命令通过 Speech-to-Text API 创建地区性短语集。

curl -X POST \
    -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    -H "Content-Type: application/json; charset=utf-8" \
    -H "X-Goog-User-Project: sample_project" \
    -d @sample_phrase_sets.json \
"https://us-speech.googleapis.com/v1/projects/sample-project/locations/us/phraseSets"

JSON 文件 @sample_phrase_sets.json 包含以下短语集内容：

{
  "parent": "projects/sample-project/locations/us",
  "phraseSetId": "sample-phrase-sets",
  "phraseSet": {
    "name": "sample-phrase-sets",
    "phrases": [
      {
        "value": "Some phrase",
        "boost": 20
      }
    ]
  }
}

对于单个 Dialogflow 区域中的对话资料，下表显示了用于创建短语集的相应 Speech-to-Text 区域。

Dialogflow 区域	Speech-to-Text 区域
us us-central1 us-east1 us-east7 us-west1 northamerica-northeast1 northamerica-northeast2	我们
eu europe-west1 europe-west2 europe-west3 europe-west4	欧盟
australia-southeast1 asia-northeast1 asia-south1 asia-southeast1 me-west1 全球	全局