Speech-to-Text 价格

Speech-to-Text 的价格取决于服务每月成功处理的音频量（以 1 秒为增量计算）。如果 API 返回响应，则表明请求中发送的音频已成功处理。其中包括空响应，这表示 API 处理了音频但无法进行转写。导致服务器错误的请求不会被计为成功处理，因此不会产生任何费用。

您可以在 Google Cloud 控制台中查看当前结算状态，包括用量和当前账单。如需详细了解如何管理您的账号，请参阅 Cloud Billing 文档或 Cloud Billing 支持。

Speech-to-Text V2 API

下表中的价格适用于 Speech-to-Text v2 API 每月处理的音频时长（分钟）。

标准识别模型

类别	模型	价格 (USD)
识别 (sku:3099-B70F-0949)	标准	0 minute to 500,000 minute US$0.016 / 1 minute, per 1 month / account 500,000 minute to 1,000,000 minute US$0.01 / 1 minute, per 1 month / account 1,000,000 minute to 2,000,000 minute US$0.008 / 1 minute, per 1 month / account 2,000,000 minute and above US$0.004 / 1 minute, per 1 month / account

类别

模型

价格 (USD)

识别

(sku:3099-B70F-0949)

标准

0 minute to 500,000 minute

US$0.016 / 1 minute, per 1 month / account

500,000 minute to 1,000,000 minute

US$0.01 / 1 minute, per 1 month / account

1,000,000 minute to 2,000,000 minute

US$0.008 / 1 minute, per 1 month / account

2,000,000 minute and above

US$0.004 / 1 minute, per 1 month / account

标准动态批量识别

类别	模型	每分钟
动态批量识别 (sku:7700-6778-EF8E)	Standard¹	US$0.003 / 1 minute, per 1 month / account

类别

模型

每分钟

动态批量识别

(sku:7700-6778-EF8E)

Standard¹

US$0.003 / 1 minute, per 1 month / account

Standard¹ 模型包括：default、command_and_search、latest_short、latest_long、phone_call、video、chirp（仅限 Speech-to-Text V2）
Medical² 模型包括：medical_conversation、medical_dictation
每个请求的时长均以 1 秒为增量向上取整到最近的数字

Speech-to-Text V1 API

下表中的价格适用于 Speech-to-Text v1 API 每月处理的音频时长（分钟）。

类别	模型	价格 (USD)
语音识别（具有数据日志记录） sku:67F5-A183-E319	Standard¹	0 minute to 60 minute US$0.00 (Free) / 1 minute, per 1 month / account 60 minute and above US$0.016 / 1 minute, per 1 month / account
语音识别（无数据日志记录） sku:60AE-2FE3-C3D8	Standard¹	0 minute to 60 minute US$0.00 (Free) / 1 minute, per 1 month / account 60 minute and above US$0.024 / 1 minute, per 1 month / account

类别

模型

价格 (USD)

语音识别（具有数据日志记录）

sku:67F5-A183-E319

Standard¹

0 minute to 60 minute

US$0.00 (Free) / 1 minute, per 1 month / account

60 minute and above

US$0.016 / 1 minute, per 1 month / account

语音识别（无数据日志记录）

sku:60AE-2FE3-C3D8

Standard¹

0 minute to 60 minute

US$0.00 (Free) / 1 minute, per 1 month / account

60 minute and above

US$0.024 / 1 minute, per 1 month / account

Standard¹ 模型包括：default、command_and_search、latest_short、latest_long、phone_call、video、chirp（仅限 Speech-to-Text V2）
每个请求的时长均以 1 秒为增量向上取整到最近的数字

医疗模型

类别	模型	价格 (USD)
医疗口录 (sku:6649-62EF-CB8F)	Medical²	0 minute to 60 minute US$0.00 (Free) / 1 minute, per 1 month / account 60 minute and above US$0.078 / 1 minute, per 1 month / account
医疗对话 (sku:7247-19E1-FB4D)	Medical²	0 minute to 60 minute US$0.00 (Free) / 1 minute, per 1 month / account 60 minute and above US$0.078 / 1 minute, per 1 month / account

类别

模型

价格 (USD)

医疗口录

(sku:6649-62EF-CB8F)

Medical²

0 minute to 60 minute

US$0.00 (Free) / 1 minute, per 1 month / account

60 minute and above

US$0.078 / 1 minute, per 1 month / account

医疗对话

(sku:7247-19E1-FB4D)

Medical²

0 minute to 60 minute

US$0.00 (Free) / 1 minute, per 1 month / account

60 minute and above

US$0.078 / 1 minute, per 1 month / account

Medical² 模型包括：medical_conversation、medical_dictation

价格要素

Speech-to-Text 的价格取决于以下因素：

正在识别的音频中的通道数量
您发送的音频的时长和数量
您使用的识别模型
您使用的批处理方法
您使用的 API 版本

多通道

每个音频通道均单独结算。如果您发送包含多个通道的请求，则系统会根据所处理的所有通道中的音频总时长向您收取费用。该时间计费与每月用量限额的跟踪方式不同。用量限额不考虑多个通道，仅由音频文件的时长决定。例如，如果您发送了一个包含 30 秒音频和 4 个通道的请求，系统将向您收取 120 秒的费用，但只有 30 秒会计入您的每月配额。如需了解详情，请参阅配额和限制页面。

动态批处理

Speech-to-Text V2 API 有一个使用动态批处理的选项。动态批处理以较低的紧急程度处理音频。如果您启用动态批处理，系统会按折扣费率向您收费。

大型工作负载

对于工作负载非常大的客户，可能会提供额外的批量折扣。如需了解详情，请与销售人员联系。

Google Cloud 价格

如果您在 Google Cloud Storage 中存储要识别的音频文件，或者使用 Speech-to-Text 的同时还使用了其他 Google Cloud 资源（例如 Google App Engine 实例），则您还需要支付使用这些服务所产生的费用。请使用 Google Cloud 的价格计算器根据当前费率确定其他费用。

后续步骤

申请自定义报价

Google Cloud 采用随用随付的价格模式，您只需为实际使用的服务付费。请与我们的销售团队联系，获取为贵组织量身定制的报价。