Method: projects.agent.sessions.detectIntent

Processes a natural language query and returns structured, actionable data as a result. This method is not idempotent, because it may cause contexts and session entity types to be updated, which in turn might affect results of future queries.

HTTP request


The URL uses gRPC Transcoding syntax.

Path parameters



Required. The name of the session this query is sent to. Format: projects/<Project ID>/agent/sessions/<Session ID>, or projects/<Project ID>/agent/environments/<Environment ID>/users/<User ID>/sessions/<Session ID>. If Environment ID is not specified, we assume default 'draft' environment. If User ID is not specified, we are using "-". It's up to the API caller to choose an appropriate Session ID and User Id. They can be a random number or some type of user and session identifiers (preferably hashed). The length of the Session ID and User ID must not exceed 36 characters.

For more information, see the API interactions guide.

Authorization requires the following IAM permission on the specified resource session:

  • dialogflow.sessions.detectIntent

Request body

The request body contains data with the following structure:

JSON representation
  "queryParams": {
    object (QueryParameters)
  "queryInput": {
    object (QueryInput)
  "outputAudioConfig": {
    object (OutputAudioConfig)
  "outputAudioConfigMask": string,
  "inputAudio": string

object (QueryParameters)

The parameters of this query.


object (QueryInput)

Required. The input specification. It can be set to:

  1. an audio config which instructs the speech recognizer how to process the speech audio,

  2. a conversational query in the form of text, or

  3. an event that specifies which intent to trigger.


object (OutputAudioConfig)

Instructs the speech synthesizer how to generate the output audio. If this field is not set and agent-level speech synthesizer is not configured, no output audio is generated.


string (FieldMask format)

Mask for outputAudioConfig indicating which settings in this request-level config should override speech synthesizer settings defined at agent-level.

If unspecified or empty, outputAudioConfig replaces the agent-level config in its entirety.

A comma-separated list of fully qualified names of fields. Example: "user.displayName,photo".


string (bytes format)

The natural language speech audio to be processed. This field should be populated iff queryInput is set to an input audio config. A single request can contain up to 1 minute of speech audio data.

A base64-encoded string.

Response body

If successful, the response body contains an instance of DetectIntentResponse.

Authorization Scopes

Requires one of the following OAuth scopes:


For more information, see the Authentication Overview.