bq command-line tool reference
This document describes the syntax, commands, flags, and arguments for bq,
the Python-based command-line tool for BigQuery.
For a tutorial on using the bq command-line tool, see Load and query data with the bq tool.
Ways to use the bq command-line tool
You can enter bq command-line tool commands either in Cloud Shell from the Google Cloud console or from a local installation of the Google Cloud CLI.
- To use the bq command-line tool from the Google Cloud console, activate Cloud Shell.
- To use the bq command-line tool locally, install and configure the gcloud CLI. To keep your installation current, see Manage an installation in the gcloud CLI documentation. 
Command format
The bq command-line tool uses the following format:
bq COMMAND [FLAGS] [ARGUMENTS]
Some flags can be used with multiple bq command-line tool commands; these flags are described in the Global flags section.
Other flags are command-specific; they can only be used with a particular bq command-line tool command. The command-specific flags are described in the command sections.
Specifying values for flags
When you specify a value for a flag, the equals sign (=) is optional. For
example, the following two commands are equivalent:
bq ls --format prettyjson myDataset
bq ls --format=prettyjson myDataset
This document uses the equals sign for clarity.
Some bq command-line tool flags are boolean; you can set the flag's value to either
true or false.
The bq command-line tool accepts the following formats for setting boolean flags.
| Value | Format | Example | 
|---|---|---|
| true | --FLAGNAME=true | --debug_mode=true | 
| true | --FLAGNAME | --debug_mode | 
| false | --FLAGNAME=false | --debug_mode=false | 
| false | --noFLAGNAME | --nodebug_mode | 
This document uses the --FLAGNAME=VALUE
format for boolean flags.
All boolean flags are optional; if a boolean flag is not present, then BigQuery uses the flag's default value.
Specifying BigQuery resources in arguments
The format for specifying a resource depends on the context; in some cases the
separator between the project and dataset is a colon (:) and in some cases, it
is a period (.). The following table
describes how to specify a BigQuery table in different contexts.
| Context | Format | Example | 
|---|---|---|
| bq command-line tool | PROJECT:DATASET.TABLE | myProject:myDataset.myTable | 
| GoogleSQL query | PROJECT.DATASET.TABLE | myProject.myDataset.myTable | 
| Legacy SQL query | PROJECT:DATASET.TABLE | myProject:myDataset.myTable | 
If you don't specify a project, then BigQuery uses the current
project. For example, if the current project is myProject, then
BigQuery interprets myDataset.myTable as
myProject:myDataset.myTable (or myProject.myDataset.myTable).
Some resource identifiers must be quoted using back ticks (`).
If your resource identifier begins with a letter or underscore character, and
contains only characters that are letters, numbers, and underscores, then you
don't need to quote it.  However, if your resource identifier
contains other types of
characters, or reserved keywords, you need to surround the identifier (or the
part of the identifier with the special characters or reserved keywords) with
back ticks.  For more information, see
Identifiers.
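For example, a project ID that contains hyphens, such as the public bigquery-public-data project, must be enclosed in back ticks when it's referenced in a GoogleSQL query, while a dataset or table name made up only of letters, numbers, and underscores needs no quoting:

bq query --nouse_legacy_sql \
'SELECT word FROM `bigquery-public-data`.samples.shakespeare LIMIT 5'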
How to run commands
Place any global flags before the bq command, and then
include command-specific flags. You can include multiple global or
command-specific flags. For example:
bq --location=us mk --reservation --project_id=project reservation_name
You can specify command arguments in the following ways:
- --FLAG ARGUMENT (as shown in the previous examples)
- --FLAG=ARGUMENT
- --FLAG='ARGUMENT'
- --FLAG="ARGUMENT"
- --FLAG 'ARGUMENT'
- --FLAG "ARGUMENT"
Replace the following:
- FLAG: a global or command-specific flag
- ARGUMENT: the flag's argument
Some commands require the use of quotes around arguments. If quotes are required, either single or double quotes are acceptable. Arguments that require quotes are typically values that contain spaces, commas, or other special characters. If your argument contains a BigQuery resource, be sure to follow the rules for specifying resource names in commands.
This example shows how to run a GoogleSQL query on the command line:
bq query --nouse_legacy_sql \
'SELECT COUNT(*) FROM `bigquery-public-data`.samples.shakespeare'
Flags with boolean values can be specified without an argument. If you specify
true or false, then you must use the --FLAGNAME=VALUE format.
For example, this command specifies false for the boolean flag
--use_legacy_sql by placing no at the front of the flag:
bq query --nouse_legacy_sql \
'SELECT COUNT(*) FROM `bigquery-public-data`.samples.shakespeare'
Alternatively, to specify false as the flag's argument, you can enter the
following:
bq query --use_legacy_sql=false \
'SELECT COUNT(*) FROM `bigquery-public-data`.samples.shakespeare'
Run commands in a script
You can run the bq command-line tool in a script, as you would run a
Google Cloud CLI command. The following is an
example of gcloud and bq commands in a bash script:
#!/bin/bash
gcloud config set project myProject
bq query --use_legacy_sql=false --destination_table=myDataset.myTable \
'SELECT
   word,
   SUM(word_count) AS count
 FROM
   `bigquery-public-data`.samples.shakespeare
 WHERE
   word LIKE "%raisin%"
 GROUP BY
   word'
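Because the default --synchronous_mode=true makes bq wait for the job and return the job's completion status as its exit code, a script can branch on that status. The following sketch, using hypothetical table names, is one way to do this:

#!/bin/bash
# The exit code is nonzero if the query job fails.
if ! bq query --use_legacy_sql=false 'SELECT COUNT(*) FROM myDataset.myTable'; then
  echo "Query job failed" >&2
  exit 1
fi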
Use a service account
You can use a service account to make authorized API calls or run query jobs on your behalf. To use a service account in the bq command-line tool, authorize access to Google Cloud from the service account. For more information, see gcloud auth activate-service-account.
To start running bq commands using
service account impersonation,
run the following command:
gcloud config set auth/impersonate_service_account SERVICE_ACCOUNT_NAME
Replace SERVICE_ACCOUNT_NAME with your service account
name.
bq commands that you run now use the service account credentials.
To stop running bq commands from a service account, run the following command:
gcloud config unset auth/impersonate_service_account
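For example, a complete impersonation session might look like the following, where the service account name and dataset are placeholders:

gcloud config set auth/impersonate_service_account \
    myServiceAccount@myProject.iam.gserviceaccount.com
bq ls myDataset
gcloud config unset auth/impersonate_service_account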
Set default values for command-line flags
You can set default values for command-line flags by including them in the
bq command-line tool's configuration file, .bigqueryrc. Before you configure your
default options, you must first create a .bigqueryrc file. You can use your
preferred text editor to create the file. After you create the .bigqueryrc
file, you can specify the path to the file using the --bigqueryrc global flag.
If the --bigqueryrc flag is not specified, then the BIGQUERYRC environment
variable is used. If that is not set, then the default path $HOME/.bigqueryrc is
used.
How to add flags to .bigqueryrc
To add default values for command-line flags to .bigqueryrc:
- Place global flags at the top of the file without a header.
- For command-specific flags, enter the command name (in brackets) and add the command-specific flags (one per line) after the command name.
For example:
--apilog=stdout
--format=prettyjson
--location=US

[query]
--use_legacy_sql=false
--max_rows=100
--maximum_bytes_billed=10000000

[load]
--destination_kms_key=projects/myproject/locations/mylocation/keyRings/myRing/cryptoKeys/myKey
The preceding example sets default values for the following flags:
- The global flag --apilog is set to stdout to print debugging output to the Google Cloud console.
- The global flag --format is set to prettyjson to display command output in a human-readable JSON format.
- The global flag --location is set to the US multi-region location.
- The query command-specific flag --use_legacy_sql is set to false to make GoogleSQL the default query syntax.
- The query command-specific flag --max_rows is set to 100 to control the number of rows in the query output.
- The query command-specific flag --maximum_bytes_billed is set to 10,000,000 bytes (10 MB) to fail queries that read more than 10 MB of data.
- The load command-specific flag --destination_kms_key is set to projects/myproject/locations/mylocation/keyRings/myRing/cryptoKeys/myKey.
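If you keep more than one configuration file, you can point the tool at a specific file with the --bigqueryrc global flag. For example, assuming a hypothetical alternate file at $HOME/.bigqueryrc.prod:

bq --bigqueryrc=$HOME/.bigqueryrc.prod ls myDataset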
CLI help
You can get help with the bq command-line tool by running the following commands:
| Description | Help command format | Example | 
|---|---|---|
| Installed version | bq version | bq version | 
| List of all commands with examples | bq help | bq help | 
| Description of global flags | bq --help | bq --help | 
| Description of a particular command | bq help COMMAND | bq help mk | 
Troubleshooting CLI commands
- To log requests sent and received:
  - Add the --apilog=PATH_TO_FILE flag to save a log of operations to a local file. Replace PATH_TO_FILE with the location where you want to save the log. The bq command-line tool works by making standard REST-based API calls, which can be useful to see when troubleshooting. It's also useful to attach this log when you're reporting issues to Cloud Customer Care.
  - Using - or stdout instead of a path prints the log to the Google Cloud console. Setting --apilog to stderr outputs the log to the standard error file. To log more requests, use the --httplib2_debuglevel=LOG_LEVEL flag. A higher LOG_LEVEL logs more information about the HTTP requests.
- To troubleshoot errors:
  - Enter the --format=prettyjson flag when you get a job's status or when you view detailed information about resources such as tables and datasets. Using this flag outputs the response in JSON format, including the reason property. You can use the reason property to find error messages.
  - For more information about errors when you run a command, use the --debug_mode flag.
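For example, a single troubleshooting run that combines these flags might look like the following; the log file path and dataset name are placeholders, and HTTP details are printed to stderr:

bq --apilog=/tmp/bq.log --httplib2_debuglevel=1 --format=prettyjson ls myDataset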
Global flags
You can use the following flags with any bq command, where applicable:
- --api=ENDPOINT
- Specifies the API endpoint to call. The default value is
https://www.googleapis.com.
- --api_version=VERSION
- Specifies the API version to use. The default is v2.
- --apilog=FILE
- Logs all API requests and responses to the file specified by FILE. Possible values are the following:
- the path to a file: logs to the specified file
- stdout: logs to standard output
- stderr: logs to standard error
- false: API requests and responses are not logged (default)
- --use_google_auth={true|false}
- If set to true, enables authentication using Google Auth libraries. The default value is true.
- --bigqueryrc=PATH
- Specifies the path to the bq command-line tool configuration file. If you don't specify the --bigqueryrc flag, then the command uses the BIGQUERYRC environment variable. If the environment variable is not set, then $HOME/.bigqueryrc is used. For more information, see Setting default values for command-line flags.
- --ca_certificates_file=PATH
- Specifies the location of your Certificate Authority Service (CA) file. 
- --dataset_id=DATASET_ID
- Specifies the default dataset to use with the command. This flag is ignored when not applicable. You can specify the DATASET_ID argument using the format PROJECT:DATASET or DATASET. If the PROJECT part is missing, then the default project is used. You can override the default project setting by specifying the --project_id flag.
- --debug_mode={true|false}
- If set to true, shows tracebacks on Python exceptions. The default value is false.
- --disable_ssl_validation={true|false}
- If set to true, disables HTTPS certificate validation. The default value is false.
- --discovery_file=PATH
- Specifies the JSON file to read for discovery. 
- --enable_gdrive={true|false}
- If set to false, requests a new OAuth token without the Google Drive scope. The default value is true; requests a new OAuth token with the Drive scope. To set this flag to false when authenticated using a user account, the --use_google_auth flag must be set to false.
- --fingerprint_job_id={true|false}
- To use a job ID that is derived from a fingerprint of the job configuration, set to true. This prevents the same job from running multiple times accidentally. The default value is false.
- --format=FORMAT
- Specifies the format of the command's output. Use one of the following values:
- pretty: formatted table output
- sparse: simpler table output
- prettyjson: easy-to-read JSON format
- json: maximally compact JSON
- csv: csv format with header
pretty, sparse, and prettyjson are intended to be human-readable. json and csv are intended to be used by another program. If none is specified, then the command produces no output. If the --format flag is absent, then an appropriate output format is chosen based on the command.
- --headless={true|false}
- To run the bq session without user interaction, set to true. For example, debug_mode does not break into the debugger, and the frequency of informational printing is lowered. The default value is false.
- --httplib2_debuglevel=DEBUG_LEVEL
- Specifies whether to show HTTP debugging information. If DEBUG_LEVEL is greater than 0, then the command logs HTTP server requests and responses to stderr, in addition to error messages. If DEBUG_LEVEL is 0, or if the --httplib2_debuglevel flag is not used, then only error messages are provided.

  For example:

  --httplib2_debuglevel=1
- --job_id=JOB_ID
- Specifies a job identifier for a new job. This flag applies only to commands that create jobs: cp, extract, load, and query. If you don't use the --job_id flag, then the commands generate a unique job identifier. For more information, see Running jobs programmatically.
- --job_property=KEY:VALUE
- A key-value pair to include in the properties field of the job configuration. Repeat this flag to specify additional properties. 
- --location=LOCATION
- A string corresponding to a region or multi-region location. The --location flag is required for the bq cancel command and for the bq show command when you use the --jobs flag to show information about jobs. The --location flag is optional for the following commands:
- query
- cp
- load
- extract
- partition
- update
- wait
- mk when you use the --dataset, --reservation, --capacity_commitment, or --reservation_assignment flags
- ls when you use the --reservation, --capacity_commitment, or --reservation_assignment flags

All other commands ignore the --location flag.
- --max_rows_per_request=MAX_ROWS
- An integer that specifies the maximum number of rows to return per read. 
- --project_id=PROJECT
- Specifies the project to use for commands. 
- --proxy_address=PROXY
- Specifies the name or IP address of the proxy host to use for connecting to Google Cloud. 
- --proxy_password=PASSWORD
- Specifies the password to use when authenticating with the proxy host. 
- --proxy_port=PORT
- Specifies the port number to use to connect to the proxy host. 
- --proxy_username=USERNAME
- Specifies the username to use when authenticating with the proxy host. 
- --quiet={true|false} or -q={true|false}
- To suppress status updates while jobs are running, set to - true. The default value is- false.
- --synchronous_mode={true|false} or -sync={true|false}
- To create the job and immediately return, with a successful completion status as the error code, set to - false. If set to- true, then the command waits for the job to complete before returning, and returns the job completion status as the error code. The default value is- true.
- --trace=token:TOKEN
- Specifies a tracing token to include in API requests. 
- --use_regional_endpoints={true|false}
- In preview. To connect to a regional endpoint, set the --use_regional_endpoints flag to true and the --location flag to the region you want to connect to. The default value is false.
Deprecated global flags
The following global flag for specifying bq command-line tool flags from a file is deprecated. To specify flags from a file, use the --bigqueryrc flag.

- --flagfile=PATH
- When specified, flag definitions from the supplied file are inserted into the bq command-line tool. The default value is ''. For more information, see Setting default values for command-line flags.
Commands
The following sections describe the bq command-line tool commands, along with their command-specific flags and arguments.
bq add-iam-policy-binding
Use the bq add-iam-policy-binding command to retrieve the
Identity and Access Management (IAM) policy
for a table or view and add a binding to the
policy, in one step.
This command is an alternative to the following three-step process:
- Using the bq get-iam-policycommand to retrieve the policy file (in JSON format).
- Editing the policy file.
- Using the bq set-iam-policycommand to update the policy with a new binding.
Synopsis
bq add-iam-policy-binding [FLAGS] --member=MEMBER_TYPE:MEMBER --role=ROLE [--table] RESOURCE
Example
bq add-iam-policy-binding --member=user:myAccount@gmail.com \
--role=roles/bigquery.dataViewer myDataset.myTable
Flags and arguments
The bq add-iam-policy-binding command uses the following flags and
arguments:
- --member=MEMBER_TYPE:MEMBER
- Required. Use the --member flag to specify the member part of the IAM policy binding. The --member flag is required along with the --role flag. One combination of --member and --role flags equals one binding.

The MEMBER_TYPE value specifies the type of member in the IAM policy binding. Use one of the following values:
- user
- serviceAccount
- group
- domain

The MEMBER value specifies the email address or domain of the member in the IAM policy binding.
- --role=ROLE
- Required. Specifies the role part of the IAM policy binding. The --role flag is required along with the --member flag. One combination of --member and --role flags equals one binding.
- --table={true|false}
- To return an error if the RESOURCE argument is not a table or view identifier, set the --table flag to true. The default value is false. This flag is supported for consistency with other commands.
- RESOURCE
- The table or view whose policy you want to add a binding to.
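For example, to bind a group instead of a user and to assert that the resource is a table, you might run the following; the group and table names are placeholders:

bq add-iam-policy-binding --member=group:myGroup@example.com \
--role=roles/bigquery.dataViewer --table=true myDataset.myTable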
For more information, see the IAM policy reference.
bq cancel
Use the bq cancel command to cancel BigQuery jobs.
Synopsis
bq [--synchronous_mode=false] cancel JOB_ID
Examples
bq cancel bqjob_12345
bq --synchronous_mode=false cancel bqjob_12345
Flags and arguments
The bq cancel command uses the following flags and arguments:
- --synchronous_mode=false
- If you don't want to wait for the bq cancel command to complete, set the global --synchronous_mode flag to false. The default is true.
- JOB_ID
- The job you want to cancel.
For more information about using the bq cancel command, see
Managing jobs.
bq cp
Use the bq cp command for the following tasks:
- Create a copy of a table, table clone, or table snapshot.
- Create a table clone.
- Create a table snapshot.
Synopsis
bq cp [FLAGS] SOURCE_TABLE DESTINATION_TABLE
Example
bq cp myDataset.myTable myDataset.myTableCopy
Flags and arguments
The bq cp command uses the following flags and arguments:
- --append_table={true|false} or -a={true|false}
- To append a table to an existing table, set to true. The default value is false.

You can't use the flag settings --append_table=true and --clone=true at the same time.
- --clone={true|false}
- To create a table clone, set to true. The base table can be a standard table, a table clone, or a table snapshot. The destination table is a table clone. The default is false; if neither --clone=true nor --snapshot=true is specified, then the destination table is the same type of table as the base table. Requires the --no_clobber flag.

You can't use the flag settings --append_table=true and --clone=true at the same time.
- --destination_kms_key=KEY
- Specifies a Cloud KMS key resource ID for encrypting the destination table data.

  For example:

  --destination_kms_key=projects/myProject/locations/global/keyRings/myKeyRing/cryptoKeys/myKey
- --expiration=SECONDS
- The number of seconds until a table snapshot expires. If not included, the table snapshot expiration is set to the default expiration of the dataset containing the new table snapshot. Use with the --snapshot flag.
- --force={true|false} or -f={true|false}
- To overwrite the destination table, if it exists, without prompting, set to true. The default value is false; if the destination table exists, then the command prompts for confirmation before overwriting.
- --no_clobber={true|false} or -n={true|false}
- To disallow overwriting the destination table, if it exists, set to true. The default value is false; if the destination table exists, then it is overwritten.
- --restore={true|false}
- This flag is being deprecated. To create a writeable table from a table snapshot, use the bq cp command or the bq cp --clone command.
- --snapshot={true|false}
- To create a table snapshot of the table that's specified in the SOURCE_TABLE argument, set to true. The base table can be a standard table, a table clone, or another table snapshot. The default is false; if neither --clone=true nor --snapshot=true is specified, then the destination table is the same type of table as the base table. Requires the --no_clobber flag.
- SOURCE_TABLE
- The table that you want to copy. 
- DESTINATION_TABLE
- The table that you want to copy to. 
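For example, the following sketches, with hypothetical table names, create a table clone and a table snapshot that expires after one day; both include --no_clobber=true, as the flag descriptions require:

bq cp --clone=true --no_clobber=true myDataset.myTable myDataset.myTableClone
bq cp --snapshot=true --no_clobber=true --expiration=86400 \
myDataset.myTable myDataset.myTableSnapshot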
For more information about using the cp command, see the following:
bq extract
Use the bq extract command to export table data to Cloud Storage.
Synopsis
bq extract [FLAGS] RESOURCE DESTINATION
Examples
bq extract --compression=GZIP --destination_format=CSV --field_delimiter=tab \
--print_header=false myDataset.myTable gs://my-bucket/myFile.csv.gzip
bq extract --destination_format=CSV --field_delimiter='|' myDataset.myTable \
gs://myBucket/myFile.csv
Flags and arguments
The bq extract command uses the following flags and arguments:
- --compression=COMPRESSION_TYPE
- Specifies the type of compression to use for exported files. Possible values are the following:
- GZIP
- DEFLATE
- SNAPPY
- NONE
The default value is NONE.

For information about which formats are supported for each compression type, see Export formats and compression types.
- --destination_format=FORMAT
- Specifies the format for the exported data. Possible values are the following:
- CSV
- NEWLINE_DELIMITED_JSON
- AVRO
- PARQUET
The default value is CSV.
- --field_delimiter=DELIMITER
- For CSV exports, specifies the character that marks the boundary between columns in the output file. The delimiter can be any ISO-8859-1 single-byte character. You can use \t or tab to specify tab delimiters.
- --print_header={true|false}
- To suppress printing header rows for formats that have headers, set to false. The default is true; header rows are included.
- RESOURCE
- The table that you are exporting from. 
- DESTINATION
- The storage location that receives the exported data. 
For more information about using the bq extract command, see
Exporting table data.
bq get-iam-policy
Use the bq get-iam-policy command to retrieve the
IAM policy
for a resource and
print it to stdout. The resource can be a table, a view, or a
slot reservation.
The policy is in JSON format.
Synopsis
bq get-iam-policy [FLAGS] RESOURCE
Examples
bq get-iam-policy myDataset.myTable
bq get-iam-policy --reservation myReservation
Flags and arguments
The bq get-iam-policy command uses the following flags and arguments:
- --table={true|false} or --t={true|false}
- To return an error if RESOURCE is not a table or view identifier, set the --table flag to true. The default value is false. This flag is supported for consistency with other commands.
- --reservation={true|false}
- To get the IAM policy of a reservation, set to true (Preview). The default value is false. When this flag is used, RESOURCE is treated as a reservation identifier. The reservation can have optional project and location prefixes: myProject:myLocation.myReservation.
- RESOURCE
- The table or view whose policy you want to get.
For more information about the bq get-iam-policy command, see
Control access to resources with IAM.
bq head
Use the bq head command to display the specified rows and columns of a table.
By default, it displays all columns of the first 100 rows.
Synopsis
bq head [FLAGS] [TABLE]
Example
bq head --max_rows=10 --start_row=50 --selected_fields=field1,field3 \
myDataset.myTable
Flags and arguments
The bq head command uses the following flags and arguments:
- --job=JOB or -j=JOB
- To read the results of a query job, specify this flag with a valid job ID.
- --max_rows=MAX or -n=MAX
- An integer that indicates the maximum number of rows to print when
showing table data. The default value is 100.
- --selected_fields=COLUMN_NAMES or -c=COLUMN_NAMES
- A comma-separated list that specifies a subset of fields (including nested and repeated fields) to return when showing table data. If this flag is not specified, then all columns are returned.
- --start_row=START_ROW or -s=START_ROW
- An integer that specifies the number of rows to skip
before showing table data. The default value is 0; the table data starts at the first row.
- --table={true|false} or -t={true|false}
- To return an error if the command argument is not a table or view, set to true. The default value is false. This flag is supported for consistency with other commands.
- TABLE
- The table whose data you want to retrieve.
For more information about using the bq head command, see
Managing table data.
bq help
Use the bq help command to display bq command-line tool documentation within the tool.
Synopsis
bq help [COMMAND]
Flags and arguments
The bq help command uses the following flags and arguments:
- COMMAND
- Specifies a particular bq command-line tool command that you want to get online help for.
bq insert
Use the bq insert command to insert rows of newline-delimited,
JSON-formatted data into a table from a file using the streaming insert. Data
types are converted to match
the column types of the destination table. This command is intended for testing
purposes only. To stream data into BigQuery, use the
insertAll API method.
Synopsis
bq insert [FLAGS] TABLE FILE
Examples
bq insert --ignore_unknown_values --template_suffix=_insert myDataset.myTable /tmp/myData.json
echo '{"a":1, "b":2}' | bq insert myDataset.myTable
Flags and arguments
The bq insert command uses the following flags and arguments:
- --ignore_unknown_values={true|false} or -i={true|false}
- When set to true, BigQuery ignores any key-value pairs that don't match the table's schema, and inserts the row with the data that does match the schema. When set to false, rows with data that doesn't match the table's schema are not inserted. The default is false.
- --skip_invalid_rows={true|false} or -s={true|false}
- When set to true, BigQuery attempts to insert any valid rows, even if invalid rows are present. When set to false, the command fails if any invalid rows are present. The default is false.
- --template_suffix=SUFFIX or -x=SUFFIX
- When specified, treat the destination table TABLE as a
base template, and insert the rows into an instance table named
{destination}{templateSuffix}. BigQuery creates the instance table using the schema of the base template.
- TABLE
- The table that you want to insert data into.
- FILE
- The file containing the data that you want to insert.
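For example, the following sketch writes two hypothetical rows to a local file and inserts them, ignoring unknown keys and skipping any invalid rows:

echo '{"a":1, "b":2}' > /tmp/myData.json
echo '{"a":3, "b":4}' >> /tmp/myData.json
bq insert --ignore_unknown_values --skip_invalid_rows myDataset.myTable /tmp/myData.json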
For more information about using the bq insert command, see
Streaming data into BigQuery.
bq load
Use the bq load command to load data into a table.
Synopsis
bq load [FLAGS] DESTINATION_TABLE SOURCE_DATA [SCHEMA]
Example
bq load myDataset.newTable gs://mybucket/info.csv ./info_schema.json
Flags and arguments
The bq load command uses the following flags and arguments:
- --allow_jagged_rows={true|false}
- To allow missing trailing optional columns in CSV data, set to true.
- --preserve_ascii_control_characters={true|false}
- To allow embedded ASCII control characters in CSV data, set to true.
- --allow_quoted_newlines={true|false}
- To allow quoted newlines in CSV data, set to true.
- --autodetect={true|false}
- To enable schema auto-detection for CSV and JSON data, set to true. The default is false. If --autodetect is false, and no schema is specified by using the --schema flag, and the destination table exists, then the schema of the destination table is used.
- --clustering_fields=COLUMNS
- A comma-separated list of up to four column names that specifies the fields to use for table clustering.
- --column_name_character_map=SCOPE
- Defines the scope and handling of
characters in column names, with the option of enabling
flexible column names.
Requires the --autodetect option for CSV files. For a list of possible values, see load_option_list.
- --destination_kms_key=KEY
- Specifies a Cloud KMS key resource ID for encrypting the destination table data.
- --encoding=ENCODING_TYPE or -E=ENCODING_TYPE
- The character encoding used in the data. Use one of the following values:
- ISO-8859-1 (also known as Latin-1)
- UTF-8
- --field_delimiter=DELIMITER or -F=DELIMITER
- Specifies the character that marks the boundary between columns in the data.
The delimiter can be any ISO-8859-1 single-byte character. You can use either
\t or tab to specify tab delimiters.
- --ignore_unknown_values={true|false}
- When set to true for CSV and JSON files, rows with extra column values that don't match the table schema are loaded, but the extra columns are ignored. When set to true for Avro, Parquet, and ORC files, fields in the file schema that don't exist in the table schema are ignored and aren't loaded.
- --json_extension=JSON_TYPE
- Specifies the type of JSON file to load. Applies only to JSON files. Possible values are the following:
- GEOJSON: newline-delimited GeoJSON file

To use this flag, the --source_format flag must be set to NEWLINE_DELIMITED_JSON.

For more information, see Loading newline-delimited GeoJSON files.
- --max_bad_records=MAX
- An integer that specifies the maximum number of bad records allowed before the entire job fails. The default value is 0. At most, five errors of any type are returned regardless of the --max_bad_records value. This flag applies for loading CSV, JSON, and Google Sheets data only.
- --null_marker=STRING
- An optional custom string that represents a NULL value in CSV data.
- --projection_fields=PROPERTY_NAMES
- If you set --source_format to DATASTORE_BACKUP, then this flag indicates which entity properties to load from a Datastore export. Specify the property names in a comma-separated list. Property names are case sensitive and must refer to top-level properties. You can also use this flag with Firestore exports.
- --quote=CHARACTER
- Specifies a quote character to surround fields in CSV data. The CHARACTER argument can be any one-byte character. The default value is double quote ("). To specify that there is no quote character, use an empty string "".
- --replace={true|false}
- To erase any existing data and schema when new data is loaded, set to true. Any Cloud KMS key is also removed, unless you specify the --destination_kms_key flag. The default value is false.

Equivalent to the WRITE_TRUNCATE value for JobConfigurationLoad.writeDisposition.
- --schema={SCHEMA_FILE|SCHEMA}
- Specifies either the path to a local JSON schema file or a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE. If you use a schema file, don't add an extension to the filename.

For example:

--schema=/tmp/tabledef
--schema=Region:STRING,Quarter:STRING,Total_sales:INTEGER

If no schema is specified, and --autodetect is false, and the destination table exists, then the schema of the destination table is used.
- --schema_update_option=OPTION
- When appending data to a table (in a load job or a query job), or when overwriting a table partition, specifies how to update the schema of the destination table. Use one of the following values:
- ALLOW_FIELD_ADDITION: Allow new fields to be added
- ALLOW_FIELD_RELAXATION: Allow relaxing REQUIRED fields to NULLABLE

Repeat this flag to specify multiple schema update options.
- --skip_leading_rows=NUMBER_OF_ROWS
- An integer that specifies the number of rows to skip at the beginning of the source file. The default is 0.
- --file_set_spec_type=FILE_SET_SPEC_TYPE
- Specifies how to interpret source URIs. Possible values are the following:
- FILE_SYSTEM_MATCH: Expands source URIs by listing files from the object store. This is the default behavior if FileSetSpecType is not set.
- NEW_LINE_DELIMITED_MANIFEST: Indicates that the provided URIs are newline-delimited manifest files, with one URI per line. Wildcard URIs are not supported in the manifest files, and all referenced data files must be in the same bucket as the manifest.

For example, if you have a source URI of "gs://bucket/path/file" and the file_set_spec_type is FILE_SYSTEM_MATCH, then the file is used directly as a data file. If the file_set_spec_type is NEW_LINE_DELIMITED_MANIFEST, then each line in the file is interpreted as a URI that points to a data file.
- --source_format=FORMAT
- The format of the source data. Use one of the following values:
- CSV
- NEWLINE_DELIMITED_JSON
- AVRO
- DATASTORE_BACKUP (use this value for Firestore)
- PARQUET
- ORC
 
- --time_partitioning_expiration=SECONDS
- An integer that specifies (in seconds) when a time-based partition should be deleted. The expiration time evaluates to the partition's UTC date plus the integer value. A negative number indicates no expiration. 
- --time_partitioning_field=COLUMN_NAME
- Specifies the field that determines how to create a time-based partition. If time-based partitioning is enabled without this value, then the table is partitioned based on the load time. 
- --time_partitioning_type=INTERVAL
- Enables time-based partitioning on a table and sets the partition type. Use one of the following values:
- DAY
- HOUR
- MONTH
- YEAR
The default partition type for time-based partitioning is DAY.
- --use_avro_logical_types={true|false}
- If the --source_format flag is set to AVRO, then set this flag to true to convert logical types into their corresponding types (such as TIMESTAMP) instead of only using their raw types (such as INTEGER).
- --decimal_target_types=DECIMAL_TYPE
- Determines how to convert a Decimal logical type. Equivalent to JobConfigurationLoad.decimalTargetTypes. Repeat this flag to specify multiple target types.
- --parquet_enum_as_string={true|false}
- If the --source_format flag is set to PARQUET, and you want BigQuery to infer Parquet ENUM logical types as STRING values, then set this flag to true. The default is false.
- --parquet_enable_list_inference={true|false}
- If the --source_format flag is set to PARQUET, then this flag indicates whether to use schema inference for Parquet LIST logical types.
- --reference_file_schema_uri=URI
- Specifies the path to a reference file with the expected table schema for creating external tables. Equivalent to ExternalDataConfiguration.referenceFileSchemaUri. This flag is enabled for Avro, ORC, and PARQUET formats.
- DESTINATION_TABLE
- The table that you want to load data into. 
- SOURCE_DATA
- The Cloud Storage URI of the file that contains the data that you want to load. 
- SCHEMA
- The schema for the destination table. 
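For example, the following sketches load a hypothetical CSV file using an inline schema, and a newline-delimited JSON file into a day-partitioned table with schema auto-detection; the bucket, table, and column names are placeholders:

bq load --source_format=CSV --skip_leading_rows=1 \
myDataset.myTable gs://myBucket/data.csv \
Region:STRING,Quarter:STRING,Total_sales:INTEGER

bq load --source_format=NEWLINE_DELIMITED_JSON --autodetect=true \
--time_partitioning_type=DAY \
myDataset.myPartitionedTable gs://myBucket/data.json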
For more information about loading data from Cloud Storage using the
bq load
command, see the following:
- Loading Avro data
- Loading CSV data
- Loading JSON data
- Loading ORC data
- Loading Parquet data
- Loading data from Datastore exports
- Loading data from Firestore exports
For more information about loading data from a local source using the bq load
command, see the following:
bq ls
Use the bq ls command to list objects in a collection.
Synopsis
bq ls [FLAGS] [RESOURCE]
Example
bq ls myDataset
Flags and arguments
The bq ls command uses the following flags and arguments:
- --all={true|false} or -a={true|false}
- To show all results, set to true. Shows jobs from all users or all datasets, including hidden ones. This flag is not needed when listing transfer configurations or transfer runs. The default value is false.
- --capacity_commitment={true|false}
- To list capacity commitments, set to true and use the --location flag to specify the location. For more information, see View purchased commitments.

  For example:

  bq ls --capacity_commitment=true --location='us'
- --datasets={true|false} or -d={true|false}
- To list datasets, set to true. The default value is false.
- --filter="FILTER"
- Filters the listed resources to match the FILTER argument.

For datasets, FILTER consists of one or more space-separated triples in the format labels.KEY:VALUE. If more than one triple is provided, then the command only returns datasets matching all of the triples (that is, the command uses the AND logical operator, not OR). If you want to specify more than one triple, then surround the FILTER value with quotation marks.

To filter based on dataset labels, use the keys and values that you applied to your datasets.

For example:

--filter "labels.department:marketing labels.team:sales"

For transfer configurations, use dataSourceIds as the key, and one of the following data sources as the value:

- amazon_s3: Amazon S3 data transfer
- azure_blob_storage: Azure Blob Storage data transfer
- dcm_dt: Campaign Manager data transfer
- google_cloud_storage: Cloud Storage data transfer
- cross_region_copy: Dataset Copy
- dfp_dt: Google Ad Manager data transfer
- displayvideo: Display & Video 360 data transfer
- google_ads: Google Ads data transfer
- merchant_center: Google Merchant Center data transfer
- mysql: MySQL data transfer
- play: Google Play data transfer
- scheduled_query: Scheduled queries data transfer
- search_ads: Search Ads 360 data transfer
- youtube_channel: YouTube Channel data transfer
- youtube_content_owner: YouTube Content Owner data transfer
- redshift: Amazon Redshift migration
- on_premises: Teradata migration

For example:

--filter=dataSourceIds:dcm_dt
For transfer runs, use states as the key and one or more of the following
transfer states as the value:
- SUCCEEDED
- FAILED
- PENDING
- RUNNING
- CANCELLED
For example:
--filter="states:FAILED"
For jobs, use states as the key and one or more of the following job states as
the value:
- RUNNING
- PENDING
- DONE
For example:
bq ls --jobs --filter="states:RUNNING"
bq ls --jobs --filter="states:RUNNING,PENDING"
- --jobs={true|false} or -j={true|false}
- To list jobs, set to true. The default value is false. By default, you are limited to 100,000 results.
- --max_creation_time=MAX_CREATION_TIME_MS
- An integer that represents a
Unix epoch timestamp in milliseconds.
When specified with the --jobs flag, this flag lists only the jobs created before the timestamp.
- --max_results=MAX_RESULTS or -n=MAX_RESULTS
- An integer indicating the maximum number of results. The default value is 50
and the maximum value is 1000. If you have more than 1000 jobs, then you can
use the --page_token flag to list all jobs using pagination.
- --min_creation_time=MIN_CREATION_TIME_MS
- An integer that represents a
Unix epoch timestamp
in milliseconds. When specified with the --jobs flag, this flag lists only the jobs created after the timestamp.
- --message_type=messageTypes:MESSAGE_TYPE
- To list only transfer run log messages of a particular type, specify messageTypes:MESSAGE_TYPE. Possible values are the following:
- INFO
- WARNING
- ERROR
- --models={true|false} or -m={true|false}
- To list BigQuery ML models, set to true. The default value is false.
- --page_token=TOKEN or -k=TOKEN
- Lists items starting from the specified page token.
- --projects={true|false} or -p={true|false}
- To show all projects, set to true. The default value is false.
- --reservation={true|false}
- To list all reservations for a given project and location, set to true. The default value is false. Use with the --project_id and --location flags.

  For example:

  bq ls --reservation=true --project_id=myProject --location=us
- --reservation_assignment={true|false}
- To list all reservation assignments for a given project and location, set to true. The default value is false. Use with the --project_id and --location flags.
- --routines={true|false}
- To list all routines in the specified dataset, set to true. The default value is false. Routines include persistent user-defined functions, table functions (Preview), and stored procedures.
- --row_access_policies
- When specified, lists all the row-level access policies on a table. Row-level access policies are used for row-level security. You must supply the table name in the format dataset.table.
- --run_attempt=RUN_ATTEMPT
- Use with the --transfer_run flag. To list all run attempts for the specified transfer run, set to RUN_ATTEMPT_UNSPECIFIED. To list only the latest run attempt, set to LATEST. The default is LATEST.
- --transfer_config={true|false}
- To list transfer configurations in the specified project and location, set to true. Use with the --transfer_location and --project_id flags. The default value is false.
- --transfer_location=LOCATION
- List transfer configurations in the specified location. You set the transfer location when the transfer is created. 
- --transfer_log={true|false}
- Use with the --transfer_run flag. To list transfer log messages for the specified transfer run, set to true. The default value is false.
- --transfer_run={true|false}
- Lists the transfer runs for the specified transfer configuration.

  For example:

  bq ls --transfer_run=true projects/myProject/locations/us/transferConfigs/12345
- RESOURCE
- The collection whose objects you want to list. The resource can be a dataset, project, reservation, or transfer configuration.
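For example, to list scheduled-query transfer configurations in a hypothetical project, you can combine several of the preceding flags:

bq ls --transfer_config=true --transfer_location=us \
--project_id=myProject --filter=dataSourceIds:scheduled_query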
For more information about using the bq ls command, see the following:
- Managing jobs
- Listing datasets in a project
- Creating and using tables
- Listing views in a dataset
- Working with transfers
- Listing table snapshots in a dataset
bq mk
Use the bq mk command to create a BigQuery resource.
Synopsis
bq mk TYPE_FLAG [OTHER FLAGS] [ARGS]
Flags and arguments
The bq mk command takes a type flag that specifies the type of resource to
create, and additional flags that depend on the resource type.
TYPE_FLAG: Set one of the following flags to true.
Your selection specifies the type of resource to create.
- --capacity_commitment: Purchase a capacity commitment.
- --connection: Create a connection.
- --dataset or -d: Create a dataset.
- --materialized_view: Create a materialized view.
- --reservation: Create a reservation.
- --reservation_assignment: Assign a folder, project, or organization to a reservation.
- --reservation-group: Create a reservation group.
- --table or -t: Create a table.
- --transfer_config: Create a transfer configuration.
- --transfer_run: Create a transfer run for a time range.
- --view: Create a view.
The bq mk command supports the following flag for all types of resources:
- --force={true|false} or -f={true|false}
- To ignore errors if a resource with the same name already exists, set to true. If the resource already exists, then the exit code is 0, but setting this flag to true does not cause the bq mk command to overwrite the resource. The default value is false.
The bq mk command supports additional flags, depending on the type of resource
you are creating, as described in the following sections.
bq mk --capacity_commitment
To purchase a capacity commitment, set --capacity_commitment to true and use
the following flags:
- --location=LOCATION
- Specifies the location of the commitment.
- --plan=PLAN_TYPE
- Specifies the commitment plan type. Must be one of the following values:
- ANNUAL
- THREE_YEAR
Customers using legacy flat-rate pricing also may use one of the following values:
- FLEX
- MONTHLY
- ANNUAL
 
- --renewal_plan=RENEWAL_TYPE
- Specifies the renewal plan type. Required for ANNUAL or THREE_YEAR commitment plans. Must be one of the following:
- ANNUAL
- THREE_YEAR
- NONE
Customers using legacy flat-rate pricing also may use one of the following values:
- FLEX
- MONTHLY
- ANNUAL
 
- --project_id=PROJECT_ID
- Specifies the project that administers the slots. 
- --slots=NUMBER_OF_BASELINE_SLOTS
- Specifies the number of baseline slots to purchase. 
- --edition=EDITION
- The edition associated with the capacity commitment. Must be one of the following:
- ENTERPRISE
- ENTERPRISE_PLUS
 
For more information, see Purchase slots.
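For example, a hypothetical purchase of 100 baseline Enterprise slots on an annual plan might look like the following; the project ID and slot count are placeholders:

bq mk --capacity_commitment=true --location=us --project_id=myProject \
--plan=ANNUAL --renewal_plan=NONE --slots=100 --edition=ENTERPRISE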
bq mk --connection
Creates a connection. The following flags are supported:
- --connection_type=CONNECTION_TYPE
- The type of the connection, for example CLOUD_SQL for Cloud SQL connections.
- --properties=PROPERTIES
- Connection-specific parameters in JSON format. instanceId, database, and type must be specified.
- If you create a Spanner connection and want to use Data Boost, include the "useParallelism":true and "useDataBoost":true pairs.
- --connection_credential=CONNECTION_CREDENTIAL
- The credentials of the connection in JSON format. username and password must be specified.
- --project_id=PROJECT_ID
- Specifies the ID of the project that the connection belongs to. 
- --location=LOCATION
- Specifies the location where the connection is stored.
- --display_name=DISPLAY_NAME
- Specifies an optional friendly name for the connection. 
- --description=DESCRIPTION
- Specifies an optional description of the connection. 
- --iam_role_id=ROLE_ID
- For BigQuery Omni on AWS, specifies an IAM role that allows access to the resource.

  Use the following format: "arn:aws:iam::AWS_ACCOUNT_ID:role/POLICY_NAME", where:
  - AWS_ACCOUNT_ID is the ID number of the connection's AWS IAM user.
  - POLICY_NAME is the policy name.

  Example: "arn:aws:iam::0123456789AB:policy/s3-read-role"
- --tenant_id=TENANT_ID
- For BigQuery Omni on Microsoft Azure, specifies the tenant ID of the Microsoft Azure directory that contains the Microsoft Azure Storage account. 
- CONNECTION_ID
- Specifies an optional connection ID for the connection. If a connection ID is not provided, a unique ID is automatically generated. The connection ID can contain letters, numbers, and underscores.
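For example, a hypothetical Cloud SQL connection might be created as follows; the instance, database, credentials, and connection ID are placeholders, and the MYSQL type value assumes a MySQL instance:

bq mk --connection=true --connection_type=CLOUD_SQL \
--properties='{"instanceId":"myProject:us-central1:myInstance","database":"myDatabase","type":"MYSQL"}' \
--connection_credential='{"username":"myUser","password":"myPassword"}' \
--project_id=myProject --location=us myConnection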
For more information, see Introduction to connections.
bq mk --dataset
Creates a dataset. The following flags are supported:
- --add_tags=TAGS
- Specifies the tags that you are attaching to the new
dataset, separated by commas. For example,
556741164180/env:prod,myProject/department:sales. Each tag must have the namespaced key name and value short name.
- --default_kms_key=KEY
- Specifies the default Cloud KMS key resource ID for encrypting the table data in a dataset if no explicit key is provided during table creation or query.
- --default_partition_expiration=SECONDS
- An integer that specifies the default expiration time, in seconds, for all
partitions in newly-created partitioned tables in the dataset. A partition's
expiration time is set to the partition's UTC date plus the integer value.
If this property is set, then its value overrides the dataset-level default
table
expiration if it exists. If you supply the --time_partitioning_expiration flag when you create or update a partitioned table, then the table-level partition expiration takes precedence over the dataset-level default partition expiration.
- --default_table_expiration=SECONDS
- An integer that specifies the default lifetime, in seconds, for newly created tables in a dataset. The expiration time is set to the current UTC time plus this value.
- --description=DESCRIPTION
- Specifies the description of the dataset.
- --external_source=EXTERNAL_SOURCE
- Specifies the external data source when you create a federated dataset.
- --label=KEY:VALUE
- Specifies a label for the dataset. Repeat this flag to specify multiple labels.
- --location=LOCATION or --data_location=LOCATION
- Specifies the location of the dataset. Prefer the --location flag; the --data_location flag is a legacy flag.
- --max_time_travel_hours=HOURS
- Specifies the duration in
hours of the
time travel window
for the dataset. The --max_time_travel_hours value must be an integer expressed in multiples of 24 (48, 72, 96, 120, 144, 168) between 48 (2 days) and 168 (7 days). 168 hours is the default if this flag isn't specified.
- --storage_billing_model=BILLING_MODEL
- Specifies the storage billing model for a dataset. Set the --storage_billing_model value to PHYSICAL to use physical bytes when calculating storage charges, or to LOGICAL to use logical bytes. LOGICAL is the default.

When you change a dataset's billing model, it takes 24 hours for the change to take effect.

Once you change a dataset's storage billing model, you must wait 14 days before you can change the storage billing model again.
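For example, the following sketch creates a dataset in the US multi-region with a description, a one-hour default table lifetime, and a label; all names are placeholders:

bq mk --dataset --location=US \
--description="Hypothetical marketing dataset" \
--default_table_expiration=3600 \
--label=department:marketing myProject:myDataset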
For more information, see Creating datasets.
bq mk --materialized_view
Creates a materialized view. The following flags are supported:
- --enable_refresh={true|false}
- To disable automatic refresh for a materialized view, set to false. The default when creating a materialized view is true.
- --refresh_interval_ms=MILLISECONDS
- Specifies the number of milliseconds for the refresh interval of a materialized view. If this flag is not specified, then the default refresh interval for a materialized view that has refresh enabled is 1,800,000 milliseconds, which is 30 minutes.
For more information, see Creating and using materialized views.
bq mk --reservation
Creates a reservation with dedicated slots. The following flags are supported:
- --target_job_concurrency=CONCURRENCY
- Specifies the target number of queries that execute simultaneously. The default value is 0, which means that the concurrency is automatically computed based on the reservation size. For more information, see Use query queues.
- --ignore_idle_slots={true|false}
- To restrict jobs running in this reservation to only use slots allocated to the reservation, set to true. The default value is false; jobs in this reservation can use idle slots from other reservations, or slots that are not allocated to any reservation. For more information, see Idle slots.
- --location=LOCATION
- Specifies the location of the reservation.
- --project_id=PROJECT_ID
- Specifies the project that owns the reservation.
- --slots=NUMBER_OF_BASELINE_SLOTS
- Specifies the number of baseline slots to allocate to this reservation.
- --edition=EDITION
- The edition associated with the capacity
commitment. Must be one of the following:
- STANDARD
- ENTERPRISE
- ENTERPRISE_PLUS
 
- --autoscale_max_slots=NUMBER_OF_AUTOSCALING_SLOTS
- The number of autoscaling slots assigned to the reservation. This is equal to
the value of the max reservation size minus the number of baseline slots. Only
available with the --editionflag.
- --max_slots=MAXIMUM_NUMBER_OF_SLOTS
- The maximum number of slots the reservation will consume. Must be configured
with the --scaling_modeflag (Preview).
- --scaling_mode=SCALING_MODE
- The scaling mode of the reservation. Must be one of the following:
- IDLE_SLOTS_ONLY
- ALL_SLOTS
- AUTOSCALE_ONLY
- SCALING_MODE_UNSPECIFIED
Must be configured with the --max_slots flag (Preview).
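For example, the following sketch creates a hypothetical Enterprise reservation with 100 baseline slots that cannot use idle slots from other reservations; the project and reservation names are placeholders:

bq mk --reservation=true --location=us --project_id=myProject \
--slots=100 --edition=ENTERPRISE --ignore_idle_slots=true myReservation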
For more information, see Create a reservation with dedicated slots.
bq mk --reservation_assignment
Assigns a project, folder, or organization to a reservation. The following flags are supported:
- --assignee_id=ASSIGNEE_ID
- Specifies the ID of the folder, organization, or project.
- --assignee_type=ASSIGNEE_TYPE
- Specifies the type of entity to assign to the reservation. One of
the following:
- FOLDER
- ORGANIZATION
- PROJECT
 
- --job_type=JOB_TYPE
- Specifies the type of job to assign to the reservation. One of the following:
- QUERY
- PIPELINE
- ML_EXTERNAL
- BACKGROUND
 
- --location=LOCATION
- Specifies the location of the reservation.
- --project_id=PROJECT_ID
- Specifies the project that owns the reservation.
- --reservation_id=RESERVATION_ID
- Specifies the ID of the reservation.
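For example, the following sketch assigns a hypothetical project to a reservation for query jobs; the reservation ID uses the project and location prefix format shown earlier in this document:

bq mk --reservation_assignment --location=us --project_id=myProject \
--reservation_id=myProject:us.myReservation \
--assignee_type=PROJECT --assignee_id=myOtherProject --job_type=QUERY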
For more information, see Work with reservation assignments.
bq mk --alpha=reservation_groups --reservation-group
Creates a reservation group. The following flags are supported:
- --location=LOCATION
- Specifies the location of the reservation group.
- --project_id=PROJECT_ID
- Specifies the project that owns the reservation group.
Must be configured with the --alpha=reservation_groups flag (Preview).
For more information, see Work with reservation groups.
bq mk --table
Creates a table. The following flags are supported:
- --add_tags=TAGS
- Specifies the tags that you are attaching to the new
table, separated by commas. For example,
556741164180/env:prod,myProject/department:sales. Each tag must have the namespaced key name and value short name.
- --clustering_fields=COLUMNS
- A comma-separated list of up to four column names that specifies the fields to use for table clustering. If specified with partitioning, then the table is first partitioned, and then each partition is clustered using the supplied columns.
- --description=DESCRIPTION
- Specifies the description of the table.
- --destination_kms_key=KEY
- Specifies a Cloud KMS key resource ID for encrypting the destination table data.
- --expiration=SECONDS
- Specifies the lifetime for the table. If you don't specify the --expiration flag, then BigQuery creates the table with the dataset's default table lifetime, or the table doesn't expire.
- --external_table_definition=STRING
- Specifies a table definition for creating an external table.

For Cloud Storage and Google Drive external tables:

--external_table_definition={PATH_TO_FILE|DEFINITION}

The value can be either a path to a file containing a table definition (PATH_TO_FILE) or an inline table definition (DEFINITION).
- The format for the DEFINITION field is SCHEMA@FORMAT=URI.
- The format for the SCHEMA value is a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE. You can omit the SCHEMA value if the data format is self-describing (such as Avro) or if you are using schema auto-detection.
- The FORMAT value specifies the data format; one of the following:
  - AVRO
  - CSV
  - DATASTORE_BACKUP (use this value for Firestore)
  - ICEBERG
  - NEWLINE_DELIMITED_JSON
  - ORC
  - PARQUET

If you specify a table definition file, don't add an extension to the filename.

For example:

--external_table_definition=/tmp/tabledef
--external_table_definition=Region:STRING,Quarter:STRING,Total_sales:INTEGER@CSV=gs://mybucket/sales.csv

For Bigtable external tables, and BigLake tables based on AWS and Azure:

--external_table_definition=PATH_TO_FILE

The value must be a path to a file containing a table definition.

For BigLake tables based on Cloud Storage:

--external_table_definition=FORMAT=BUCKET_PATH@REGION.CONNECTION_NAME

- The FORMAT value specifies the data format; one of the following:
  - AVRO
  - CSV
  - NEWLINE_DELIMITED_JSON
  - ICEBERG
  - ORC
  - PARQUET
- BUCKET_PATH is the path to one or more files in Cloud Storage that contain the BigLake table data. You can specify BUCKET_PATH in the following formats:
  - For a single file: gs://bucket_name/[folder_name/]file_name.
  - For multiple files in a single bucket: gs://bucket_name/[folder_name/]*.
  - For multiple files in multiple buckets: gs://mybucket1/*,gs://mybucket2/folder5/*.

  You can use wildcards to limit the files included in the BigLake table. For example, if the bucket contains several types of data, you could have the table only use PARQUET files by specifying gs://bucket_name/*.parquet. For more information about using wildcards, see URI wildcards.
- The REGION value specifies the region or multi-region that contains the connection.
- The CONNECTION_NAME value specifies the name of the cloud resource connection to use with this external table. The connection determines which service account is used to read data from Cloud Storage.

For object tables:

--external_table_definition=BUCKET_PATH@REGION.CONNECTION_NAME

- BUCKET_PATH is the path to the Cloud Storage bucket that contains the objects represented by the object table, in the format gs://bucket_name/[folder_name/]*. You can specify multiple buckets by providing multiple paths, for example gs://mybucket1/*,gs://mybucket2/folder5/*.

  You can use wildcards to limit the objects included in the object table. For example, if the bucket contains several types of unstructured data, you could create the object table over only PDF objects by specifying gs://bucket_name/*.pdf. For more information about using wildcards, see URI wildcards.
- The REGION value specifies the region or multi-region that contains the connection.
- The CONNECTION_NAME value specifies the name of the cloud resource connection to use with this external table. The connection determines which service account is used to read data from Cloud Storage.
- --file_set_spec_type=FILE_SET_SPEC_TYPE
- Specifies how to interpret source URIs. - FILE_SYSTEM_MATCH: Expands source URIs by listing files from the object store. This is the default behavior if FileSetSpecType is not set.
- NEW_LINE_DELIMITED_MANIFEST: Indicates that the provided URIs are newline-delimited manifest files, with one URI per line. Wildcard URIs are not supported in the manifest files, and all referenced data files must be in the same bucket as the manifest.
  For example, if you have a source URI of "gs://bucket/path/file" and the file_set_spec_type is FILE_SYSTEM_MATCH, then the file is used directly as a data file. If the file_set_spec_type is NEW_LINE_DELIMITED_MANIFEST, then each line in the file is interpreted as a URI that points to a data file.
- --reference_file_schema_uri=URI
- Specifies the path to a reference file with the expected table schema for creating external tables. Equivalent to ExternalDataConfiguration.referenceFileSchemaUri. This flag is enabled for the Avro, ORC, and Parquet formats.
- --label=KEY:VALUE
- Specifies a label for the table. Repeat this flag to specify multiple labels. 
- --max_staleness=INTERVAL
- Specifies whether cached metadata is used by operations against the table, and how fresh the cached metadata must be in order for the operation to use it. Applicable for BigLake tables and object tables.
  To disable metadata caching, specify 0. This is the default.
  To enable metadata caching, specify an interval value between 30 minutes and 7 days, using the Y-M D H:M:S format described in the INTERVAL data type documentation. For example, specify 0-0 0 4:0:0 for a 4 hour staleness interval. With this value, operations against the table use cached metadata if it has been refreshed within the past 4 hours. If the cached metadata is older than that, the operation falls back to retrieving metadata from Cloud Storage instead.
- --object_metadata=STRING
- Set the value of this flag to SIMPLE when creating an object table. Only required when creating an object table.
- --range_partitioning=COLUMN_NAME,START,END,INTERVAL
- Specifies options for an integer-range partition, as follows:
  - column_name is the column used to create the integer range partitions.
  - start is the start of range partitioning, inclusive.
  - end is the end of range partitioning, exclusive.
  - interval is the width of each range within the partition.
  For example: --range_partitioning=customer_id,0,10000,100
- --require_partition_filter={true|false}
- To require a partition filter for queries over the supplied table, set to true. This flag only applies to partitioned tables. The default value is false.
- --schema={SCHEMA_FILE|SCHEMA}
- Specifies either the path to a local JSON schema file or a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE. If you use a schema file, don't add an extension to the filename.
  Examples:
  --schema=/tmp/tabledef
  --schema=Region:STRING,Quarter:STRING,Total_sales:INTEGER
- --time_partitioning_expiration=SECONDS
- An integer that specifies (in seconds) when a time-based partition should be deleted. The expiration time evaluates to the partition's UTC date plus the integer value. A negative number indicates no expiration. 
- --time_partitioning_field=COLUMN_NAME
- Specifies the field used to determine how to create a time-based partition. If time-based partitioning is enabled without this value, then the table is partitioned based on the load time. 
- --time_partitioning_type=INTERVAL
- Enables time-based partitioning on a table and sets the partition type. Use one of the following values:
  - DAY
  - HOUR
  - MONTH
  - YEAR
- --use_avro_logical_types={true|false}
- If the FORMAT part of the --external_table_definition flag is set to AVRO, then this flag specifies whether to convert logical types into their corresponding types (such as TIMESTAMP) instead of only using their raw types (such as INTEGER).
- --parquet_enable_list_inference={true|false}
- If the FORMAT part of the --external_table_definition flag is set to PARQUET, then this flag specifies whether to use schema inference for Parquet LIST logical types.
- --parquet_enum_as_string={true|false}
- If the FORMAT part of the --external_table_definition flag is set to PARQUET, then this flag specifies whether to infer Parquet ENUM logical types as STRING values.
For more information, see Creating and using tables.
bq mk --transfer_config
Creates a transfer configuration. The following flags are supported:
- --data_source=DATA_SOURCE
- Specifies the data source. Required when creating a transfer configuration. Use one of the following values:
- amazon_s3 - Amazon S3 data transfer
- azure_blob_storage - Azure Blob Storage data transfer
- dcm_dt - Campaign Manager data transfer
- google_cloud_storage - Cloud Storage data transfer
- cross_region_copy - Dataset Copy
- dfp_dt - Google Ad Manager data transfer
- displayvideo - Display & Video 360 data transfer
- google_ads - Google Ads data transfer
- merchant_center - Google Merchant Center data transfer
- mysql - MySQL data transfer
- play - Google Play data transfer
- scheduled_query - Scheduled queries data transfer
- search_ads - Search Ads 360 data transfer
- youtube_channel - YouTube Channel data transfer
- youtube_content_owner - YouTube Content Owner data transfer
- redshift - Amazon Redshift migration
- on_premises - Teradata migration
- --display_name=DISPLAY_NAME
- Specifies the display name for the transfer configuration.
- --no_auto_scheduling={true|false}
- Disables automatic scheduling of data transfer runs for this configuration.
The default value is false.
- --params={"PARAMETER":"VALUE"} or -p={"PARAMETER":"VALUE"}
- Specifies the parameters for the transfer configuration in JSON format. The parameters vary depending on the data source.
- --refresh_window_days=DAYS
- An integer that specifies the refresh window for the transfer configuration in
days. The default value is 0.
- --service_account_name=SERVICE_ACCOUNT
- Specifies a service account to be used as the credential for the transfer configuration.
- --target_dataset=DATASET
- Specifies the target dataset for the transfer configuration.
- --table_filter=TABLES
- Only used with the google_ads data source. The TABLES parameter is a comma-separated list of tables to include in the transfer. To exclude a table, prefix it with a hyphen (-). The default value includes all tables in the transfer.
- --destination_kms_key=KEY
- Specifies a Cloud KMS key resource ID for encrypting the destination table data.
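For example, a scheduled query transfer configuration might be created with a command like the following sketch; the dataset, display name, and query are placeholders, and the keys shown in --params are illustrative of the scheduled_query data source:
bq mk --transfer_config \
--data_source=scheduled_query \
--target_dataset=myDataset \
--display_name='My Scheduled Query' \
--params='{"query":"SELECT CURRENT_DATE() AS run_date","destination_table_name_template":"myTable","write_disposition":"WRITE_APPEND"}'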
For information about using the bq mk command with the BigQuery
Data Transfer Service, see the following:
- Set up an Amazon S3 transfer
- Set up a Campaign Manager transfer
- Set up a Cloud Storage transfer
- Set up a Google Ad Manager transfer
- Set up a Google Ads transfer
- Set up a Google Merchant Center transfer (beta)
- Set up a Google Play transfer
- Set up a Search Ads 360 transfer (beta)
- Set up a YouTube Channel transfer
- Set up a YouTube Content Owner transfer
- Migrate data from Amazon Redshift
- Migrate data from Teradata
bq mk --transfer_run
Creates a data transfer run at the specified time or time range using the specified data transfer configuration.
Synopsis
bq mk --transfer_run [--run_time=RUN_TIME | --start_time=START_TIME --end_time=END_TIME] CONFIG
The following flags are supported:
- --run_time=RUN_TIME
- A timestamp that specifies the time to schedule the data transfer run.
- --start_time=START_TIME
- A timestamp that specifies the start time for a range of data transfer runs.
- --end_time=END_TIME
- A timestamp that specifies the end time for a range of data transfer runs.
The format for the timestamps is RFC3339 UTC "Zulu".
The CONFIG argument specifies a pre-existing data
transfer configuration.
Examples
bq mk --transfer_run \
--run_time=2021-01-20T17:00:00.00Z \
projects/p/locations/l/transferConfigs/c
bq mk --transfer_run \
--start_time=2020-12-19T16:39:57-08:00 \
--end_time=2020-12-19T20:39:57-08:00 \
projects/p/locations/l/transferConfigs/c
bq mk --view
Creates a view. The following flags are supported:
- --add_tags=TAGS
- Specifies the tags that you are attaching to the new
view, separated by commas. For example,
556741164180/env:prod,myProject/department:sales. Each tag must have the namespaced key name and value short name.
- --description=DESCRIPTION
- Specifies the description of the view.
- --expiration=SECONDS
- Specifies the lifetime for the view. If SECONDS is 0, then the view doesn't expire. If you don't specify the --expiration flag, then BigQuery creates the view with the dataset's default table lifetime.
- --label=KEY:VALUE
- Specifies a label for the view. Repeat this flag to specify multiple labels.
- --use_legacy_sql={true|false}
- Set to false to use a GoogleSQL query to create the view. The default value is determined by your configuration settings. If the configuration setting is not specified, the default value is true; the view uses legacy SQL.
- --view_udf_resource=FILE
- Specifies the Cloud Storage URI or the path to a local code file that is loaded and evaluated immediately as a user-defined function resource used by a view's SQL query. Repeat this flag to specify multiple files.
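For example, a view might be created over a hypothetical sales table with a command like the following; the dataset, view name, and query are placeholders:
bq mk --view \
--use_legacy_sql=false \
--description="Quarterly sales by region" \
--view='SELECT Region, SUM(Total_sales) AS total FROM myDataset.sales GROUP BY Region' \
myDataset.myView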
For more information, see Creating views.
bq mkdef
Use the bq mkdef command to create a table definition in JSON format for data
stored in Cloud Storage or Google Drive.
Synopsis
bq mkdef [FLAGS] URI [ > FILE ]
Flags and arguments
The bq mkdef command uses the following flags and arguments:
- --autodetect={true|false}
- Specifies whether to use schema auto-detection for CSV and JSON
data. The default is false.
- --connection_id=CONNECTION_ID
- The ID of a connection resource to use for authentication.
- --hive_partitioning_mode
- Specifies how to determine the partitioning schema when BigQuery reads data. The following modes are supported:
  - AUTO: Automatically infer partition key names and types.
  - STRINGS: Automatically infer partition key names. All types are treated as strings.
  - CUSTOM: Specify the partitioning schema in the source URI prefix.
  The default value is AUTO.
- --hive_partitioning_source_uri_prefix
- Specifies the common prefix for the source URIs. The common prefix value is the portion of the URI that immediately precedes the partition key encoding. If you specified CUSTOM for the mode, then you must also identify the partitioning schema.
  For example, consider files with the following structure:
  - gs://bucket/path_to_table/dt=2019-06-01/country=USA/id=7/file.avro
  - gs://bucket/path_to_table/dt=2019-05-31/country=CA/id=3/file.avro
  If you use AUTO or STRINGS modes, then the following values are acceptable:
  - gs://bucket/path_to_table
  - gs://bucket/path_to_table/
  If you use CUSTOM mode, then the following values are acceptable:
  - gs://bucket/path_to_table/{dt:DATE}/{country:STRING}/{id:INTEGER}
  - gs://bucket/path_to_table/{dt:STRING}/{country:STRING}/{id:INTEGER}
  - gs://bucket/path_to_table/{dt:DATE}/{country:STRING}/{id:STRING}
- --ignore_unknown_values={true|false} or -i={true|false}
- Specifies whether to ignore any values in a row that are not present in the schema. The default is false.
- --metadata_cache_mode=STRING
- Specifies whether the metadata cache for the table is refreshed automatically or manually.
  Set to AUTOMATIC for the metadata cache to be refreshed at a system-defined interval, usually somewhere between 30 and 60 minutes.
  Set to MANUAL if you want to refresh the metadata cache on a schedule you determine. In this case, you can call the BQ.REFRESH_EXTERNAL_METADATA_CACHE system procedure to refresh the cache.
  You must set the --metadata_cache_mode flag if you set the --max_staleness flag with the bq mk command.
- --parquet_enable_list_inference={true|false}
- If source_format is set to PARQUET, then this flag specifies whether to use schema inference for Parquet LIST logical types. The default is false.
- --parquet_enum_as_string={true|false}
- If source_format is set to PARQUET, then this flag specifies whether to infer Parquet ENUM logical types as STRING values. The default is false.
- --file_set_spec_type=FILE_SET_SPEC_TYPE
- Specifies how to interpret source URIs.
  - FILE_SYSTEM_MATCH: Expands source URIs by listing files from the object store. This is the default behavior if FileSetSpecType is not set.
  - NEW_LINE_DELIMITED_MANIFEST: Indicates that the provided URIs are newline-delimited manifest files, with one URI per line. Wildcard URIs are not supported in the manifest files, and all referenced data files must be in the same bucket as the manifest.
  For example, if you have a source URI of "gs://bucket/path/file" and the file_set_spec_type is FILE_SYSTEM_MATCH, then the file is used directly as a data file. If the file_set_spec_type is NEW_LINE_DELIMITED_MANIFEST, then each line in the file is interpreted as a URI that points to a data file.
- --source_format=FORMAT
- Specifies the format of the source data. Use one of the following values:
  - AVRO
  - CSV
  - DATASTORE_BACKUP (use this value for Firestore)
  - GOOGLE_SHEETS
  - NEWLINE_DELIMITED_JSON
  - ORC
  - PARQUET
  The default value is CSV.
- --use_avro_logical_types={true|false}
- If the --source_format flag is set to AVRO, then this flag specifies whether to convert logical types into their corresponding types (such as TIMESTAMP) instead of only using their raw types (such as INTEGER). The default is false.
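For example, a table definition for CSV files in Cloud Storage, using schema auto-detection, might be generated with a command like the following; the bucket path and output filename are placeholders:
bq mkdef \
--source_format=CSV \
--autodetect=true \
gs://mybucket/sales/*.csv > /tmp/sales_def
For more information about using the bq mkdef command, see
Creating a table definition file for an external data source.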
bq partition
Use the bq partition command to convert a group of tables with time-unit
suffixes, such as tables ending in YYYYMMDD for date
partitioning, into partitioned tables.
Synopsis
bq partition [FLAGS] SOURCE_TABLE_BASE_NAME PARTITION_TABLE
Flags and arguments
The bq partition command uses the following flags and arguments:
- --no_clobber={true|false} or -n={true|false}
- To disallow overwriting an existing partition, set to true. The default value is false; if the partition exists, then it is overwritten.
- --time_partitioning_expiration=SECONDS
- An integer that specifies (in seconds) when a time-based partition should be deleted. The expiration time evaluates to the partition's UTC date plus the integer value. A negative number indicates no expiration.
- --time_partitioning_type=INTERVAL
- Specifies the partition type. The following table provides the possible values for the INTERVAL flag and the expected time-unit-suffix format for each:

| INTERVAL | Suffix |
|---|---|
| HOUR | YYYYMMDDHH |
| DAY | YYYYMMDD |
| MONTH | YYYYMM |
| YEAR | YYYY |
- SOURCE_TABLE_BASE_NAME
- The base name of the group of tables with time-unit suffixes. 
- PARTITION_TABLE
- The name of the destination partitioned table. 
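For example, a group of date-sharded tables named sourcetable_YYYYMMDD might be converted into a single day-partitioned table with a command like the following sketch; the table names and expiration value are placeholders:
bq partition \
--time_partitioning_type=DAY \
--time_partitioning_expiration=7776000 \
myDataset.sourcetable_ \
myDataset.mytable_partitioned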
For more information about using the bq partition command, see
Converting date-sharded tables into ingestion-time partitioned tables.
bq query
Use the bq query command to create a query job that runs the specified SQL
query.
Synopsis
bq query [FLAGS] 'QUERY'
Flags and arguments
The bq query command uses the following flags and arguments:
- --allow_large_results={true|false}
- To enable large destination table sizes for legacy SQL queries, set to true. The default value is false.
- --append_table={true|false}
- To append data to a destination table, set to true. The default value is false.
- --batch={true|false}
- To run the query in batch mode, set to true. The default value is false.
- --clustering_fields=COLUMNS
- A comma-separated list of up to four column names that specifies fields to use to cluster the destination table in a query. If specified with partitioning, then the table is first partitioned, and then each partition is clustered using the supplied columns.
- --connection_property=KEY=VALUE
- A key-value pair that lets you specify connection-level properties to customize query behavior. Repeat this flag to specify additional properties. Supported connection properties are as follows:
  - dataset_project_id: represents the default project for datasets that are used in the query, similar to the @@dataset_project_id system variable.
  - query_label: associates the query with a given job label. If set, all subsequent queries in a script or session have this label. For details on query label formatting requirements, see the labels field in the JobConfiguration resource.
  - service_account: specifies a service account to use to run the query. For example, --connection_property=service_account=myserviceaccount@project.iam.gserviceaccount.com.
  - session_id: associates the query with a given session.
  - time_zone: represents the default timezone to use to run the query.
- --continuous={true|false}
- To run a continuous query, set to true. The default value is false.
- --destination_kms_key=KEY
- Specifies a Cloud KMS key resource ID for encrypting the destination table data. 
- --destination_schema={PATH_TO_FILE|SCHEMA}
- The path to a local JSON schema file or a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE.
  Schema changes happen in a separate operation from query execution. If you write query results to a table by specifying the --destination_table flag, and the query subsequently raises an exception, it is possible that any schema changes will be skipped. If this occurs, check the destination table schema and manually update it if necessary.
- --destination_table=TABLE
- When specified, the query results are saved to TABLE. Specify TABLE in the following format: PROJECT:DATASET.TABLE. If PROJECT is not specified, then the current project is assumed. If the --destination_table flag is not specified, then the query results are saved to a temporary table.
  Examples:
  --destination_table myProject:myDataset.myTable
  --destination_table myDataset.myTable
- --dry_run={true|false}
- When specified, the query is validated but not run. 
- --external_table_definition={TABLE::PATH_TO_FILE|TABLE::DEFINITION}
- Specifies the table name and table definition for an external table query. The table definition can be a path to a local JSON schema file or an inline table definition. The format for supplying the inline table definition is SCHEMA@SOURCE_FORMAT=CLOUD_STORAGE_URI. The format for the SCHEMA value is a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE. If you use a table definition file, don't add an extension to the filename.
  For example:
  --external_table_definition=myTable::/tmp/tabledef
  --external_table_definition=myTable::Region:STRING,Quarter:STRING,Total_sales:INTEGER@CSV=gs://mybucket/sales.csv
  Repeat this flag to query multiple tables.
- --flatten_results={true|false}
- To disallow flattening nested and repeated fields in the results for legacy SQL queries, set to false. The default value is true.
- --label=KEY:VALUE
- Specifies a label for the query job. Repeat this flag to specify multiple labels. 
- --max_rows=MAX_ROWS or -n=MAX_ROWS
- An integer specifying the number of rows to return in the query results. The default value is 100.
- --maximum_bytes_billed=MAX_BYTES
- An integer that limits the bytes billed for the query. If the query goes beyond the limit, then the query fails (without incurring a charge). If this flag is not specified, then the bytes billed is set to the project default. 
- --max_statement_results=VALUE
- An integer specifying the maximum number of script statements displayed for the query results. The default value is 100.
- --min_completion_ratio=RATIO
- (Experimental) A number from 0 through 1.0 that specifies the minimum fraction of data that must be scanned before a query returns. If the flag is not specified, then the default server value 1.0 is used.
- --parameter={PATH_TO_FILE|PARAMETER}
- Either a JSON file containing a list of query parameters, or a query parameter in the form NAME:TYPE:VALUE. An empty name creates a positional parameter. If TYPE is omitted, then the STRING type is assumed. NULL specifies a null value. Repeat this flag to specify multiple parameters.
  For example:
  --parameter=/tmp/queryParams
  --parameter=Name::Oscar
  --parameter=Count:INTEGER:42
- --range_partitioning=COLUMN_NAME,START,END,INTERVAL
- Use with the --destination_table flag. Specifies options for integer-range partitioning in the destination table. The value is a comma-separated list of the form column_name,start,end,interval, where
  - column_name is the column used to create the integer range partitions.
  - start is the start of range partitioning, inclusive.
  - end is the end of range partitioning, exclusive.
  - interval is the width of each range within the partition.
  For example: --range_partitioning=customer_id,0,10000,100
- --replace={true|false}
- To overwrite the destination table with the query results, set to true. Any existing data and schema are erased. Any Cloud KMS key is also removed, unless you specify the --destination_kms_key flag. The default value is false.
- --require_cache={true|false}
- If specified, then run the query only if results can be retrieved from the cache. 
- --require_partition_filter={true|false}
- If specified, then a partition filter is required for queries over the supplied table. This flag can only be used with a partitioned table. 
- --reservation_id=RESERVATION
- Preview. If specified, the reservation in which the query runs.
- --rpc={true|false}
- To use the RPC-style query API instead of the REST API jobs.insert method, set to true. The default value is false.
- --schedule="SCHEDULE"
- Makes a query a recurring scheduled query. A schedule for how often the query should run is required.
  Examples:
  --schedule="every 24 hours"
  --schedule="every 3 hours"
  For a description of the schedule syntax, see Formatting the schedule.
- --schema_update_option=OPTION
- When appending data to a table in a load job or a query job, or when overwriting a table partition, specifies how to update the schema of the destination table. Use one of the following values:
  - ALLOW_FIELD_ADDITION: Allow new fields to be added.
  - ALLOW_FIELD_RELAXATION: Allow relaxing REQUIRED fields to NULLABLE.
  Repeat this flag to specify multiple schema update options.
- --start_row=ROW_NUMBER or -s=ROW_NUMBER
- An integer that specifies the first row to return in the query result. The default value is 0.
- --target_dataset=DATASET
- When specified with --schedule, updates the target dataset for a scheduled query. The query must be DDL or DML.
- --time_partitioning_expiration=SECONDS
- Use with the --destination_table flag. An integer that specifies (in seconds) when a time-based partition should be deleted. The expiration time evaluates to the partition's UTC date plus the integer value. A negative number indicates no expiration.
- --time_partitioning_field=COLUMN_NAME
- Use with the --destination_table flag. Specifies the partitioning column for time-based partitioning. If time-based partitioning is enabled without this value, then the table is partitioned based on the ingestion time.
- --time_partitioning_type=INTERVAL
- Use with the --destination_table flag. Specifies the partition type for the destination table. Use one of the following values:
  - DAY
  - HOUR
  - MONTH
  - YEAR
- --udf_resource=FILE
- This flag applies only to legacy SQL queries. Specifies the Cloud Storage URI or the path to a local file containing a user-defined function resource to be used by a legacy SQL query. Repeat this flag to specify multiple files. 
- --use_cache={true|false}
- To disallow caching query results, set to false. The default value is true.
- --use_legacy_sql={true|false}
- To run a GoogleSQL query, set to false. The default value is determined by your configuration settings. If the configuration setting is not specified, the default value is true; the command uses legacy SQL.
- --job_timeout_ms={string (Int64Value)}
- Specifies the maximum time to run a query in milliseconds. If this time limit is exceeded, BigQuery attempts to stop the job. 
- QUERY
- The query that you want to run. You can specify the query by using one of the following methods:
  - Specify a string that contains the query.
    If you need to use additional string literals within the query, you must follow the quoting rules for the shell that you are using, such as Bash or PowerShell.
    The following example shows a typical approach in Bash, which is to use double quotes to denote the string literals in the query, and then enclose the query itself in single quotes:
    'SELECT * FROM mydataset.mytable WHERE column1 = "value";'
    If you are copying the query from another location, you must also remove any comments in the query.
  - Pass in a SQL script that contains the query. The following example shows how to pass in a SQL script in the Bash shell:
    bq query --use_legacy_sql=false < query.sql
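Putting several of these flags together, a query that writes its results to a day-partitioned destination table might look like the following sketch; the dataset, tables, and query are placeholders:
bq query \
--use_legacy_sql=false \
--destination_table=myDataset.salesSummary \
--time_partitioning_type=DAY \
--time_partitioning_field=sale_date \
'SELECT sale_date, Region, SUM(Total_sales) AS total FROM myDataset.sales GROUP BY sale_date, Region'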
 
For more information about using the bq query command, see
Run a query.
bq remove-iam-policy-binding
Use the bq remove-iam-policy-binding command to retrieve the
IAM policy
for a resource and remove a binding from the policy, in one step.
The resource can be a table or a view.
This command is an alternative to the following three-step process:
- Using the bq get-iam-policy command to retrieve the policy file (in JSON format).
- Editing the policy file.
- Using the bq set-iam-policy command to update the policy without the binding.
Synopsis
bq remove-iam-policy-binding FLAGS --member=MEMBER_TYPE:MEMBER --role=ROLE RESOURCE
Flags and arguments
The bq remove-iam-policy-binding command uses the following
flags and arguments:
- --member=MEMBER_TYPE:MEMBER
- Required. Use the --member flag to specify the member part of the IAM policy binding. The --member flag is required along with the --role flag. One combination of --member and --role flags equals one binding.
  The MEMBER_TYPE value specifies the type of member in the IAM policy binding. Use one of the following values:
  - user
  - serviceAccount
  - group
  - domain
  The MEMBER value specifies the email address or domain of the member in the IAM policy binding.
- --role=ROLE
- Required. Specifies the role part of the IAM policy binding. The --role flag is required along with the --member flag. One combination of --member and --role flags equals one binding.
- --table={true|false} or -t={true|false}
- Optional. To remove a binding from the IAM policy of a table or view, set to true. The default value is false.
RESOURCE is the table or view whose policy binding
you want to remove.
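For example, a viewer binding might be removed from a table with a command like the following; the member, role, and table are placeholders:
bq remove-iam-policy-binding \
--member=user:alice@example.com \
--role=roles/bigquery.dataViewer \
--table=true \
myDataset.myTable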
For more information, see the IAM policy reference.
bq rm
Use the bq rm command to delete a BigQuery resource.
Synopsis
bq rm [FLAGS] RESOURCE
Flags and arguments
The bq rm command uses the following flags and arguments:
- --capacity_commitment={true|false}
- To delete a capacity commitment, set to true, specify the location of the commitment you want to remove by using the --location flag, and replace RESOURCE with the ID of the commitment you want to remove.
- --dataset={true|false} or -d={true|false}
- To delete a dataset, set to true. The default value is false.
- --force={true|false} or -f={true|false}
- To delete a resource without prompting, set to true. The default value is false.
- --job={true|false}or- -j={true|false}
- To delete a job, set to true. The default value is false.
- --model={true|false} or -m={true|false}
- To delete a BigQuery ML model, set to true. The default is false.
- --recursive={true|false} or -r={true|false}
- To delete a dataset and any tables, table data, or models in it, set to true. The default value is false.
- --reservation={true|false}
- To delete a reservation, set to true. The default value is false.
- --reservation_assignment={true|false}
- To delete a reservation assignment, set to true. The default value is false.
- --routine={true|false}
- To delete a routine, set to true. The default value is false. A routine can be a persistent user-defined function, a table function (Preview), or a stored procedure.
- --table={true|false} or -t={true|false}
- To delete a table or view, set to true. The default value is false.
- --transfer_config={true|false}
- To delete a transfer configuration, set to true. The default value is false.
- RESOURCE
- The resource that you want to remove.
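For example, a dataset and all of its contents might be deleted without a confirmation prompt by a command like the following; myDataset is a placeholder:
bq rm --dataset=true --recursive=true --force=true myDataset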
For more information about using the bq rm command, see the following:
- Managing datasets
- Managing jobs
- Managing tables
- Managing views
- Working with transfers
- Deleting table snapshots
bq set-iam-policy
Use the bq set-iam-policy command to specify or update the
IAM policy
for a resource. The resource can be a table, a view, or a
slot reservation.
After setting the policy, the new policy is
printed to stdout. The policy is in JSON format.
The etag field in the updated policy must match the etag value of the
current policy, otherwise the update fails. This feature prevents concurrent
updates.
You can obtain the current policy and etag value for a resource by using the
bq get-iam-policy command.
Synopsis
bq set-iam-policy [FLAGS] RESOURCE FILE_NAME
Examples
bq set-iam-policy myDataset.myTable policy.json
bq set-iam-policy --reservation myReservation policy.json
Flags and arguments
The bq set-iam-policy command uses the following flags and
arguments.
- --table={true|false} or -t={true|false}
- Optional. To set the IAM policy of a table or view, set to true. The default value is false.
- --reservation={true|false}
- To set the IAM policy of a reservation, set to true (Preview). The default value is false. When this flag is used, RESOURCE is treated as a reservation identifier. The reservation can have optional project and location prefixes: myProject:myLocation.myReservation.
RESOURCE is the table or view whose policy you want to update.
FILE_NAME is the name of a file containing the policy in JSON format.
For more information about the bq set-iam-policy command, see
Control access to resources with IAM.
bq show
Use the bq show command to display information about a resource.
Synopsis
bq show [FLAGS] [RESOURCE]
Flags and arguments
The bq show command uses the following flags and arguments:
- --assignee_id=ASSIGNEE
- When used with the --reservation_assignment flag, specifies the ID of a folder, organization, or project. Use the --assignee_type flag to specify which type of assignee to show.
- --assignee_type=TYPE
- When used with the --reservation_assignment flag, specifies the type of entity to show. Use one of the following values:
  - FOLDER
  - ORGANIZATION
  - PROJECT
- --connection={true|false}
- To show information about a connection, set to true. The default value is false. For more information, see Viewing a connection resource.
- --dataset={true|false} or -d={true|false}
- To show information about a dataset, set to true. The default value is false.
- --encryption_service_account={true|false}
- To show the encryption service account for a project, if it exists, or create one if it doesn't exist, set to true. The default value is false. Use with the --project_id flag.
- --job={true|false} or -j={true|false}
- To show information about a job, set to true. The default value is false.
- --job_type=JOB_TYPE
- When used with the --reservation_assignment flag, specifies the job type of the reservation assignments you want to show. Use one of the following values:
  - QUERY
  - PIPELINE
  - ML_EXTERNAL
- --model={true|false} or -m={true|false}
- To show information about a BigQuery ML model, set to true. The default value is false.
- --reservation={true|false}
- To show information about a reservation, set to true. The default value is false.
- --reservation_assignment={true|false}
- When set to true, the command displays reservation assignments for a specified folder, organization, or project. The command displays the target resource's explicit assignments, if any; otherwise, it displays assignments inherited from parent resources. For example, a project might inherit assignments from its parent folder. When using this flag, the --job_type, --assignee_type, and --assignee_id flags apply. The default value is false.
- --routine={true|false}
- To show information about a routine, set to true. The default value is false. A routine can be a persistent user-defined function, a table function (Preview), or a stored procedure.
- --schema={true|false}
- To display only the table's schema, set to true. The default value is false.
- --transfer_config={true|false}
- To display information about a transfer configuration, set to true. The default value is false.
- --transfer_run={true|false}
- To display information about a transfer run, set to true. The default value is false.
- --view={true|false}
- To display information about a view, set to true. The default value is false.
- RESOURCE
- The resource whose information you want to show.
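For example, only a table's schema might be displayed, combined with the global --format flag for readable output; myDataset.myTable is a placeholder:
bq show --schema=true --format=prettyjson myDataset.myTable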
For more information about using the bq show command, see the following:
- Getting information about datasets
- Creating and using tables
- Getting information about views
- Working with transfers
- Managing jobs
- Getting information about a table snapshot
bq update
Use the bq update command to change a resource.
Synopsis
bq update [FLAGS] [RESOURCE]
Flags and arguments
The bq update command uses the following flags and arguments:
- --add_tags=TAGS
- Only available on datasets and tables. Specifies the tags that you are attaching to the resource, separated by commas. For example, 556741164180/env:prod,myProject/department:sales. Each tag must have the namespaced key name and value short name.
- --autoscale_max_slots=NUMBER_OF_AUTOSCALING_SLOTS
- The number of autoscaling slots assigned to the reservation. This is equal to the value of the max reservation size minus the number of baseline slots. Only available with the --reservation flag, and only if the reservation was created with an edition.
- --capacity_commitment={true|false}
- To update a capacity commitment, set to true. Use this flag with the --merge, --plan, --renewal_plan, --split, and --slots flags.
- --clear_all_tags={true|false}
- Only available on datasets and tables. To clear all tags from a resource, set to true. The default value is false.
- --clear_label=KEY:VALUE
- Removes a label from the resource. Use the format KEY:VALUE to specify the label to remove. Repeat this flag to remove multiple labels.
- --clustering_fields=COLUMNS
- Updates a table's clustering specification. The COLUMNS value is a comma-separated list of column names to use for clustering. To remove the clustering, set COLUMNS to "" (the empty string). For more information, see Modify the clustering specification.
- --target_job_concurrency=CONCURRENCY
- When used with the --reservation flag, specifies the target number of queries that execute simultaneously. The default value is 0, which means that the concurrency is automatically set based on the reservation size. For more information, see Use query queues.
- --dataset={true|false} or -d={true|false}
- To update a dataset, set to true. The default value is false.
- --default_kms_key=KEY
- Specifies the default Cloud KMS key resource ID for encrypting table data in a dataset. The default key is used if no explicit key is provided for a table creation or a query.
- --default_partition_expiration=SECONDS
- An integer that specifies the default expiration time, in seconds, for all partitions in newly created partitioned tables in the dataset. This flag has no minimum value.
  A partition's expiration time is set to the partition's UTC date plus the integer value. If this property is set, then it overrides the dataset-level default table expiration if it exists. If you supply the --time_partitioning_expiration flag when you create or update a partitioned table, then the table-level partition expiration takes precedence over the dataset-level default partition expiration. Specify 0 to remove an existing expiration.
- --default_table_expiration=SECONDS
- An integer that updates the default lifetime, in seconds, for newly created tables in a dataset. The expiration time is set to the current UTC time plus this value. Specify 0 to remove the existing expiration.
- --description=DESCRIPTION
- Updates the description of a dataset, table, table snapshot, model, or view. 
- --destination_reservation_id=RESERVATION_ID
- When used with the --reservation_assignment flag, moves an existing reservation assignment to the specified reservation. The value is the ID of the destination reservation. For more information, see Move an assignment to a different reservation.
- --display_name=DISPLAY_NAME
- Updates the display name for a transfer configuration. 
- --etag=ETAG
- Acts as a filter; updates the resource only if the resource has an ETag that matches the string specified in the ETAG argument.
- --expiration SECONDS
- To update the expiration for the table, model, table snapshot, or view, include this flag. Replace SECONDS with the number of seconds from the update time to the expiration time. To remove the expiration for a table, model, table snapshot, or view, set the SECONDS argument to 0.
- --external_table_definition={TABLE::PATH_TO_FILE|TABLE::DEFINITION}
- Updates an external table with the specified table definition. The table definition can be a path to a local JSON table definition file or an inline table definition in the format SCHEMA@SOURCE_FORMAT=CLOUD_STORAGE_URI. The SCHEMA value is a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE. If you use a table definition file, don't add an extension to the filename.
  For example:
  --external_table_definition=myTable::/tmp/tabledef
  --external_table_definition=myTable::Region:STRING,Quarter:STRING,Total_sales:INTEGER@CSV=gs://mybucket/sales.csv
- --ignore_idle_slots={true|false}
- Use with the --reservation flag. To restrict jobs running in the specified reservation to only use slots allocated to that reservation, set to true. The default value is false; jobs in the specified reservation can use idle slots from other reservations, or slots that are not allocated to any reservation. For more information, see Idle slots.
- --max_staleness=INTERVAL
- Specifies an INTERVAL value that determines the maximum staleness allowed when querying a materialized view or an external table. The default value is 0-0 0 0:0:0. For example:
  - 1 day: 0-0 1 0:0:0
  - 1 hour: 0-0 0 1:0:0
  To use this flag, you must specify a table definition with the --external_table_definition flag.
- --max_time_travel_hours=HOURS
- Specifies the duration in hours of the time travel window for the dataset. The --max_time_travel_hours value must be an integer expressed in multiples of 24 (48, 72, 96, 120, 144, 168) between 48 (2 days) and 168 (7 days).
- --merge={true|false}
- To merge two capacity commitments, set --merge to true. Set the --capacity_commitment flag to true, specify the location of the commitments you want to merge by using the --location flag, and replace RESOURCE with the IDs of the two commitments you want to merge, separated by a comma. For more information, see Merge two commitments.
- --metadata_cache_mode=METADATA_CACHE_MODE
- Enables metadata caching for an external table with a connection. Use one of the following values:
  - AUTOMATIC
  - MANUAL
  Specify AUTOMATIC to automatically refresh the cached metadata. Specify MANUAL to stop the automatic refresh. To use this flag, you must specify a table definition with the --external_table_definition flag.
- --model={true|false} or -m={true|false}
- To update metadata for a BigQuery ML model, set to true. The default value is false.
- --params={"PARAMETER":"VALUE"} or -p={"PARAMETER":"VALUE"}
- Updates parameters for a transfer configuration. The parameters vary depending on the data source. For more information, see Introduction to BigQuery Data Transfer Service. 
- --plan=PLAN
- When used with the --capacity_commitment flag, converts a capacity commitment to the specified longer-duration commitment plan. Replace PLAN with one of the following:
  - ANNUAL
  - THREE_YEAR
- --refresh_window_days=DAYS
- An integer that specifies an updated refresh window (in days) for a transfer configuration. 
- --remove_tags=TAG_KEYS
- Only available on datasets and tables. Specifies the tags that you are removing from the resource, separated by commas. For example, 556741164180/env,myProject/department. Each tag key must have the namespaced key name.
- --renewal_plan=PLAN
- When used with the --capacity_commitment flag, updates the renewal plan for an annual capacity commitment. Replace PLAN with one of the following:
  - ANNUAL
  - THREE_YEAR
  - NONE
  Customers using legacy flat-rate pricing also may use one of the following values:
  - FLEX
  - MONTHLY
  - ANNUAL
- --reservation={true|false}
- Specifies whether to update a reservation. The default value is false.
- --reservation_assignment={true|false}
- Specifies whether to update a reservation assignment. The default value is false.
- --schema={SCHEMA_FILE|SCHEMA}
- Specifies either the path to a local JSON schema file or a comma-separated list of column definitions in the form FIELD:DATA_TYPE, FIELD:DATA_TYPE. If you use a schema file, don't add an extension to the filename.
  For example:
  --schema=/tmp/tabledef
  --schema=Region:STRING,Quarter:STRING,Total_sales:INTEGER
- --service_account_name=SERVICE_ACCOUNT
- Specifies a service account to be used as the credential for a transfer configuration. 
- --set_label=KEY:VALUE
- Specifies a label to update. To update multiple labels, repeat this flag. 
- --slots=NUMBER_OF_BASELINE_SLOTS
- When used with the --capacity_commitment and --split flags, specifies the number of baseline slots to split from an existing capacity commitment into a new commitment. Replace RESOURCE with the ID of the commitment you want to split from.
  When used with the --reservation flag, updates the number of slots in a reservation.
- --source=FILE
- The path to a local JSON file containing a payload used to update a resource. For example, you can use this flag to specify a JSON file that contains a dataset resource with an updated access property. The file is used to overwrite the dataset's access controls. The JSON file must not include a byte order mark (BOM).
- --split={true|false}
- When set to true and used with the --capacity_commitment flag, specifies that you want to split an existing capacity commitment. Use the --location flag to specify the location of the commitment you want to split from, and use the --slots flag to specify the number of slots you want to split off. Replace RESOURCE with the ID of the commitment you want to split from. For more information, see Split a commitment.
- --storage_billing_model=BILLING_MODEL
- Specifies the storage billing model for a dataset. Set the --storage_billing_model value to PHYSICAL to use physical bytes when calculating storage charges, or to LOGICAL to use logical bytes.
  When you change a dataset's billing model, it takes 24 hours for the change to take effect. Once you change a dataset's storage billing model, you must wait 14 days before you can change the storage billing model again.
- --table={true|false} or -t={true|false}
- Specifies whether to update a table. The default value is false.
- --target_dataset=DATASET
- When specified, updates the target dataset for a transfer configuration. 
- --time_partitioning_expiration=SECONDS
- An integer that updates (in seconds) when a time-based partition should be deleted. The expiration time evaluates to the partition's UTC date plus the integer value. A negative number indicates no expiration. 
- --time_partitioning_field=COLUMN_NAME
- Updates the field used to determine how to create a time-based partition. If time-based partitioning is enabled without this value, then the table is partitioned based on the load time. 
- --time_partitioning_type=INTERVAL
- Specifies the partitioning type. Use one of the following values:
  - DAY
  - HOUR
  - MONTH
  - YEAR
  You can't change the partitioning type of an existing table.
- --transfer_config={true|false}
- Specifies whether to update a transfer configuration. The default value is false.
- --update_credentials={true|false}
- Specifies whether to update the transfer configuration credentials. The default value is false.
- --use_legacy_sql={true|false}
- Set to false to update the SQL query for a view from legacy SQL to GoogleSQL. The default value is determined by your configuration settings. If the configuration setting is not specified, the default value is true; the query uses legacy SQL.
- --vertex_ai_model_id=VERTEX_AI_MODEL_ID
- When specified, updates the model ID for a BigQuery ML model that is registered in the Vertex AI Model Registry. 
- --view=QUERY
- When specified, updates the SQL query for a view. 
- --view_udf_resource=FILE
- Updates the Cloud Storage URI or the path to a local code file that is loaded and evaluated immediately as a user-defined function resource in a view's SQL query. Repeat this flag to specify multiple files. 
- RESOURCE
- The resource that you want to update. 
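For example, a table's description and expiration might be updated with a command like the following; the dataset, table, description, and expiration value are placeholders:
bq update \
--description="Cleaned sales data" \
--expiration=3600 \
myDataset.myTable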
For more information about using the bq update command, see the following:
- Updating dataset properties
- Managing tables
- Updating views
- Updating labels
- Working with transfers
- Updating table snapshot metadata
bq version
Use the bq version command to display the version number of your bq command-line tool.
Synopsis
bq version
bq wait
Use the bq wait command to wait a specified number of seconds for a job
to finish. If a job isn't specified, then the command waits for the current
job to finish.
Synopsis
bq wait [FLAGS] [JOB] [SECONDS]
Examples
bq wait
bq wait --wait_for_status=RUNNING 12345 100
Flags and arguments
The bq wait command uses the following flags and arguments:
- --fail_on_error={true|false}
- To return success if the job completed during the wait time, even if the job failed, set to false. The default value is true; after the wait time elapses, the command exits with an error if the job is still running, or if the job completed but failed.
- --wait_for_status=STATUS
- When specified, waits for a particular job status before exiting. Use one of the following values:
  - PENDING
  - RUNNING
  - DONE
  The default value is DONE.
- JOB
- Specifies the job to wait for. You can use the bq ls --jobs myProject command to find a job identifier.
- SECONDS
- Specifies the maximum number of seconds to wait until the job is finished. If you enter 0, then the command checks the job status and returns immediately. If you don't specify an integer value, then the command waits until the job is finished.