Supported File Formats

NOTE: To work with formats that are proprietary to a desktop application, such as Microsoft Excel, you do not need the supporting application installed on your desktop.

Native Input File Formats:

Cloud Dataprep can read and import directly these file formats:

  • Excel (XLS/XLSX), upload only

    Tip: You may import multiple worksheets from a single workbook at one time. See Import Excel Data.

  • CSV
  • JSON, including nested

    NOTE: Cloud Dataprep requires that JSON files be submitted with one valid JSON object per line. Consistently malformed JSON objects or objects that overlap linebreaks might cause import to fail. See Initial Parsing Steps.

  • Plain Text
  • LOG
  • TSV

  • Avro

    NOTE: When working with datasets sourced from Avro files, lineage information and the SOURCEROWNUMBER function are not supported.

For more information on data is handled initially, see Initial Parsing Steps.

Native Output File Formats:

Cloud Dataprep can write to these file formats:

  • CSV
  • JSON

  • Avro

  • BQ Table

Compression Algorithms:Read Native File Formats:

CSV SupportedSupportedSupported
Avro Supported
Write Native File Formats:
CSVSupportedSupportedNot supported
JSONSupportedSupportedNot supported

Send feedback about...

Google Cloud Dataprep Documentation