Supported File Formats

This section contains information on the fie formats and compression schemes that are supported for input to and output of Cloud Dataprep by TRIFACTA®.

NOTE: To work with formats that are proprietary to a desktop application, such as Microsoft Excel, you do not need the supporting application installed on your desktop.

Native Input File Formats:

Cloud Dataprep by TRIFACTA® can read and import directly these file formats:

  • Excel (XLS/XLSX), upload only

    Tip: You may import multiple worksheets from a single workbook at one time. See Import Excel Data.

  • CSV
  • JSON, including nested

    NOTE: Cloud Dataprep by TRIFACTA requires that JSON files be submitted with one valid JSON object per line. Consistently malformed JSON objects or objects that overlap linebreaks might cause import to fail. See Initial Parsing Steps.

  • Plain Text
  • LOG
  • TSV

  • Avro

    NOTE: When working with datasets sourced from Avro files, lineage information and the SOURCEROWNUMBER function are not supported.

For more information on data is handled initially, see Initial Parsing Steps.

Native Output File Formats:

Cloud Dataprep by TRIFACTA can write to these file formats:

  • CSV
  • JSON

  • Avro

  • BQ Table

Compression Algorithms:Read Native File Formats:

GZIPBZIPSnappy
CSV SupportedSupportedSupported
JSONSupportedSupportedSupported
Avro Supported
Write Native File Formats:
GZIPBZIPSnappy
CSVSupportedSupportedNot supported
JSONSupportedSupportedNot supported

Was this page helpful? Let us know how we did:

Send feedback about...

Google Cloud Dataprep Documentation