Remove Initial Structure

When you import a dataset from a file, Cloud Dataprep by TRIFACTA® INC. attempts to detect the structure of the file and applies an initial set of parsing steps so that the data can be displayed in tabular form in the Transformer page. For example, a JSON file can be rendered as a table if its structure lends itself to a tabular representation.

NOTE: Initial parsing steps are applied only to file-based sources of data.

These steps vary depending on the format of the file that is being imported. For more information, see Initial Parsing Steps.

Depending on the dataset, you may need to modify these steps or rebuild them altogether. You can use the following steps to prevent Cloud Dataprep by TRIFACTA® INC. from detecting the structure and automatically hiding these steps, so that they appear in the recipe where you can edit them.

Tip: You should allow the product to detect the structure first. If it does not detect the structure well, you can experiment with disabling it and rebuilding the steps to meet your dataset requirements.

Tip: If you have already imported the dataset and need to change this setting, you can re-import the source and change the settings. In any flows that use the previously imported version of this dataset, you can change the input for any recipe that uses the old version to use this newly imported version. See Flow View Page.

NOTE: After you complete the steps below, the initial parsing steps are listed in any recipe that you create from the imported dataset. If you wish to remove them altogether, you can delete them from the recipe.

Steps:

  1. After you have selected or specified the file to import in the Import Data page, click Edit Settings for the dataset card in the right panel.
  2. Deselect the Detect Structure checkbox.
  3. Continue the import process.
  4. When the imported dataset is added to a flow, it is listed as an unstructured dataset.
  5. Select the dataset and click Create new recipe.
  6. When you select the recipe, the initial parsing steps are listed in the right panel.
  7. When the dataset is loaded into the Transformer page, you can modify these steps to improve the parsing or delete them altogether.

    NOTE: Any step that breaks up the data into individual rows must be the first step in the recipe. To create this step, enter Break into rows in the Search panel. See Search Panel.
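
To see why a row-breaking step has to come first, it can help to picture an unstructured dataset as a single block of text. The Python sketch below is only an illustration of that idea and is not how Cloud Dataprep implements its parsing steps. The function names (break_into_rows, split_columns, promote_header) and the sample data are hypothetical.

```python
# Illustrative sketch only: hypothetical helpers that mimic the order of
# typical initial parsing steps on raw, unstructured text.

def break_into_rows(raw, delimiter="\n"):
    """Split the raw text into rows, dropping empty trailing lines."""
    return [line for line in raw.split(delimiter) if line]

def split_columns(rows, delimiter=","):
    """Split each row into a list of column values."""
    return [row.split(delimiter) for row in rows]

def promote_header(table):
    """Use the first row as column names for the remaining rows."""
    header, *body = table
    return [dict(zip(header, values)) for values in body]

# Hypothetical sample input: the whole file as one string.
raw = "id,name\n1,Ada\n2,Grace\n"

rows = break_into_rows(raw)      # must run first: later steps operate on rows
table = split_columns(rows)      # each row becomes a list of values
records = promote_header(table)  # first row supplies the column names
```

Each later helper takes the rows produced by break_into_rows as its input, which mirrors the requirement that the row-breaking step appear first in the recipe.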
