Header Transform

NOTE: Transforms are a part of the underlying language, which is not directly accessible to users. This content is maintained for reference purposes only. For more information on the user-accessible equivalent to transforms, see Transformation Reference.

Uses one row from the dataset sample as the header row for the table. Each value in this row becomes the name of the column in which it is located.

This transform might be automatically applied as one of the first steps of your recipe. See Initial Parsing Steps.

NOTE: If source row number information is not available due to changes in the dataset, this transform may not be available.

Basic Usage

header sourcerownumber: 4

Output: The values from Row #4 of the original dataset are used, if available, as the names for each column. If the row is not available, the specified row data can be retrieved, and the transform fails.

Parameters

header sourcerownumber: row_num

TokenRequired?Data TypeDescription
headerYtransformName of the transform
sourcerownumberYinteger (positive)Row number from the original data to use as the header.

For more information on syntax standards, see Language Documentation Syntax Notes.

sourcerownumber

The sourcerownumber parameter defines the row number to apply to the transform step.

This parameter references the original row number of the sample in the dataset.

  • sourceownumber parameter must be an integer that is less than or equal to the total number of rows in the original sample.
  • If the corresponding row has been deleted from the dataset, the transform step generates an error.

Example:

header sourcerownumber: 4

Output: Uses row #4 from the source row numbers of the sample as the header the columns.

Usage Notes:

Required?Data Type
Yesinteger (positive)

Examples

Example - Header from row that is not the first one

Source:

You have imported the following racer data on heat times from a CSV file. When loaded in the Transformer page, it looks like the following:

(rowId)column2column3column4column5
1RacerHeat 1Heat 2Heat 3
2Racer X37.2238.2237.61
3Racer Y41.33DQ38.04
4Racer Z39.2739.0438.85

In the above, the (rowId) column references the row numbers displayed in the data grid; it is not part of the dataset. This information is available when you hover over the black dot on the left side of the screen.

Transform:

You have examined the best performance in each heat according to the sample. You then notice that the data contains headers, but you forget how it was originally sorted. The data now looks like the following:

(rowId)column2column3column4column5
1Racer Y41.33DQ38.04
2RacerHeat 1Heat 2Heat 3
3Racer X37.2238.2237.61
4Racer Z39.2739.0438.85

You can use the following transformation to use the third row as your header for each column:

NOTE: The following does not use the header transform.

rename type: header method: index sourcerownumber: 3

Results:

After you have applied the last header transform, your data should look like the following:

(rowId)RacerHeat_1Heat_2Heat_3
3Racer Y41.33DQ38.04
2Racer X37.2238.2237.61
4Racer Z39.2739.0438.85

Var denne siden nyttig? Si fra hva du synes:

Send tilbakemelding om ...

Google Cloud Dataprep Documentation
Trenger du hjelp? Gå til brukerstøttesiden vår.