In the following sections, you can review short summaries of specific features and explore more detailed information on them.
These features can be applied to individual flows to simplify job execution.
Parameterization enables you to specify parameters that capture variability in your data source paths or names. For example, you can parameterize the names of folders in your filepaths to capture files within multiple folders. Or, you can parameterize your inputs to capture datasets named within a specific time range. Nested folders of data can be parameterized, too.
- dataset parameters: Parameterize the input paths to your data, allowing you to process data in parallel files and tables through the same flow.
- output parameters: Parameterize the output paths for your results.
flow parameters: Define parameters that can be applied in your flows, including recipe steps.
Tip: You can apply overrides to any parameter at the flow level. These parameter override values are applied to any parameter that is referenced within the flow for any supported parameter type.
NOTE: Some of the following may not be available in your product edition.
Use regular expressions or Cloud Dataprep patterns in your paths or queries to sources to capture a broader set of inputs.
|Wildcard||Replace parts of your paths or queries with wildcards.|
|Datetime||You can specify parameterized Datetime values in one of the supported formats.|
|Variable||Variable values can be specified as overrides during import, job execution, and output.|
Parameterization is available for the following:
For more information, see Overview of Parameterization.
The scheduling feature, also known as Automator, enables you to schedule the execution of individual flows on a specified frequency. Frequencies can be specified through the Cloud Dataprep application through a simple interface or, if needed, in a modified form of cron syntax.
Tip: Automator is often used with parameterization to fully automate data preparation processes in Cloud Dataprep by TRIFACTA INC..
For more information, see Overview of Automator.
After a job has been launched, detailed monitoring permits you to track the progress of your job during all phases of execution. Status, job stats, inputs, outputs and a flow snapshot are available through the Cloud Dataprep application.For more information, see Overview of Job Monitoring.