Jobs Page

In the Jobs page, you can track the status of all of your jobs. You can only see jobs for the flows to which you have access in your currently selected project.Jobs can be initiated from:

Figure: Jobs page

Job Types:

Each job listed in the Jobs page is a grouping of related jobs acting on the same recipe and dataset(s). Each of these jobgroups breaks down into one or more of the following job types.

Tip: To review the status of individual jobs within a jobgroup, hover over the value in the Status column for the jobgroup.

  • Transform: These jobs perform transformations on imported datasets based on the recipe from which the job was launched.
  • Profile: If enabled as part of the job definition, a Profile job generates a visual summary of the results of your transformation job.
    • Profiling jobs may take longer than transformation jobs.
    • Even when selected, profiling jobs may not appear in the Jobs page. In some cases, the running environment may fold a profiling job into a transform job for optimization reasons.

      NOTE: When the profiling job is run as part of the transform job, there is no listing for profiling in the mouse-over popup.

    • See Job Results Page.
  • Publishing: As needed, job results can be published from their target location to another location or data store. These jobs are tracked separately as publishing jobs. For more information, see Export Results Window.

    Publishing jobs also include internal tasks of writing results in file format to the designated location in the base storage layer. These jobs exist in all jobgroups.

Access:

  • You can review and drill into any job that you initiated.

Columns:

  • Job: Internal identifier for the job. This value is unique for all jobs in your Cloud Dataprep deployment.
  • User: The Cloud Dataprep user that initiated the job.

  • Flow/Output:
  • Status:

    • Queued: Job has been queued for execution.
    • Running: Job is in progress.
    • Completed: Job has successfully executed.

    • Canceled: Job was canceled by user.

      Failed: Job failed to complete.

      NOTE: You can re-run a failed job from the Transformer page. If you have since modified the recipe, those changes are applied during the second run. See Transformer Page.

    • Publish Failed: Publishing job failed to complete.
  • Started: Start timestamp for the job.

Actions:

  • Filter: Click one of the tabs to filter the display to show only the listings for the selected job status.
  • Search: Enter text in the search field to filter the listed jobs by job ID, flow name, dataset name, or the name of the user who created it. The list is filtered as you type.

Context menu:

Next to the job listing, click the options menu to see the following:

  • Cloud Dataflow Job: View the job on Cloud Dataflow.
  • Export Results: Export the results of a completed job. See Export Results Window.
  • View steps and dependencies: View steps of the recipe being executed and any dependencies referenced in the recipe.

Was this page helpful? Let us know how we did:

Send feedback about...

Google Cloud Dataprep Documentation