- Resource: Job
- JobReference
- JobPlacement
- JobStatus
- State
- Substate
- YarnApplication
- State
- DriverSchedulingConfig
- Methods
Resource: Job
A Dataproc job resource.
JSON representation |
---|
{ "reference": { object ( |
Fields | |
---|---|
reference |
Optional. The fully qualified reference to the job, which can be used to obtain the equivalent REST path of the job resource. If this property is not specified when a job is created, the server generates a
. |
placement |
Required. Job information, including how, when, and where to run the job. |
status |
Output only. The job status. Additional application-specific status information might be contained in the
and
fields. |
status |
Output only. The previous job status. |
yarn |
Output only. The collection of YARN applications spun up by this job. Beta Feature: This report is available for testing purposes only. It might be changed before final release. |
driver |
Output only. A URI pointing to the location of the stdout of the job's driver program. |
driver |
Output only. If present, the location of miscellaneous control files which can be used as part of job setup and handling. If not present, control files might be placed in the same location as |
labels |
Optional. The labels to associate with this job. Label keys must contain 1 to 63 characters, and must conform to RFC 1035. Label values can be empty, but, if present, must contain 1 to 63 characters, and must conform to RFC 1035. No more than 32 labels can be associated with a job. An object containing a list of |
scheduling |
Optional. Job scheduling configuration. |
job |
Output only. A UUID that uniquely identifies a job within the project over time. This is in contrast to a user-settable reference.job_id that might be reused over time. |
done |
Output only. Indicates whether the job is completed. If the value is |
driver |
Optional. Driver scheduling configuration. |
Union field type_job . Required. The application/framework-specific portion of the job. type_job can be only one of the following: |
|
hadoop |
Optional. Job is a Hadoop job. |
spark |
Optional. Job is a Spark job. |
pyspark |
Optional. Job is a PySpark job. |
hive |
Optional. Job is a Hive job. |
pig |
Optional. Job is a Pig job. |
spark |
Optional. Job is a SparkR job. |
spark |
Optional. Job is a SparkSql job. |
presto |
Optional. Job is a Presto job. |
flink |
Optional. Job is a Flink job. |
JobReference
Encapsulates the full scoping used to reference a job.
JSON representation |
---|
{ "projectId": string, "jobId": string } |
Fields | |
---|---|
project |
Optional. The ID of the Google Cloud Platform project that the job belongs to. If specified, must match the request project ID. |
job |
Optional. The job ID, which must be unique within the project. The ID must contain only letters (a-z, A-Z), numbers (0-9), underscores (_), or hyphens (-). The maximum length is 100 characters. If not specified by the caller, the job ID will be provided by the server. |
JobPlacement
Dataproc job config.
JSON representation |
---|
{ "clusterName": string, "clusterUuid": string, "clusterLabels": { string: string, ... } } |
Fields | |
---|---|
cluster |
Required. The name of the cluster where the job will be submitted. |
cluster |
Output only. A cluster UUID generated by the Dataproc service when the job is submitted. |
cluster |
Optional. Cluster labels to identify a cluster where the job will be submitted. An object containing a list of |
JobStatus
Dataproc job status.
JSON representation |
---|
{ "state": enum ( |
Fields | |
---|---|
state |
Output only. A state message specifying the overall job state. |
details |
Optional. Output only. Job state details, such as an error description if the state is |
state |
Output only. The time when this state was entered. A timestamp in RFC3339 UTC "Zulu" format, with nanosecond resolution and up to nine fractional digits. Examples: |
substate |
Output only. Additional state information, which includes status reported by the agent. |
State
The job state.
Enums | |
---|---|
STATE_UNSPECIFIED |
The job state is unknown. |
PENDING |
The job is pending; it has been submitted, but is not yet running. |
SETUP_DONE |
Job has been received by the service and completed initial setup; it will soon be submitted to the cluster. |
RUNNING |
The job is running on the cluster. |
CANCEL_PENDING |
A jobs.cancel request has been received, but is pending. |
CANCEL_STARTED |
Transient in-flight resources have been canceled, and the request to cancel the running job has been issued to the cluster. |
CANCELLED |
The job cancellation was successful. |
DONE |
The job has completed successfully. |
ERROR |
The job has completed, but encountered an error. |
ATTEMPT_FAILURE |
Job attempt has failed. The detail field contains failure details for this attempt. Applies to restartable jobs only. |
Substate
The job substate.
Enums | |
---|---|
UNSPECIFIED |
The job substate is unknown. |
SUBMITTED |
The Job is submitted to the agent. Applies to RUNNING state. |
QUEUED |
The Job has been received and is awaiting execution (it might be waiting for a condition to be met). See the "details" field for the reason for the delay. Applies to RUNNING state. |
STALE_STATUS |
The agent-reported status is out of date, which can be caused by a loss of communication between the agent and Dataproc. If the agent does not send a timely update, the job will fail. Applies to RUNNING state. |
YarnApplication
A YARN application created by a job. Application information is a subset of
org.apache.hadoop.yarn.proto.YarnProtos.ApplicationReportProto
.
Beta Feature: This report is available for testing purposes only. It may be changed before final release.
JSON representation |
---|
{
"name": string,
"state": enum ( |
Fields | |
---|---|
name |
Required. The application name. |
state |
Required. The application state. |
progress |
Required. The numerical progress of the application, from 1 to 100. |
tracking |
Optional. The HTTP URL of the ApplicationMaster, HistoryServer, or TimelineServer that provides application-specific information. The URL uses the internal hostname, and requires a proxy server for resolution and, possibly, access. |
State
The application state, corresponding to
YarnProtos.YarnApplicationStateProto
.
Enums | |
---|---|
STATE_UNSPECIFIED |
Status is unspecified. |
NEW |
Status is NEW. |
NEW_SAVING |
Status is NEW_SAVING. |
SUBMITTED |
Status is SUBMITTED. |
ACCEPTED |
Status is ACCEPTED. |
RUNNING |
Status is RUNNING. |
FINISHED |
Status is FINISHED. |
FAILED |
Status is FAILED. |
KILLED |
Status is KILLED. |
DriverSchedulingConfig
Driver scheduling configuration.
JSON representation |
---|
{ "memoryMb": integer, "vcores": integer } |
Fields | |
---|---|
memory |
Required. The amount of memory in MB the driver is requesting. |
vcores |
Required. The number of vCPUs the driver is requesting. |
Methods |
|
---|---|
|
Starts a job cancellation request. |
|
Deletes the job from the project. |
|
Gets the resource representation for a job in a project. |
|
Gets the access control policy for a resource. |
|
Lists regions/{region}/jobs in a project. |
|
Updates a job in a project. |
|
Sets the access control policy on the specified resource. |
|
Submits a job to a cluster. |
|
Submits job to a cluster. |
|
Returns permissions that a caller has on the specified resource. |