Resource: Job
A Dataproc job resource.
JSON representation |
---|
{ "reference": { object ( |
Fields | |
---|---|
reference |
Optional. The fully qualified reference to the job, which can be used to obtain the equivalent REST path of the job resource. If this property is not specified when a job is created, the server generates a job ID. |
placement |
Required. Job information, including how, when, and where to run the job. |
status |
Output only. The job status. Additional application-specific status information might be contained in the
and
fields. |
statusHistory[] |
Output only. The previous job status. |
yarnApplications[] |
Output only. The collection of YARN applications spun up by this job. Beta Feature: This report is available for testing purposes only. It might be changed before final release. |
driverOutputResourceUri |
Output only. A URI pointing to the location of the stdout of the job's driver program. |
driverControlFilesUri |
Output only. If present, the location of miscellaneous control files which can be used as part of job setup and handling. If not present, control files might be placed in the same location as |
labels |
Optional. The labels to associate with this job. Label keys must contain 1 to 63 characters, and must conform to RFC 1035. Label values can be empty, but, if present, must contain 1 to 63 characters, and must conform to RFC 1035. No more than 32 labels can be associated with a job. An object containing a list of |
scheduling |
Optional. Job scheduling configuration. |
jobUuid |
Output only. A UUID that uniquely identifies a job within the project over time. This is in contrast to a user-settable reference.job_id that might be reused over time. |
done |
Output only. Indicates whether the job is completed. If the value is |
driverSchedulingConfig |
Optional. Driver scheduling configuration. |
Union field type_job. Required. The application/framework-specific portion of the job. type_job can be only one of the following: |
|
hadoopJob |
Optional. Job is a Hadoop job. |
sparkJob |
Optional. Job is a Spark job. |
pysparkJob |
Optional. Job is a PySpark job. |
hiveJob |
Optional. Job is a Hive job. |
pigJob |
Optional. Job is a Pig job. |
sparkRJob |
Optional. Job is a SparkR job. |
sparkSqlJob |
Optional. Job is a SparkSql job. |
prestoJob |
Optional. Job is a Presto job. |
flinkJob |
Optional. Job is a Flink job. |
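The union-field and label-count rules above can be checked before a request is sent. The following is a minimal sketch, assuming the job body is assembled as a plain Python dict. The helper name `build_job`, the cluster name, and the empty `sparkJob` payload are hypothetical; the contents of each job-type object are not described in this section.

```python
# Member names of the type_job union, as listed in the Fields table.
TYPE_JOB_FIELDS = frozenset({
    "hadoopJob", "sparkJob", "pysparkJob", "hiveJob", "pigJob",
    "sparkRJob", "sparkSqlJob", "prestoJob", "flinkJob",
})

# "No more than 32 labels can be associated with a job."
MAX_LABELS = 32


def build_job(cluster_name, type_job, labels=None):
    """Return a Job body dict, or raise ValueError if a stated rule is broken.

    type_job is a one-entry dict such as {"sparkJob": {...}} (hypothetical payload).
    """
    present = TYPE_JOB_FIELDS.intersection(type_job)
    if len(type_job) != 1 or len(present) != 1:
        raise ValueError(
            "type_job must set exactly one of: " + ", ".join(sorted(TYPE_JOB_FIELDS))
        )
    labels = dict(labels or {})
    if len(labels) > MAX_LABELS:
        raise ValueError(f"at most {MAX_LABELS} labels are allowed, got {len(labels)}")

    # placement is the only field marked Required besides the union member.
    job = {"placement": {"clusterName": cluster_name}}
    job.update(type_job)
    if labels:
        job["labels"] = labels
    return job
```

For example, `build_job("my-cluster", {"sparkJob": {}})` returns a dict with `placement` and `sparkJob` keys, while passing both `sparkJob` and `hiveJob` raises `ValueError`. Output-only fields such as `status` and `jobUuid` are deliberately left out, since the server populates them.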
JobReference
Encapsulates the full scoping used to reference a job.
JSON representation |
---|
{ "projectId": string, "jobId": string } |
Fields | |
---|---|
projectId |
Optional. The ID of the Google Cloud Platform project that the job belongs to. If specified, must match the request project ID. |
jobId |
Optional. The job ID, which must be unique within the project. The ID must contain only letters (a-z, A-Z), numbers (0-9), underscores (_), or hyphens (-). The maximum length is 100 characters. If not specified by the caller, the job ID will be provided by the server. |
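The character and length constraints on `jobId` translate directly into a regular expression. This is a minimal client-side sketch; the function name `is_valid_job_id` is hypothetical, and the lower bound of one character is an assumption (an empty value is treated as "not specified").

```python
import re

# Letters (a-z, A-Z), digits (0-9), underscores, or hyphens; at most 100 characters.
# The minimum length of 1 is an assumption, not stated in the field description.
_JOB_ID_RE = re.compile(r"[A-Za-z0-9_-]{1,100}")


def is_valid_job_id(job_id: str) -> bool:
    """Return True if job_id satisfies the documented jobId constraints."""
    return _JOB_ID_RE.fullmatch(job_id) is not None
```

Uniqueness within the project cannot be checked locally; only the server can enforce it.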
JobPlacement
Dataproc job config.
JSON representation |
---|
{ "clusterName": string, "clusterUuid": string, "clusterLabels": { string: string, ... } } |
Fields | |
---|---|
clusterName |
Required. The name of the cluster where the job will be submitted. |
clusterUuid |
Output only. A cluster UUID generated by the Dataproc service when the job is submitted. |
clusterLabels |
Optional. Cluster labels to identify a cluster where the job will be submitted. An object containing a list of |
JobStatus
Dataproc job status.
JSON representation |
---|
{ "state": enum ( |
Fields | |
---|---|
state |
Output only. A state message specifying the overall job state. |
details |
Optional. Output only. Job state details, such as an error description if the state is |
stateStartTime |
Output only. The time when this state was entered. Uses RFC 3339, where generated output will always be Z-normalized and uses 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples: |
substate |
Output only. Additional state information, which includes status reported by the agent. |
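Because `stateStartTime` can carry up to nine fractional digits, some standard-library parsers reject it. The sketch below shows one way to parse such a value into a timezone-aware `datetime`, assuming truncation to microseconds is acceptable. The function name `parse_state_start_time` and the sample timestamps in the usage note are hypothetical.

```python
import re
from datetime import datetime, timedelta, timezone

# Date, time, optional fractional seconds, then "Z" or a numeric offset.
_RFC3339_RE = re.compile(
    r"(\d{4})-(\d{2})-(\d{2})[Tt](\d{2}):(\d{2}):(\d{2})"
    r"(?:\.(\d+))?"
    r"([Zz]|[+-]\d{2}:\d{2})"
)


def parse_state_start_time(value):
    """Parse an RFC 3339 timestamp and return it normalized to UTC."""
    match = _RFC3339_RE.fullmatch(value)
    if match is None:
        raise ValueError(f"not an RFC 3339 timestamp: {value!r}")
    year, month, day, hour, minute, second = (int(g) for g in match.groups()[:6])

    # datetime stores microseconds only, so digits beyond the sixth are dropped.
    fraction = match.group(7) or ""
    micros = int(fraction[:6].ljust(6, "0")) if fraction else 0

    offset = match.group(8)
    if offset in ("Z", "z"):
        tz = timezone.utc
    else:
        sign = 1 if offset[0] == "+" else -1
        tz = timezone(sign * timedelta(hours=int(offset[1:3]), minutes=int(offset[4:6])))

    local = datetime(year, month, day, hour, minute, second, micros, tzinfo=tz)
    return local.astimezone(timezone.utc)
```

With this sketch, `parse_state_start_time("2024-01-02T03:04:05.123456789Z")` and `parse_state_start_time("2024-01-02T05:04:05.123456+02:00")` refer to the same instant. Leap seconds (`:60`) are not handled and raise `ValueError`.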
State
The job state.
Enums | |
---|---|
STATE_UNSPECIFIED |
The job state is unknown. |
PENDING |
The job is pending; it has been submitted, but is not yet running. |
SETUP_DONE |
Job has been received by the service and completed initial setup; it will soon be submitted to the cluster. |
RUNNING |
The job is running on the cluster. |
CANCEL_PENDING |
A jobs.cancel request has been received, but is pending. |
CANCEL_STARTED |
Transient in-flight resources have been canceled, and the request to cancel the running job has been issued to the cluster. |
CANCELLED |
The job cancellation was successful. |
DONE |
The job has completed successfully. |
ERROR |
The job has completed, but encountered an error. |
ATTEMPT_FAILURE |
The job attempt has failed. The details field contains failure details for this attempt. Applies to restartable jobs only. |
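A client that polls a job usually needs to know when to stop. The sketch below models the states above as a Python enum and classifies them, under the assumption that `CANCELLED`, `DONE`, and `ERROR` are the states after which no further transition occurs. That grouping is inferred from the descriptions, not stated in this table; where available, the output-only `done` field on the job is the authoritative signal.

```python
from enum import Enum


class JobState(Enum):
    """Job states, named exactly as in the Enums table."""
    STATE_UNSPECIFIED = "STATE_UNSPECIFIED"
    PENDING = "PENDING"
    SETUP_DONE = "SETUP_DONE"
    RUNNING = "RUNNING"
    CANCEL_PENDING = "CANCEL_PENDING"
    CANCEL_STARTED = "CANCEL_STARTED"
    CANCELLED = "CANCELLED"
    DONE = "DONE"
    ERROR = "ERROR"
    ATTEMPT_FAILURE = "ATTEMPT_FAILURE"


# Assumption: these states are final. ATTEMPT_FAILURE is excluded because it
# applies to restartable jobs, which may run another attempt.
TERMINAL_STATES = frozenset({JobState.CANCELLED, JobState.DONE, JobState.ERROR})


def is_terminal(state_name: str) -> bool:
    """Return True if the named state is treated as final in this sketch."""
    return JobState(state_name) in TERMINAL_STATES
```

An unrecognized state name raises `ValueError` from the enum lookup, which makes new server-side states visible instead of silently treating them as non-terminal.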
Substate
The job substate.
Enums | |
---|---|
UNSPECIFIED |
The job substate is unknown. |
SUBMITTED |
The job is submitted to the agent. Applies to RUNNING state. |
QUEUED |
The job has been received and is awaiting execution (it might be waiting for a condition to be met). See the "details" field for the reason for the delay. Applies to RUNNING state. |
STALE_STATUS |
The agent-reported status is out of date, which can be caused by a loss of communication between the agent and Dataproc. If the agent does not send a timely update, the job will fail. Applies to RUNNING state. |
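Since every named substate applies to the `RUNNING` state, a status summary can combine the two fields only in that case. A minimal sketch, assuming a `JobStatus` object already decoded into a dict; the function name `describe_status` and the output format are hypothetical.

```python
# Substates that the table above describes as applying to the RUNNING state.
RUNNING_SUBSTATES = frozenset({"SUBMITTED", "QUEUED", "STALE_STATUS"})


def describe_status(status: dict) -> str:
    """Build a one-line summary from a decoded JobStatus dict."""
    state = status.get("state", "STATE_UNSPECIFIED")
    substate = status.get("substate", "UNSPECIFIED")

    # Show the substate only where the documentation says it is meaningful.
    if state == "RUNNING" and substate in RUNNING_SUBSTATES:
        label = f"{state}/{substate}"
    else:
        label = state

    details = status.get("details")
    return f"{label}: {details}" if details else label
```

For a `QUEUED` job, the `details` text carries the reason for the delay, so it is appended when present.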
YarnApplication
A YARN application created by a job. Application information is a subset of org.apache.hadoop.yarn.proto.YarnProtos.ApplicationReportProto.
Beta Feature: This report is available for testing purposes only. It may be changed before final release.
JSON representation |
---|
{
"name": string,
"state": enum ( |
Fields | |
---|---|
name |
Required. The application name. |
state |
Required. The application state. |
progress |
Required. The numerical progress of the application, from 1 to 100. |
trackingUrl |
Optional. The HTTP URL of the ApplicationMaster, HistoryServer, or TimelineServer that provides application-specific information. The URL uses the internal hostname, and requires a proxy server for resolution and, possibly, access. |
State
The application state, corresponding to YarnProtos.YarnApplicationStateProto.
Enums | |
---|---|
STATE_UNSPECIFIED |
Status is unspecified. |
NEW |
Status is NEW. |
NEW_SAVING |
Status is NEW_SAVING. |
SUBMITTED |
Status is SUBMITTED. |
ACCEPTED |
Status is ACCEPTED. |
RUNNING |
Status is RUNNING. |
FINISHED |
Status is FINISHED. |
FAILED |
Status is FAILED. |
KILLED |
Status is KILLED. |
DriverSchedulingConfig
Driver scheduling configuration.
JSON representation |
---|
{ "memoryMb": integer, "vcores": integer } |
Fields | |
---|---|
memoryMb |
Required. The amount of memory in MB the driver is requesting. |
vcores |
Required. The number of vCPUs the driver is requesting. |
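Both fields are required integers, so a small constructor can catch mistakes early. This is a minimal sketch; the function name `driver_scheduling_config` is hypothetical, and the requirement that both values be positive is an assumption that this section does not state.

```python
def driver_scheduling_config(memory_mb: int, vcores: int) -> dict:
    """Return a DriverSchedulingConfig dict with both required fields set."""
    for name, value in (("memoryMb", memory_mb), ("vcores", vcores)):
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int) or value <= 0:
            raise ValueError(f"{name} must be a positive integer, got {value!r}")
    return {"memoryMb": memory_mb, "vcores": vcores}
```

The result can be placed under the job's `driverSchedulingConfig` key, for example `driver_scheduling_config(2048, 2)`.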
Methods |
|
---|---|
|
Starts a job cancellation request. |
|
Deletes the job from the project. |
|
Gets the resource representation for a job in a project. |
|
Gets the access control policy for a resource. |
|
Lists regions/{region}/jobs in a project. |
|
Updates a job in a project. |
|
Sets the access control policy on the specified resource. |
|
Submits a job to a cluster. |
|
Submits job to a cluster. |
|
Returns permissions that a caller has on the specified resource. |