public sealed class PySparkJob : IMessage<PySparkJob>, IEquatable<PySparkJob>, IDeepCloneable<PySparkJob>, IBufferMessage, IMessage
Reference documentation and code samples for the Google Cloud Dataproc v1 API class PySparkJob.
A Dataproc job for running Apache PySpark applications on YARN.
Implements
IMessagePySparkJob, IEquatablePySparkJob, IDeepCloneablePySparkJob, IBufferMessage, IMessageNamespace
GoogleCloudGoogle.Cloud.DataprocV1Assembly
Google.Cloud.Dataproc.V1.dll
Constructors
PySparkJob()
public PySparkJob()
PySparkJob(PySparkJob)
public PySparkJob(PySparkJob other)
Parameter | |
---|---|
Name | Description |
other | PySparkJob |
Properties
ArchiveUris
public RepeatedField<string> ArchiveUris { get; }
Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
Property Value | |
---|---|
Type | Description |
RepeatedFieldstring |
Args
public RepeatedField<string> Args { get; }
Optional. The arguments to pass to the driver. Do not include arguments,
such as --conf
, that can be set as job properties, since a collision may
occur that causes an incorrect job submission.
Property Value | |
---|---|
Type | Description |
RepeatedFieldstring |
FileUris
public RepeatedField<string> FileUris { get; }
Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
Property Value | |
---|---|
Type | Description |
RepeatedFieldstring |
JarFileUris
public RepeatedField<string> JarFileUris { get; }
Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
Property Value | |
---|---|
Type | Description |
RepeatedFieldstring |
LoggingConfig
public LoggingConfig LoggingConfig { get; set; }
Optional. The runtime log config for job execution.
Property Value | |
---|---|
Type | Description |
LoggingConfig |
MainPythonFileUri
public string MainPythonFileUri { get; set; }
Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
Property Value | |
---|---|
Type | Description |
string |
Properties
public MapField<string, string> Properties { get; }
Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
Property Value | |
---|---|
Type | Description |
MapFieldstringstring |
PythonFileUris
public RepeatedField<string> PythonFileUris { get; }
Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
Property Value | |
---|---|
Type | Description |
RepeatedFieldstring |