Class SparkRJob

A Dataproc job for running Apache SparkR <https://spark.apache.org/docs/latest/sparkr.html>__ applications on YARN.

Attributes
NameDescription
strmain_r_file_uri
Required. The HCFS URI of the main R file to use as the driver. Must be a .R file.
Sequence[str]args
Optional. The arguments to pass to the driver. Do not include arguments, such as ``--conf``, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
Sequence[str]file_uris
Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
Sequence[str]archive_uris
Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
Sequence[google.cloud.dataproc_v1.types.SparkRJob.PropertiesEntry]properties
Optional. A mapping of property names to values, used to configure SparkR. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
google.cloud.dataproc_v1.types.LoggingConfiglogging_config
Optional. The runtime log config for job execution.

Inheritance

builtins.object > proto.message.Message > SparkRJob

Classes

PropertiesEntry

PropertiesEntry(mapping=None, *, ignore_unknown_fields=False, **kwargs)

API documentation for dataproc_v1.types.SparkRJob.PropertiesEntry class.