Cloud Dataproc V1 API - Class Google::Cloud::Dataproc::V1::PySparkJob (v1.0.0)

Reference documentation and code samples for the Cloud Dataproc V1 API class Google::Cloud::Dataproc::V1::PySparkJob.

A Dataproc job for running Apache PySpark applications on YARN.

Inherits

  • Object

Extended By

  • Google::Protobuf::MessageExts::ClassMethods

Includes

  • Google::Protobuf::MessageExts

Methods

#archive_uris

def archive_uris() -> ::Array<::String>
Returns
  • (::Array<::String>) — Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.

#archive_uris=

def archive_uris=(value) -> ::Array<::String>
Parameter
  • value (::Array<::String>) — Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.
Returns
  • (::Array<::String>) — Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.

#args

def args() -> ::Array<::String>
Returns
  • (::Array<::String>) — Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.

#args=

def args=(value) -> ::Array<::String>
Parameter
  • value (::Array<::String>) — Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.
Returns
  • (::Array<::String>) — Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission.

#file_uris

def file_uris() -> ::Array<::String>
Returns
  • (::Array<::String>) — Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.

#file_uris=

def file_uris=(value) -> ::Array<::String>
Parameter
  • value (::Array<::String>) — Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.
Returns
  • (::Array<::String>) — Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks.

#jar_file_uris

def jar_file_uris() -> ::Array<::String>
Returns
  • (::Array<::String>) — Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.

#jar_file_uris=

def jar_file_uris=(value) -> ::Array<::String>
Parameter
  • value (::Array<::String>) — Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.
Returns
  • (::Array<::String>) — Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks.

#logging_config

def logging_config() -> ::Google::Cloud::Dataproc::V1::LoggingConfig
Returns

#logging_config=

def logging_config=(value) -> ::Google::Cloud::Dataproc::V1::LoggingConfig
Parameter
Returns

#main_python_file_uri

def main_python_file_uri() -> ::String
Returns
  • (::String) — Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.

#main_python_file_uri=

def main_python_file_uri=(value) -> ::String
Parameter
  • value (::String) — Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.
Returns
  • (::String) — Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file.

#properties

def properties() -> ::Google::Protobuf::Map{::String => ::String}
Returns
  • (::Google::Protobuf::Map{::String => ::String}) — Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.

#properties=

def properties=(value) -> ::Google::Protobuf::Map{::String => ::String}
Parameter
  • value (::Google::Protobuf::Map{::String => ::String}) — Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.
Returns
  • (::Google::Protobuf::Map{::String => ::String}) — Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API may be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code.

#python_file_uris

def python_file_uris() -> ::Array<::String>
Returns
  • (::Array<::String>) — Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.

#python_file_uris=

def python_file_uris=(value) -> ::Array<::String>
Parameter
  • value (::Array<::String>) — Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
Returns
  • (::Array<::String>) — Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.