Classificar o Cloud Storage
Mantenha tudo organizado com as coleções
Salve e categorize o conteúdo com base nas suas preferências.
Um exemplo de job do PySpark para classificar o conteúdo de um arquivo de texto no Cloud Storage
Exemplo de código
Exceto em caso de indicação contrária, o conteúdo desta página é licenciado de acordo com a Licença de atribuição 4.0 do Creative Commons, e as amostras de código são licenciadas de acordo com a Licença Apache 2.0. Para mais detalhes, consulte as políticas do site do Google Developers. Java é uma marca registrada da Oracle e/ou afiliadas.
[[["Fácil de entender","easyToUnderstand","thumb-up"],["Meu problema foi resolvido","solvedMyProblem","thumb-up"],["Outro","otherUp","thumb-up"]],[["Difícil de entender","hardToUnderstand","thumb-down"],["Informações incorretas ou exemplo de código","incorrectInformationOrSampleCode","thumb-down"],["Não contém as informações/amostras de que eu preciso","missingTheInformationSamplesINeed","thumb-down"],["Problema na tradução","translationIssue","thumb-down"],["Outro","otherDown","thumb-down"]],[],[[["\u003cp\u003eThis webpage provides an example PySpark job for sorting text file contents stored in Cloud Storage.\u003c/p\u003e\n"],["\u003cp\u003eThe code sample is written in Python and utilizes the \u003ccode\u003epyspark\u003c/code\u003e library for Spark operations.\u003c/p\u003e\n"],["\u003cp\u003eIt guides users to follow Python setup instructions from the Dataproc quickstart.\u003c/p\u003e\n"],["\u003cp\u003eAuthentication to Dataproc requires setting up Application Default Credentials.\u003c/p\u003e\n"],["\u003cp\u003eUsers can explore additional code samples for other Google Cloud products through the Google Cloud sample browser.\u003c/p\u003e\n"]]],[],null,["An example PySpark job to sort the contents of a text file in Cloud Storage.\n\nCode sample \n\nPython\n\n\nBefore trying this sample, follow the Python setup instructions in the\n[Dataproc quickstart using\nclient libraries](/dataproc/docs/quickstarts/quickstart-lib).\n\n\nFor more information, see the\n[Dataproc Python API\nreference documentation](/python/docs/reference/dataproc/latest).\n\n\nTo authenticate to Dataproc, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).\n\n import pyspark\n\n sc = pyspark.SparkContext()\n rdd = sc.textFile(\"gs://path-to-your-GCS-file\")\n print(sorted(rdd.collect()))\n\nWhat's next\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=dataproc)."]]