Ordinamento di Cloud Storage
Mantieni tutto organizzato con le raccolte
Salva e classifica i contenuti in base alle tue preferenze.
Un esempio di job PySpark per ordinare i contenuti di un file di testo in Cloud Storage.
Esempio di codice
Salvo quando diversamente specificato, i contenuti di questa pagina sono concessi in base alla licenza Creative Commons Attribution 4.0, mentre gli esempi di codice sono concessi in base alla licenza Apache 2.0. Per ulteriori dettagli, consulta le norme del sito di Google Developers. Java è un marchio registrato di Oracle e/o delle sue consociate.
[[["Facile da capire","easyToUnderstand","thumb-up"],["Il problema è stato risolto","solvedMyProblem","thumb-up"],["Altra","otherUp","thumb-up"]],[["Difficile da capire","hardToUnderstand","thumb-down"],["Informazioni o codice di esempio errati","incorrectInformationOrSampleCode","thumb-down"],["Mancano le informazioni o gli esempi di cui ho bisogno","missingTheInformationSamplesINeed","thumb-down"],["Problema di traduzione","translationIssue","thumb-down"],["Altra","otherDown","thumb-down"]],[],[[["\u003cp\u003eThis webpage provides an example PySpark job for sorting text file contents stored in Cloud Storage.\u003c/p\u003e\n"],["\u003cp\u003eThe code sample is written in Python and utilizes the \u003ccode\u003epyspark\u003c/code\u003e library for Spark operations.\u003c/p\u003e\n"],["\u003cp\u003eIt guides users to follow Python setup instructions from the Dataproc quickstart.\u003c/p\u003e\n"],["\u003cp\u003eAuthentication to Dataproc requires setting up Application Default Credentials.\u003c/p\u003e\n"],["\u003cp\u003eUsers can explore additional code samples for other Google Cloud products through the Google Cloud sample browser.\u003c/p\u003e\n"]]],[],null,["An example PySpark job to sort the contents of a text file in Cloud Storage.\n\nCode sample \n\nPython\n\n\nBefore trying this sample, follow the Python setup instructions in the\n[Dataproc quickstart using\nclient libraries](/dataproc/docs/quickstarts/quickstart-lib).\n\n\nFor more information, see the\n[Dataproc Python API\nreference documentation](/python/docs/reference/dataproc/latest).\n\n\nTo authenticate to Dataproc, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).\n\n import pyspark\n\n sc = pyspark.SparkContext()\n rdd = sc.textFile(\"gs://path-to-your-GCS-file\")\n print(sorted(rdd.collect()))\n\nWhat's next\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=dataproc)."]]