Cloud Storage 정렬
컬렉션을 사용해 정리하기
내 환경설정을 기준으로 콘텐츠를 저장하고 분류하세요.
Cloud Storage에서 텍스트 파일의 콘텐츠를 정렬하는 PySpark 작업 예시입니다.
코드 샘플
달리 명시되지 않는 한 이 페이지의 콘텐츠에는 Creative Commons Attribution 4.0 라이선스에 따라 라이선스가 부여되며, 코드 샘플에는 Apache 2.0 라이선스에 따라 라이선스가 부여됩니다. 자세한 내용은 Google Developers 사이트 정책을 참조하세요. 자바는 Oracle 및/또는 Oracle 계열사의 등록 상표입니다.
[[["이해하기 쉬움","easyToUnderstand","thumb-up"],["문제가 해결됨","solvedMyProblem","thumb-up"],["기타","otherUp","thumb-up"]],[["이해하기 어려움","hardToUnderstand","thumb-down"],["잘못된 정보 또는 샘플 코드","incorrectInformationOrSampleCode","thumb-down"],["필요한 정보/샘플이 없음","missingTheInformationSamplesINeed","thumb-down"],["번역 문제","translationIssue","thumb-down"],["기타","otherDown","thumb-down"]],[],[[["\u003cp\u003eThis webpage provides an example PySpark job for sorting text file contents stored in Cloud Storage.\u003c/p\u003e\n"],["\u003cp\u003eThe code sample is written in Python and utilizes the \u003ccode\u003epyspark\u003c/code\u003e library for Spark operations.\u003c/p\u003e\n"],["\u003cp\u003eIt guides users to follow Python setup instructions from the Dataproc quickstart.\u003c/p\u003e\n"],["\u003cp\u003eAuthentication to Dataproc requires setting up Application Default Credentials.\u003c/p\u003e\n"],["\u003cp\u003eUsers can explore additional code samples for other Google Cloud products through the Google Cloud sample browser.\u003c/p\u003e\n"]]],[],null,["An example PySpark job to sort the contents of a text file in Cloud Storage.\n\nCode sample \n\nPython\n\n\nBefore trying this sample, follow the Python setup instructions in the\n[Dataproc quickstart using\nclient libraries](/dataproc/docs/quickstarts/quickstart-lib).\n\n\nFor more information, see the\n[Dataproc Python API\nreference documentation](/python/docs/reference/dataproc/latest).\n\n\nTo authenticate to Dataproc, set up Application Default Credentials.\nFor more information, see\n\n[Set up authentication for a local development environment](/docs/authentication/set-up-adc-local-dev-environment).\n\n import pyspark\n\n sc = pyspark.SparkContext()\n rdd = sc.textFile(\"gs://path-to-your-GCS-file\")\n print(sorted(rdd.collect()))\n\nWhat's next\n\n\nTo search and filter code samples for other Google Cloud products, see the\n[Google Cloud sample browser](/docs/samples?product=dataproc)."]]