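Detect labels in an image by using the command line

This page shows how to send three feature detection and annotation requests to the Vision API by using the REST interface and the curl command.

The Vision API lets you integrate Vision detection technologies into your own applications. You send the Vision API image data and the feature types you want, and it returns a corresponding response based on the image attributes you are interested in. For more information about the feature types offered, see the list of all Vision API features.

Before you begin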

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

Install the Google Cloud CLI.

To initialize the gcloud CLI, run the following command:

gcloud init

Create or select a Google Cloud project.

Create a Google Cloud project:
```
gcloud projects create PROJECT_ID
```
Replace PROJECT_ID with a name for the Google Cloud project you are creating.
Select the Google Cloud project that you created:
```
gcloud config set project PROJECT_ID
```
Replace PROJECT_ID with your Google Cloud project name.

Verify that billing is enabled for your Google Cloud project.

Enable the Vision API:

gcloud services enable vision.googleapis.com

Grant roles to your user account. Run the following command once for each of the following IAM roles: roles/storage.objectViewer

gcloud projects add-iam-policy-binding PROJECT_ID --member="user:EMAIL_ADDRESS" --role=ROLE

Replace PROJECT_ID with your project ID.
Replace EMAIL_ADDRESS with your email address.
Replace ROLE with each individual role.

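Make an image annotation request

After you complete the Before you begin steps, you can use the Vision API to annotate an image file.

In this example, you use curl to send a request to the Vision API that uses the following image.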

Cloud Storage URI:

gs://cloud-samples-data/vision/using_curl/shanghai.jpeg

HTTPS URL:

https://console.cloud.google.com/storage/browser/cloud-samples-data/vision/using_curl/shanghai.jpeg

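Create the request JSON

The following request.json file shows how to request three images:annotate features and limit the results in the response.

Create the JSON request file with the following text, and save it as a request.json plain text file in your working directory.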

request.json

{
  "requests": [
    {
      "image": {
        "source": {
          "imageUri": "gs://cloud-samples-data/vision/using_curl/shanghai.jpeg"
        }
      },
      "features": [
        {
          "type": "LABEL_DETECTION",
          "maxResults": 3
        },
        {
          "type": "OBJECT_LOCALIZATION",
          "maxResults": 1
        },
        {
          "type": "TEXT_DETECTION",
          "maxResults": 1,
          "model": "builtin/latest"
        }
      ]
    }
  ]
}

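Field value details

image.source.imageUri - The URI of the source image. This example uses a Cloud Storage URI (gs://...), which refers to an image stored in a Cloud Storage bucket. imageUri also accepts a publicly accessible HTTP or HTTPS URL. To pass a base64-encoded string representation of the image instead, use image.content.
features - A list of objects that each specify a feature type. You can request multiple feature types for a single image.

type - An enum value that specifies the feature.
maxResults (optional) - A limit on the number of results returned.
model (optional) - Where applicable, you can select the model by specifying builtin/stable (the default if not set) or builtin/latest. For a list of recently updated models, see the release notes.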
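
As a rough illustration of the image.content alternative described above, the following Python sketch builds a request body that embeds the image bytes as base64 instead of referencing a URI. The function name build_annotate_request and the stand-in bytes are assumptions made for this example and are not part of the Vision API.

```python
import base64
import json


def build_annotate_request(image_bytes, features):
    """Build an images:annotate body that embeds the image as base64.

    Hypothetical helper for illustration; only the JSON field names
    (requests, image.content, features) come from the request format above.
    """
    return {
        "requests": [
            {
                "image": {
                    # image.content carries the base64-encoded image bytes.
                    "content": base64.b64encode(image_bytes).decode("ascii")
                },
                "features": features,
            }
        ]
    }


if __name__ == "__main__":
    # Stand-in bytes; in practice you would read them from a local image file.
    body = build_annotate_request(
        b"not-a-real-image",
        [{"type": "LABEL_DETECTION", "maxResults": 3}],
    )
    print(json.dumps(body, indent=2))
```

Writing the printed JSON to request.json would give a file with the same shape as the example above, with image.content in place of image.source.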

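Send the request

Use curl with the body content from request.json to send the request to the Vision API. Enter the following on the command line, replacing PROJECT_ID with your project ID: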

curl -X POST \
    -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    -H "x-goog-user-project: PROJECT_ID" \
    -H "Content-Type: application/json; charset=utf-8" \
    https://vision.googleapis.com/v1/images:annotate -d @request.json
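
The same request can also be sketched in Python with only the standard library. This is a minimal illustration and not an official client: the helper names build_http_request and annotate are made up for this example, and it assumes the gcloud CLI is installed and authenticated so that gcloud auth print-access-token returns a token.

```python
import json
import subprocess
import urllib.request

ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_http_request(access_token, project_id, body):
    """Mirror the curl command: POST the JSON body with the same three headers."""
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {access_token}",
            "x-goog-user-project": project_id,
            "Content-Type": "application/json; charset=utf-8",
        },
    )


def annotate(project_id, request_path="request.json"):
    """Send request.json to the Vision API and return the parsed response."""
    # Assumption: the gcloud CLI is on PATH and already authenticated.
    token = subprocess.run(
        ["gcloud", "auth", "print-access-token"],
        check=True,
        capture_output=True,
        text=True,
    ).stdout.strip()
    with open(request_path, encoding="utf-8") as f:
        body = json.load(f)
    with urllib.request.urlopen(build_http_request(token, project_id, body)) as resp:
        return json.load(resp)
```

Calling annotate("PROJECT_ID") with your own project ID would perform the network call. build_http_request is kept separate so the request can be inspected without sending it.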

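Interpret the response

You should see a JSON response similar to the one shown below.

The request JSON body specified maxResults for each annotation type. Consequently, the response JSON contains the following:

Three labelAnnotations results
One textAnnotations result (shortened for clarity)
One localizedObjectAnnotations result

Response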

{
  "responses": [
    {
      "labelAnnotations": [
        {
          "mid": "/m/09g5pq",
          "description": "People",
          "score": 0.9504782,
          "topicality": 0.9504782
        },
        {
          "mid": "/m/01c8br",
          "description": "Street",
          "score": 0.8911568,
          "topicality": 0.8911568
        },
        {
          "mid": "/m/079bkr",
          "description": "Mode of transport",
          "score": 0.89089024,
          "topicality": 0.89089024
        }
      ],
      "textAnnotations": [
        {
          "locale": "zh",
          "description": "牛牛面馆\n",
          "boundingPoly": {
            "vertices": [
              {
                "x": 159,
                "y": 212
              },
              {
                "x": 947,
                "y": 212
              },
              {
                "x": 947,
                "y": 354
              },
              {
                "x": 159,
                "y": 354
              }
            ]
          }
        },
        ...
      ],
      "fullTextAnnotation": {
        "pages": [
          {
            ...
                "paragraphs": [
                  {
                    ...
                    "words": [
                      {
                        ...
                        "symbols": [
                          {
                            ...
                ],
                "blockType": "TEXT"
              }
            ]
          }
        ],
        "text": "牛牛面馆\n"
      },
      "localizedObjectAnnotations": [
        {
          "mid": "/m/01g317",
          "name": "Person",
          "score": 0.94413143,
          "boundingPoly": {
            "normalizedVertices": [
              {
                "x": 0.26063988,
                "y": 0.46869153
              },
              {
                "x": 0.40736017,
                "y": 0.46869153
              },
              {
                "x": 0.40736017,
                "y": 0.8957791
              },
              {
                "x": 0.26063988,
                "y": 0.8957791
              }
            ]
          }
        }
      ]
    }
  ]
}
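
A response like the one above can be condensed into short per-feature summaries. The following Python sketch shows one way to do that. The function name summarize is hypothetical, and it expects one parsed element of responses from a complete response, because the abbreviated JSON above contains ... placeholders and is not valid JSON as printed. Scores are rounded to three decimals, so the last digit can differ from a truncated value.

```python
def summarize(response):
    """Return summary lines for one element of the "responses" list."""
    lines = []
    for label in response.get("labelAnnotations", []):
        lines.append(
            f"Description: '{label['description']}', Score: {label['score']:.3f}"
        )

    texts = response.get("textAnnotations", [])
    if texts:
        first = texts[0]
        # Assumption: a coordinate equal to 0 may be omitted from the JSON,
        # so a missing x or y is treated as 0.
        vertices = ", ".join(
            f"(x: {v.get('x', 0)}, y: {v.get('y', 0)})"
            for v in first["boundingPoly"]["vertices"]
        )
        # repr() keeps the trailing newline visible as \n.
        lines.append(f"Text: {first['description']!r}")
        lines.append(f"Vertices: {vertices}")

    for obj in response.get("localizedObjectAnnotations", []):
        lines.append(f"Name: '{obj['name']}', Score: {obj['score']:.3f}")
    return lines
```

Each feature is read with .get and a default of an empty list, so the sketch also works when a response contains only some of the three annotation types.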

Label detection results

Description: 'People', Score: 0.950
Description: 'Street', Score: 0.891
Description: 'Mode of transport', Score: 0.890

Text detection results

Text: 牛牛面馆\n
Vertices: (x: 159, y: 212), (x: 947, y: 212), (x: 947, y: 354), (x: 159, y: 354)

Object detection results

Name: 'Person', Score: 0.944
Normalized vertices: (x: 0.260, y: 0.468), (x: 0.407, y: 0.468), (x: 0.407, y: 0.895), (x: 0.260, y: 0.895)
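
The object detection vertices are normalized to the 0 to 1 range, whereas the text detection vertices are in pixels. To draw an object's bounding box on the image, you would scale the normalized values by the image size. The sketch below illustrates that arithmetic. The function name to_pixels is hypothetical, and the width and height are placeholder values, because this page does not state the dimensions of the sample image.

```python
def to_pixels(normalized_vertices, width, height):
    """Scale normalizedVertices (0 to 1) to integer pixel coordinates."""
    return [
        # Assumption: a missing x or y means 0, as with omitted zero values.
        (round(v.get("x", 0.0) * width), round(v.get("y", 0.0) * height))
        for v in normalized_vertices
    ]


if __name__ == "__main__":
    # Placeholder dimensions for illustration, not the real image size.
    box = [{"x": 0.25, "y": 0.5}, {"x": 0.75, "y": 0.5}]
    print(to_pixels(box, width=1000, height=800))
```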

Congratulations! You've sent your first request to the Vision API.

Clean up

To avoid incurring charges to your Google Cloud account for the resources used on this page, delete the Google Cloud project that contains the resources.

Optional: Revoke credentials from the gcloud CLI.

gcloud auth revoke

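What's next

See the list of all feature types and their uses.
Get started with the Vision API in your language of choice by using a Vision API client library.
Use the how-to guides to learn more about specific features, see example annotations, and get annotations for an individual file or image.
Learn about batch image and file (PDF/TIFF/GIF) annotation.
Browse the complete list of client library code samples.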