Halaman ini diterjemahkan oleh Cloud Translation API.

Mendeteksi label pada gambar menggunakan command line

Halaman ini menunjukkan cara mengirim tiga permintaan deteksi fitur dan anotasi ke Vision API menggunakan antarmuka REST dan perintah curl.

Vision API memungkinkan integrasi teknologi pengenalan visi Google yang mudah ke dalam aplikasi developer. Anda dapat mengirim data gambar dan jenis fitur yang diinginkan ke Vision API, yang kemudian menampilkan respons yang sesuai berdasarkan atribut gambar yang Anda minati. Untuk mengetahui informasi selengkapnya tentang jenis fitur yang ditawarkan, lihat Daftar semua fitur Vision API.

Sebelum memulai

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

Install the Google Cloud CLI.

To initialize the gcloud CLI, run the following command:

gcloud init

Create or select a Google Cloud project.

Create a Google Cloud project:
```
gcloud projects create PROJECT_ID
```
Replace PROJECT_ID with a name for the Google Cloud project you are creating.
Select the Google Cloud project that you created:
```
gcloud config set project PROJECT_ID
```
Replace PROJECT_ID with your Google Cloud project name.

Make sure that billing is enabled for your Google Cloud project.

Enable the Vision API:

gcloud services enable vision.googleapis.com

Grant roles to your user account. Run the following command once for each of the following IAM roles: roles/storage.objectViewer

gcloud projects add-iam-policy-binding PROJECT_ID --member="user:USER_IDENTIFIER" --role=ROLE

Replace PROJECT_ID with your project ID.
Replace USER_IDENTIFIER with the identifier for your user account. For example, user:myemail@example.com.
Replace ROLE with each individual role.

Install the Google Cloud CLI.

To initialize the gcloud CLI, run the following command:

gcloud init

Create or select a Google Cloud project.

Create a Google Cloud project:
```
gcloud projects create PROJECT_ID
```
Replace PROJECT_ID with a name for the Google Cloud project you are creating.
Select the Google Cloud project that you created:
```
gcloud config set project PROJECT_ID
```
Replace PROJECT_ID with your Google Cloud project name.

Make sure that billing is enabled for your Google Cloud project.

Enable the Vision API:

gcloud services enable vision.googleapis.com

Grant roles to your user account. Run the following command once for each of the following IAM roles: roles/storage.objectViewer

gcloud projects add-iam-policy-binding PROJECT_ID --member="user:USER_IDENTIFIER" --role=ROLE

Replace PROJECT_ID with your project ID.
Replace USER_IDENTIFIER with the identifier for your user account. For example, user:myemail@example.com.
Replace ROLE with each individual role.

Membuat permintaan anotasi gambar

Setelah menyelesaikan langkah Sebelum memulai, Anda dapat menggunakan Vision API untuk menganotasi file gambar.

Dalam contoh ini, Anda menggunakan curl untuk mengirim permintaan ke Vision API menggunakan gambar berikut:

Cloud Storage URI:

gs://cloud-samples-data/vision/using_curl/shanghai.jpeg

HTTPS URL:

https://console.cloud.google.com/storage/browser/cloud-samples-data/vision/using_curl/shanghai.jpeg

Gambar jalan Shanghai. — *Kredit gambar*: Steve Long di Unsplash.

Membuat JSON permintaan

File request.json berikut menunjukkan cara meminta tiga fitur images:annotate dan membatasi hasil dalam respons.

Buat file permintaan JSON dengan teks berikut, dan simpan sebagai file teks biasa request.json di direktori kerja Anda:

request.json

{
  "requests": [
    {
      "image": {
        "source": {
          "imageUri": "gs://cloud-samples-data/vision/using_curl/shanghai.jpeg"
        }
      },
      "features": [
        {
          "type": "LABEL_DETECTION",
          "maxResults": 3
        },
        {
          "type": "OBJECT_LOCALIZATION",
          "maxResults": 1
        },
        {
          "type": "TEXT_DETECTION",
          "maxResults": 1,
          "model": "builtin/latest"
        }
      ]
    }
  ]
}

Detail nilai kolom

image.source.gcsImageUri - Menunjukkan image yang disimpan di bucket Cloud Storage. Anda mengubah permintaan ini menjadi image.source.imageUri untuk URI yang tersedia secara publik, atau image.content untuk meneruskan representasi string berenkode base64 untuk gambar.
features - Objek yang mewakili jenis fitur tertentu. Anda dapat meminta beberapa jenis fitur untuk satu image.

type - Nilai enum yang menentukan fitur.
maxResults (opsional) - Nilai yang membatasi pada hasil yang ditampilkan.
model (opsional) - Jika berlaku, Anda dapat menentukan builtin/stable (default jika tidak disetel) atau builtin/latest untuk memilih model Anda. Lihat topik Catatan rilis untuk daftar model yang baru saja diperbarui.

Kirim permintaan

Anda menggunakan curl dan konten isi dari request.json untuk mengirim permintaan ke Vision API. Masukkan baris berikut pada command line Anda:

curl -X POST \
    -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    -H "x-goog-user-project: PROJECT_ID" \
    -H "Content-Type: application/json; charset=utf-8" \
    https://vision.googleapis.com/v1/images:annotate -d @request.json

Menafsirkan respons

Anda akan melihat respons JSON yang mirip dengan yang di bawah ini.

Isi JSON permintaan menentukan maxResults untuk setiap jenis anotasi. Akibatnya, Anda akan melihat hal berikut dalam JSON respons:

tiga hasil labelAnnotations
satu textAnnotations hasil (disingkat agar lebih jelas)
satu localizedObjectAnnotations hasil

Tanggapan

{
  "responses": [
    {
      "labelAnnotations": [
        {
          "mid": "/m/09g5pq",
          "description": "People",
          "score": 0.9504782,
          "topicality": 0.9504782
        },
        {
          "mid": "/m/01c8br",
          "description": "Street",
          "score": 0.8911568,
          "topicality": 0.8911568
        },
        {
          "mid": "/m/079bkr",
          "description": "Mode of transport",
          "score": 0.89089024,
          "topicality": 0.89089024
        }
      ],
      "textAnnotations": [
        {
          "locale": "zh",
          "description": "牛牛面馆\n",
          "boundingPoly": {
            "vertices": [
              {
                "x": 159,
                "y": 212
              },
              {
                "x": 947,
                "y": 212
              },
              {
                "x": 947,
                "y": 354
              },
              {
                "x": 159,
                "y": 354
              }
            ]
          }
        },
        ...
      ],
      "fullTextAnnotation": {
        "pages": [
          {
            ...
                "paragraphs": [
                  {
                    ...
                    "words": [
                      {
                        ...
                        "symbols": [
                          {
                            ...
                ],
                "blockType": "TEXT"
              }
            ]
          }
        ],
        "text": "牛牛面馆\n"
      },
      "localizedObjectAnnotations": [
        {
          "mid": "/m/01g317",
          "name": "Person",
          "score": 0.94413143,
          "boundingPoly": {
            "normalizedVertices": [
              {
                "x": 0.26063988,
                "y": 0.46869153
              },
              {
                "x": 0.40736017,
                "y": 0.46869153
              },
              {
                "x": 0.40736017,
                "y": 0.8957791
              },
              {
                "x": 0.26063988,
                "y": 0.8957791
              }
            ]
          }
        }
      ]
    }
  ]
}

Hasil deteksi label

deskripsi: "Orang", skor: 0,950
deskripsi: "Jalan", skor: 0,891
deskripsi: "Moda transportasi", skor: 0,890

Hasil deteksi teks

teks: 牛牛面馆\n
verteks: (x: 159, y: 212), (x: 947, y: 212), (x: 947, y: 354), (x: 159, y: 354 )

Hasil deteksi objek

nama: "Orang", skor: 0,944
verteks yang dinormalisasi: (x: 0,260, y: 0,468), (x: 0,407, y: 0,468), (x: 0,407, y: 0,895), (x: 0,260, y: 0,895)

Selamat! Anda telah mengirimkan permintaan pertama ke Vision API.

Pembersihan

Agar tidak menimbulkan biaya pada akun Google Cloud Anda untuk resource yang digunakan di halaman ini, hapus project Google Cloud yang berisi resource tersebut.

Optional: Revoke credentials from the gcloud CLI.

gcloud auth revoke

Langkah selanjutnya

Lihat daftar semua jenis fitur dan penggunaannya.
Mulai Vision API dalam bahasa pilihan Anda dengan menggunakan Library Klien Vision API.
Gunakan Panduan cara kerja untuk mempelajari lebih lanjut fitur tertentu, melihat contoh anotasi, dan mendapatkan anotasi untuk setiap file atau gambar.
Pelajari anotasi gambar dan file (PDF/TIFF/GIF) dalam batch.
Lihat daftar lengkap contoh kode library klien.