このページは Cloud Translation API によって翻訳されました。

Vertex AI RAG Engine で Weaviate データベースを使用する

このページでは、RAG Engine コーパスを Weaviate データベースに接続する方法について説明します。

このノートブック Weaviate を使用した RAG Engine を使用して、手順に沿って操作することもできます。

オープンソースデータベースである Weaviate データベースインスタンスと RAG Engine を使用して、インデックスを作成し、ベクトルベースの類似性検索を実行できます。類似性検索は、検索しているテキストに類似するテキストを見つける方法です。この場合、エンベディングモデルを使用する必要があります。エンベディングモデルは、比較対象のテキストごとにベクトルデータを生成します。類似性検索は、グラウンディングのためのセマンティックコンテキストを取得し、LLM から最も正確なコンテンツを返すために使用されます。

RAG Engine を使用すると、フルマネージドベクトルデータベースインスタンスを引き続き使用できます（プロビジョニングはお客様の責任で行っていただく必要があります）。RAG Engine は、ストレージ、インデックス管理、検索にベクトルデータベースを使用します。

考慮事項

Weaviate データベースを使用する前に、次の点を考慮してください。

Weaviate データベースインスタンスとコレクションを作成、構成、デプロイする必要があります。Weaviate コレクションを作成するの手順に沿って、スキーマに基づいてコレクションを設定します。
Weaviate API キーを指定する必要があります。これにより、RAG Engine が Weaviate データベースとやり取りできるようになります。RAG Engine は、API キーベースの AuthN と AuthZ をサポートしています。これにより、Weaviate データベースに接続し、HTTPS 接続をサポートします。
RAG Engine は、Weaviate API キーの保存と管理を行いません。代わりに、次のことを行う必要があります。
1. キーを Google Cloud Secret Manager に保存します。
2. プロジェクトのサービスアカウントにシークレットへのアクセス権を付与します。
3. RAG Engine に、シークレットのリソース名へのアクセス権を付与します。
4. Weaviate データベースを操作すると、RAG Engine はサービスアカウントを使用してシークレットリソースにアクセスします。
RAG Engine コーパスと Weaviate コレクションには 1 対 1 のマッピングがあります。RAG ファイルは Weaviate データベースコレクションに保存されます。CreateRagCorpus API または UpdateRagCorpus API が呼び出されると、RAG コーパスがデータベースコレクションに関連付けられます。
密エンベディングベースのセマンティック検索に加えて、Weaviate データベースを介して RAG Engine でハイブリッド検索を行うこともできます。ハイブリッド検索では、密ベクトルとスパースベクトルの類似性間の重みを調整することもできます。

Weaviate データベースをプロビジョニングする

RAG Engine で Weaviate データベースを使用する前に、次のことを行う必要があります。

Weaviate データベースインスタンスを構成してデプロイします。
HTTPS エンドポイントを準備します。
Weaviate コレクションを作成します。
API キー、AuthN、AuthZ を使用して、Weaviate をプロビジョニングします。
RAG Engine サービスアカウントをプロビジョニングします。

Weaviate データベースインスタンスを構成してデプロイする

Weaviate 公式ガイドのクイックスタートに沿って操作する必要があります。ただし、Google Cloud Marketplace ガイドを使用することもできます。これは省略可能です。

Weaviate エンドポイントがアクセス可能で、プロジェクトで構成してデプロイできる限り、Weaviate インスタンスはどこにでも設定できます。これで、Weaviate データベースインスタンスを完全に管理できるようになります。

RAG Engine は Weaviate データベースインスタンスのライフサイクルのどのステージにも関与しないため、Weaviate データベースにデータを保存して検索できるように、RAG Engine に権限を付与するのはお客様の責任となります。また、データベース内のデータが RAG Engine で使用できるようにする責任もお客様にあります。たとえば、データを変更した場合、その変更が原因で予期しない動作が発生しても、RAG Engine 側に責任はありません。

HTTPS エンドポイントを準備する

Weaviate のプロビジョニング中に、HTTPS エンドポイントを作成してください。HTTP 接続はサポートされていますが、RAG Engine と Weaviate データベースのトラフィックに HTTPS 接続を使用することをおすすめします。

Weaviate コレクションを作成する

RAG Engine コーパスと Weaviate コレクションには 1 対 1 のマッピングがあるため、コレクションを RAG Engine コーパスに関連付ける前に、Weaviate データベースにコレクションを作成する必要があります。この 1 回限りの関連付けは、CreateRagCorpus API または UpdateRagCorpus API を呼び出すときに行われます。

Weaviate でコレクションを作成する場合は、次のスキーマを使用する必要があります。

プロパティ名	データ型
`fileId`	`text`
`corpusId`	`text`
`chunkId`	`text`
`chunkDataType`	`text`
`chunkData`	`text`
`fileOriginalUri`	`text`

API キーを使用し、`AuthN` と `AuthZ` を使用して Weaviate をプロビジョニングする。

Weaviate API キーのプロビジョニングは、次の手順で行います。

Weaviate API キーを作成します。
Weaviate API キーを使用して Weaviate を構成します。
Weaviate API キーを Secret Manager に保存します。

API キーを作成する

RAG Engine は、認証と認可に API キーを使用して Weaviate データベースインスタンスに接続します。Weaviate データベースインスタンスで API キーベースの認証を構成するには、Weaviate の認証に関する公式ガイドに沿って操作する必要があります。

Weaviate API キーの作成で、RAG Engine から取得した ID 情報に関連付ける ID 情報が必要になる場合は、最初のコーパスを作成し、RAG Engine サービスアカウントを ID として使用する必要があります。

API キーを Secret Manager に保存する

API キーには、個人を特定できる機密情報（SPII）が保持されます。この情報には法的要件が適用されます。SPII データが漏洩または不正使用された場合、個人に重大なリスクや損害が及ぶ可能性があります。RAG Engine を使用する際の個人に対するリスクを最小限に抑えるため、API キーを保存して管理しないでください。また、暗号化されていない API キーを共有しないようにします。

SPII を保護するには、次の操作を行います。

API キーを Secret Manager に保存します。
RAG Engine サービスアカウントにシークレットへの権限を付与し、シークレットリソースレベルでアクセス制御を管理します。
1. プロジェクトの権限に移動します。
2. [Google 提供のロール付与を含める] オプションを有効にします。
3. サービスアカウントを見つけます。形式は次のとおりです。
  service-{project number}@gcp-sa-vertex-rag.iam.gserviceaccount.com
4. サービスアカウントのプリンシパルを編集します。
5. サービスアカウントに Secret Manager のシークレットアクセサー ロールを追加します。
RAG コーパスの作成または更新時に、シークレットリソース名を RAG Engine に渡し、シークレットリソース名を保存します。

Weaviate データベースインスタンスに API リクエストを送信すると、RAG Engine は各サービスアカウントを使用して、プロジェクトの Secret Manager のシークレットリソースに対応する API キーを読み取ります。

RAG Engine サービスアカウントをプロビジョニングする

プロジェクトで最初のリソースを作成すると、RAG Engine は専用のサービスアカウントを作成します。サービスアカウントは、プロジェクトの [IAM] ページで確認できます。サービスアカウントの形式は次のとおりです。

service-{project number}@gcp-sa-vertex-rag.iam.gserviceaccount.com

例: service-123456789@gcp-sa-vertex-rag.iam.gserviceaccount.com

Weaviate データベースと統合する際に、サービスアカウントは次のシナリオで使用されます。

サービスアカウントを使用して、認証用の Weaviate API キーを生成できます。API キーの生成にユーザー情報が必要ない場合もあります。その場合、API キーの生成時にサービスアカウントは必要ありません。
サービスアカウントを Weaviate データベースの API キーにバインドして、認証（AuthN）と認可（AuthZ）を構成できますが、サービスアカウントは必須ではありません。
API キーの Secret Manager をプロジェクトに保存し、これらのシークレットリソースに対する権限をサービスアカウントに付与できます。
RAG Engine は、サービスアカウントを使用して、プロジェクトの Secret Manager から API キーにアクセスします。

Google Cloud コンソール環境をセットアップする

環境の設定方法については、こちらをクリックしてください

環境の設定方法については、次のいずれかのタブを選択してください。

Python

Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, activate Cloud Shell.

Activate Cloud Shell

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

If you're using a local shell, then create local authentication credentials for your user account:
```
gcloud auth application-default login
```
You don't need to do this if you're using Cloud Shell.

If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.
次のコマンドを実行して、Vertex AI SDK for Python をインストールまたは更新します。
```
pip3 install --upgrade "google-cloud-aiplatform>=1.38"
    
```

Node.js

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, activate Cloud Shell.

Activate Cloud Shell

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

If you're using a local shell, then create local authentication credentials for your user account:
```
gcloud auth application-default login
```
You don't need to do this if you're using Cloud Shell.

If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.
次のコマンドを実行して、Vertex AI SDK for Node.js をインストールまたは更新します。
```
npm install @google-cloud/vertexai
    
```

Java

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, activate Cloud Shell.

Activate Cloud Shell

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

If you're using a local shell, then create local authentication credentials for your user account:
```
gcloud auth application-default login
```
You don't need to do this if you're using Cloud Shell.

If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.

google-cloud-vertexai を依存関係として追加するには、環境に適したコードを追加します。

BOM ありの Maven

pom.xml に次の HTML を追加します。

<dependencyManagement>
  <dependencies>
    <dependency>
      <groupId>com.google.cloud</groupId>
      <artifactId>libraries-bom</artifactId>
      <version>26.32.0</version>
      <type>pom</type>
      <scope>import</scope>
    </dependency>
  </dependencies>
</dependencyManagement>
<dependencies>
  <dependency>
    <groupId>com.google.cloud</groupId>
    <artifactId>google-cloud-vertexai</artifactId>
  </dependency>
</dependencies>

BOM なしの Maven

pom.xml に次の HTML を追加します。

<dependency>
  <groupId>com.google.cloud</groupId>
  <artifactId>google-cloud-vertexai</artifactId>
  <version>0.4.0</version>
</dependency>

Gradle without BOM

Add the following to your build.gradle

implementation 'com.google.cloud:google-cloud-vertexai:0.4.0'

Go

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, activate Cloud Shell.

Activate Cloud Shell

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

If you're using a local shell, then create local authentication credentials for your user account:
```
gcloud auth application-default login
```
You don't need to do this if you're using Cloud Shell.

If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.
利用可能な Vertex AI API Go パッケージを確認して、プロジェクトのニーズに最適なパッケージを判断します。
- パッケージ cloud.google.com/go/vertexai（推奨）
  
  vertexai は人の手で作成されたパッケージで、一般的な機能へのアクセスを提供します。
  
  Vertex AI API を使用して構築するほとんどのデベロッパーにとって、これは出発点としておすすめのパッケージです。このパッケージに含まれていない機能にアクセスするには、自動生成された aiplatform を使用してください。
- パッケージ cloud.google.com/go/aiplatform
  
  aiplatform は自動生成されたパッケージです。
  
  このパッケージは、人が作成した vertexai パッケージではまだ提供されていない Vertex AI API 機能にアクセスする必要があるプロジェクトを対象としています。

次のいずれかのコマンドを実行して、プロジェクトのニーズに基づいて必要な Go パッケージをインストールします。

# Human authored package. Recommended for most developers.
go get cloud.google.com/go/vertexai
    

# Auto-generated package.
go get cloud.google.com/go/aiplatform

C#

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, activate Cloud Shell.

Activate Cloud Shell

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

If you're using a local shell, then create local authentication credentials for your user account:
```
gcloud auth application-default login
```
You don't need to do this if you're using Cloud Shell.

If an authentication error is returned, and you are using an external identity provider (IdP), confirm that you have signed in to the gcloud CLI with your federated identity.

REST

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, on the project selector page, select or create a Google Cloud project.

Go to project selector

Enable the Vertex AI API.

Enable the API

In the Google Cloud console, activate Cloud Shell.

Activate Cloud Shell

At the bottom of the Google Cloud console, a Cloud Shell session starts and displays a command-line prompt. Cloud Shell is a shell environment with the Google Cloud CLI already installed and with values already set for your current project. It can take a few seconds for the session to initialize.

次のように入力して、環境変数を構成します。PROJECT_ID は、 Google Cloud プロジェクトの ID に置き換えます。
```
MODEL_ID="gemini-2.0-flash-001"
PROJECT_ID="PROJECT_ID"
    
```

エンドポイントをプロビジョニングします。

gcloud beta services identity create --service=aiplatform.googleapis.com --project=${PROJECT_ID}

省略可: Cloud Shell を使用しているときに Cloud Shell の承認を求められた場合は、[承認] をクリックします。

RAG コーパスを準備する

Weaviate データベースのデータにアクセスするには、RAG Engine が RAG コーパスにアクセスできる必要があります。このセクションでは、単一の RAG コーパスと追加の RAG コーパスを作成する手順について説明します。

`CreateRagCorpus` API と `UpdateRagCorpus` API を使用する

CreateRagCorpus API と UpdateRagCorpus API を呼び出す場合は、次のフィールドを指定する必要があります。

rag_vector_db_config.weaviate: CreateRagCorpus API を呼び出した後、ベクトルデータベースの構成が選択されます。ベクトルデータベースの構成には、すべての構成フィールドが含まれています。rag_vector_db_config.weaviate フィールドが設定されていない場合、デフォルトで rag_vector_db_config.rag_managed_db が設定されます。
weaviate.http_endpoint: HTTPS または HTTP Weaviate エンドポイントは、Weaviate データベースインスタンスのプロビジョニング中に作成されます。
weaviate.collection_name: Weaviate インスタンスのプロビジョニング中に作成されるコレクションの名前。名前の先頭は大文字にする必要があります。
api_auth.api_key_config: ベクトルデータベースへのアクセスを承認するために API キーを使用するように構成します。
api_key_config.api_key_secret_version: Secret Manager に保存されているシークレットのリソース名。ここには Weaviate API キーが含まれています。

RAG コーパスを作成して、データベースインスタンス内の Weaviate コレクションに関連付けることができます。ただし、API キーの生成や Weaviate データベースインスタンスの構成には、サービスアカウントが必要になる場合があります。最初の RAG コーパスを作成すると、サービスアカウントが生成されます。最初の RAG コーパスを作成した後、Weaviate データベースと API キーの関連付けが、別の RAG コーパスの作成で使用可能な状態になっていない場合があります。

データベースとキーを RAG コーパスに関連付ける準備ができていない場合は、RAG コーパスに対して次の操作を行います。

rag_vector_db_config の weaviate フィールドを設定します。
- 関連付けられたベクトルデータベースは変更できません。
- http_endpoint フィールドと collection_name フィールドの両方を空のままにします。どちらのフィールドも後で更新できます。
API キーを Secret Manager に保存していない場合は、api_auth フィールドを空白のままにします。UpdateRagCorpus API を呼び出すときに、api_auth フィールドを更新できます。Weaviate を使用するには、次のことを行う必要があります。
1. api_auth フィールドに api_key_config を設定します。
2. Secret Manager で Weaviate API キーの api_key_secret_version を設定します。api_key_secret_version フィールドの形式は次のとおりです。
  
  projects/{project}/secrets/{secret}/versions/{version}
http_endpoint や collection_name など、1 回しか設定できないフィールドを指定した場合、RAG コーパスを削除して RAG コーパスを再度作成しない限り、そのフィールドを変更することはできません。API キーフィールド api_key_secret_version などの他のフィールドは更新できます。
UpdateRagCorpus を呼び出すときに、vector_db フィールドを設定できます。vector_db は、CreateRagCorpus API 呼び出しによって weaviate に設定する必要があります。指定しない場合、デフォルトの [RAG Managed Database] オプションが選択されます。このオプションは、UpdateRagCorpus API を呼び出すときに変更できません。UpdateRagCorpus を呼び出して vector_db フィールドが部分的に設定されている場合、「変更可能」（可変）とマークされているフィールドを更新できます。

次の表に、コードで使用される WeaviateConfig の可変フィールドと不変フィールドを示します。

フィールド名	変更可能または変更不可
`http_endpoint`	設定後は変更不可
`collection_name`	設定後は変更不可
`api_key_authentication`	変更可

最初の RAG コーパスを作成する

RAG Engine サービスアカウントが存在しない場合は、次の操作を行います。

空の Weaviate 構成を使用して、RAG Engine に RAG コーパスを作成します。これにより、RAG Engine のプロビジョニングが開始され、サービスアカウントが作成されます。
次の形式の RAG Engine サービスアカウントの名前を選択します。
service-{project number}@gcp-sa-vertex-rag.iam.gserviceaccount.com

例: service-123456789@gcp-sa-vertex-rag.iam.gserviceaccount.com
サービスアカウントを使用して、プロジェクトの Secret Manager に保存されているシークレット（Weaviate API キーを含む）にアクセスします。
Weaviate のプロビジョニングが完了したら、次の情報を取得します。
- Weaviate の HTTPS または HTTP エンドポイント。
- Weaviate コレクションの名前。
CreateRagCorpus API を呼び出して空の Weaviate 構成で RAG コーパスを作成し、UpdateRagCorpus API を呼び出して次の情報で RAG コーパスに更新します。
- Weaviate の HTTPS または HTTP エンドポイント。
- Weaviate コレクションの名前。
- API キーのリソース名。

別の RAG コーパスを作成する

RAG Engine サービスアカウントが存在する場合は、次の操作を行います。

プロジェクトの権限から RAG Engine サービスアカウントを取得します。
[Google 提供のロール付与を含む] オプションを有効にします。
次の形式の RAG Engine サービスアカウントの名前を選択します。
service-{project number}@gcp-sa-vertex-rag.iam.gserviceaccount.com
サービスアカウントを使用して、プロジェクトの Secret Manager に保存されているシークレット（Weaviate API キーを含む）にアクセスします。
Weaviate のプロビジョニング中に、次の情報を取得します。
- Weaviate の HTTPS または HTTP エンドポイント。
- Weaviate コレクションの名前。
RAG Engine で RAG コーパスを作成し、次のいずれかの方法で Weaviate コレクションに接続します。
1. CreateRagCorpus API 呼び出しを行い、Weaviate 構成が入力された RAG コーパスを作成します。これは推奨の方法です。
2. CreateRagCorpus API 呼び出しを実行して空の Weaviate 構成で RAG コーパスを作成し、UpdateRagCorpus API 呼び出しを実行して次の情報で RAG コーパスに更新します。
  - Weaviate データベースの HTTP エンドポイント
  - Weaviate コレクション名
  - API キー

例

このセクションでは、Weaviate データベース、Secret Manager、RAG コーパス、RAG ファイルを設定する方法を説明するサンプルコードを示します。また、ファイルのインポート方法、コンテキストの取得方法、コンテンツの生成方法、RAG コーパスと RAG ファイルの削除方法を示すサンプルコードも用意されています。

Model Garden RAG API ノートブックを使用するには、Llama 3 で Weaviate を使用するをご覧ください。

Weaviate データベースを設定する

このコードサンプルは、Weaviate データと Secret Manager を設定する方法を示しています。

REST

# TODO(developer): Update the variables.
# The HTTPS/HTTP Weaviate endpoint you created during provisioning.
HTTP_ENDPOINT_NAME="https://your.weaviate.endpoint.com"

# Your Weaviate API Key.
WEAVIATE_API_KEY="example-api-key"

# Select your Weaviate collection name, which roughly corresponds to a Vertex AI Knowledge Engine Corpus.
# For example, "MyCollectionName"
# Note that the first letter needs to be capitalized.
# Otherwise, Weavaite will capitalize it for you.
WEAVIATE_COLLECTION_NAME="MyCollectionName"

# Create a collection in Weaviate which includes the required schema fields shown below.
echo '{
  "class": "'${WEAVIATE_COLLECTION_NAME}'",
  "properties": [
    { "name": "fileId", "dataType": [ "string" ] },
    { "name": "corpusId", "dataType": [ "string" ] },
    { "name": "chunkId", "dataType": [ "string" ] },
    { "name": "chunkDataType", "dataType": [ "string" ] },
    { "name": "chunkData", "dataType": [ "string" ] },
    { "name": "fileOriginalUri", "dataType": [ "string" ] }
  ]
}' | curl \
    -X POST \
    -H 'Content-Type: application/json' \
    -H "Authorization: Bearer "${WEAVIATE_API_KEY} \
    -d @- \
    ${HTTP_ENDPOINT_NAME}/v1/schema

Secret Manager を設定する

Secret Manager を設定するには、Secret Manager を有効にして権限を設定する必要があります。

シークレットの作成

Secret Manager を有効にするには、次の操作を行います。

コンソール

[Secret Manager] ページに移動します。
Secret Manager に移動
[+ シークレットの作成] をクリックします。
シークレットの名前を入力します。シークレット名には、英字（A～Z）、数字（0～9）、ダッシュ（-）、アンダースコア（_）のみを使用できます。
次のフィールドの指定は任意です。
1. シークレットを含むファイルをアップロードするには、[参照] をクリックします。
2. レプリケーションポリシーを確認します。
3. シークレットのロケーションを手動で管理する場合は、[このシークレットのロケーションを手動で管理する] チェックボックスをオンにします。少なくとも 1 つのリージョンを選択する必要があります。
4. 暗号化オプションを選択します。
5. ローテーション期間を手動で設定する場合は、[ローテーション期間を設定する] をオンにします。
6. イベント通知を受け取るために公開トピックまたはサブスクリプショントピックを指定する場合は、[Add topics] をクリックします。
7. デフォルトでは、シークレットは期限切れにはなりません。有効期限を設定する場合は、[有効期限を設定する] をオンにします。
8. デフォルトで、シークレットバージョンはリクエストされると破棄されます。シークレットバージョンの破棄を遅らせるには、[破棄の遅延期間を設定する] をオンにします。
9. ラベルを使用してシークレットを整理および分類する場合は、[+ ラベルを追加] をクリックします。
10. アノテーションを使用して非識別的なメタデータをシークレットに関連付ける場合は、[+ アノテーションを追加] をクリックします。
[シークレットの作成] をクリックします。

REST

# Create a secret in SecretManager.
curl "https://secretmanager.googleapis.com/v1/projects/${PROJECT_ID}/secrets?secretId=${SECRET_NAME}" \
    --request "POST" \
    --header "authorization: Bearer $(gcloud auth print-access-token)" \
    --header "content-type: application/json" \
    --data "{\"replication\": {\"automatic\": {}}}"

Python

このサンプルを試す前に、Vertex AI クイックスタート: クライアントライブラリの使用にある Python の設定手順を完了してください。詳細については、Vertex AI Python API のリファレンスドキュメントをご覧ください。

Vertex AI に対する認証を行うには、アプリケーションのデフォルト認証情報を設定します。詳細については、ローカル開発環境の認証を設定するをご覧ください。

# Import the Secret Manager client library.
from google.cloud import secretmanager


def create_secret(
    project_id: str, secret_id: str, ttl: Optional[str] = None
) -> secretmanager.Secret:
    """
    Create a new secret with the given name. A secret is a logical wrapper
    around a collection of secret versions. Secret versions hold the actual
    secret material.

     Args:
        project_id (str): The project ID where the secret is to be created.
        secret_id (str): The ID to assign to the new secret. This ID must be unique within the project.
        ttl (Optional[str]): An optional string that specifies the secret's time-to-live in seconds with
                             format (e.g., "900s" for 15 minutes). If specified, the secret
                             versions will be automatically deleted upon reaching the end of the TTL period.

    Returns:
        secretmanager.Secret: An object representing the newly created secret, containing details like the
                              secret's name, replication settings, and optionally its TTL.

    Example:
        # Create a secret with automatic replication and no TTL
        new_secret = create_secret("my-project", "my-new-secret")

        # Create a secret with a TTL of 30 days
        new_secret_with_ttl = create_secret("my-project", "my-timed-secret", "7776000s")
    """

    # Create the Secret Manager client.
    client = secretmanager.SecretManagerServiceClient()

    # Build the resource name of the parent project.
    parent = f"projects/{project_id}"

    # Create the secret.
    response = client.create_secret(
        request={
            "parent": parent,
            "secret_id": secret_id,
            "secret": {"replication": {"automatic": {}}, "ttl": ttl},
        }
    )

    # Print the new secret name.
    print(f"Created secret: {response.name}")

権限を設定

サービスアカウントに Secret Manager の権限を付与する必要があります。

コンソール

Google Cloud コンソールの [IAM と管理] セクションで、サービスアカウントを見つけて、鉛筆アイコンをクリックして編集します。
[ロール] フィールドで、[Secret Manager のシークレットアクセサー] を選択します。

Python

def iam_grant_access(
    project_id: str, secret_id: str, member: str
) -> iam_policy_pb2.SetIamPolicyRequest:
    """
    Grant the given member access to a secret.
    """

    # Import the Secret Manager client library.
    from google.cloud import secretmanager

    # Create the Secret Manager client.
    client = secretmanager.SecretManagerServiceClient()

    # Build the resource name of the secret.
    name = client.secret_path(project_id, secret_id)

    # Get the current IAM policy.
    policy = client.get_iam_policy(request={"resource": name})

    # Add the given member with access permissions.
    policy.bindings.add(role="roles/secretmanager.secretAccessor", members=[member])

    # Update the IAM Policy.
    new_policy = client.set_iam_policy(request={"resource": name, "policy": policy})

    # Print data about the secret.
    print(f"Updated IAM policy on {secret_id}")

Secret のバージョンを追加する

REST

# TODO(developer): Update the variables.
# Select a resource name for your Secret, which contains your API Key.
SECRET_NAME="MyWeaviateApiKeySecret"

# Your Weaviate API Key.
WEAVIATE_API_KEY="example-api-key"
# Encode your WEAVIATE_API_KEY using base 64.
SECRET_DATA=$(echo ${WEAVIATE_API_KEY} | base64)

# Create a new version of your secret which uses SECRET_DATA as payload
curl "https://secretmanager.googleapis.com/v1/projects/${PROJECT_ID}/secrets/${SECRET_NAME}:addVersion" \
    --request "POST" \
    --header "authorization: Bearer $(gcloud auth print-access-token)" \
    --header "content-type: application/json" \
    --data "{\"payload\": {\"data\": \"${SECRET_DATA}\"}}"

Python

from google.cloud import secretmanager
import google_crc32c  # type: ignore


def add_secret_version(
    project_id: str, secret_id: str, payload: str
) -> secretmanager.SecretVersion:
    """
    Add a new secret version to the given secret with the provided payload.
    """

    # Create the Secret Manager client.
    client = secretmanager.SecretManagerServiceClient()

    # Build the resource name of the parent secret.
    parent = client.secret_path(project_id, secret_id)

    # Convert the string payload into a bytes. This step can be omitted if you
    # pass in bytes instead of a str for the payload argument.
    payload_bytes = payload.encode("UTF-8")

    # Calculate payload checksum. Passing a checksum in add-version request
    # is optional.
    crc32c = google_crc32c.Checksum()
    crc32c.update(payload_bytes)

    # Add the secret version.
    response = client.add_secret_version(
        request={
            "parent": parent,
            "payload": {
                "data": payload_bytes,
                "data_crc32c": int(crc32c.hexdigest(), 16),
            },
        }
    )

    # Print the new secret version name.
    print(f"Added secret version: {response.name}")

Llama 3 で Weaviate を使用する

Model Garden RAG API ノートブックでは、Vertex AI SDK for Python で Weaviate コーパスと Llama 3 モデルを使用する方法を示しています。ノートブックを使用するには、次の操作を行う必要があります。

その他の例については、例をご覧ください。

RAG コーパスを作成する

このコードサンプルは、RAG コーパスを作成し、Weaviate インスタンスをベクトルデータベースとして設定する方法を示しています。

REST

  # TODO(developer): Update the variables.
  PROJECT_ID = "YOUR_PROJECT_ID"
  # The HTTPS/HTTP Weaviate endpoint you created during provisioning.
  HTTP_ENDPOINT_NAME="https://your.weaviate.endpoint.com"

  # Your Weaviate collection name, which roughly corresponds to a Vertex AI Knowledge Engine Corpus.
  # For example, "MyCollectionName"
  # Note that the first letter needs to be capitalized.
  # Otherwise, Weaviate will capitalize it for you.
  WEAVIATE_COLLECTION_NAME="MyCollectionName"

  # The resource name of your Weaviate API Key your Secret.
  SECRET_NAME="MyWeaviateApiKeySecret"
  # The Secret Manager resource name containing the API Key for your Weaviate endpoint.
  # For example, projects/{project}/secrets/{secret}/versions/latest
  APIKEY_SECRET_VERSION="projects/${PROJECT_ID}/secrets/${SECRET_NAME}/versions/latest"

  # Select a Corpus display name.
  CORPUS_DISPLAY_NAME="SpecialCorpus"

  # Call CreateRagCorpus API and set all Vector DB Config parameters for Weaviate to create a new corpus associated to your selected Weaviate collection.
  curl -X POST \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
  -H "Content-Type: application/json" \
  https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/ragCorpora \
  -d '{
        "display_name" : '\""${CORPUS_DISPLAY_NAME}"\"',
        "rag_vector_db_config" : {
                "weaviate": {
                      "http_endpoint": '\""${HTTP_ENDPOINT_NAME}"\"',
                      "collection_name": '\""${WEAVIATE_COLLECTION_NAME}"\"'
                },
          "api_auth" : {
                  "api_key_config": {
                        "api_key_secret_version": '\""${APIKEY_SECRET_VERSION}"\"'
                  }
          }
        }
    }'

  # TODO(developer): Update the variables.
  # Get operation_id returned in CreateRagCorpus.
  OPERATION_ID="your-operation-id"

  # Poll Operation status until done = true in the response.
  curl -X GET \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    -H "Content-Type: application/json" \
  https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/operations/${OPERATION_ID}

  # Call ListRagCorpora API to verify the RAG corpus is created successfully.
  curl -sS -X GET \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    "https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/ragCorpora"

Python


from vertexai.preview import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# weaviate_http_endpoint = "weaviate-http-endpoint"
# weaviate_collection_name = "weaviate-collection-name"
# weaviate_api_key_secret_manager_version = "projects/{PROJECT_ID}/secrets/{SECRET_NAME}/versions/latest"
# display_name = "test_corpus"
# description = "Corpus Description"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

# Configure embedding model (Optional)
embedding_model_config = rag.EmbeddingModelConfig(
    publisher_model="publishers/google/models/text-embedding-004"
)

# Configure Vector DB
vector_db = rag.Weaviate(
    weaviate_http_endpoint=weaviate_http_endpoint,
    collection_name=weaviate_collection_name,
    api_key=weaviate_api_key_secret_manager_version,
)

corpus = rag.create_corpus(
    display_name=display_name,
    description=description,
    embedding_model_config=embedding_model_config,
    vector_db=vector_db,
)
print(corpus)
# Example response:
# RagCorpus(name='projects/1234567890/locations/us-central1/ragCorpora/1234567890',
# display_name='test_corpus', description='Corpus Description', embedding_model_config=...
# ...

RAG ファイルを使用する

RAG API は、ファイルのアップロード、インポート、一覧取得、削除を処理します。

REST

リクエストのデータを使用する前に、次のように置き換えます。

PROJECT_ID: 。
LOCATION: リクエストを処理するリージョン。
RAG_CORPUS_ID: RagCorpus リソースの ID。
INPUT_FILE: ローカルファイルのパス。
FILE_DISPLAY_NAME: RagFile の表示名。
RAG_FILE_DESCRIPTION: RagFile の説明。

HTTP メソッドと URL:

POST https://LOCATION-aiplatform.googleapis.com/upload/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles:upload

リクエストの本文（JSON）:

{
 "rag_file": {
  "display_name": "FILE_DISPLAY_NAME",
  "description": "RAG_FILE_DESCRIPTION"
 }
}

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

リクエスト本文を INPUT_FILE という名前のファイルに保存して、次のコマンドを実行します。

curl -X POST \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @INPUT_FILE \
     "https://LOCATION-aiplatform.googleapis.com/upload/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles:upload"

PowerShell

リクエスト本文を INPUT_FILE という名前のファイルに保存して、次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method POST `
    -Headers $headers `
    -ContentType: "application/json; charset=utf-8" `
    -InFile INPUT_FILE `
    -Uri "https://LOCATION-aiplatform.googleapis.com/upload/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles:upload" | Select-Object -Expand Content

成功すると、RagFile リソースが返されます。RagFile.name フィールドの最後のコンポーネントは、サーバー生成の rag_file_id です。

Python

Vertex AI SDK for Python のインストールまたは更新の方法については、Vertex AI SDK for Python をインストールするをご覧ください。詳細については、 Python API リファレンスドキュメントをご覧ください。


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}"
# path = "path/to/local/file.txt"
# display_name = "file_display_name"
# description = "file description"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

rag_file = rag.upload_file(
    corpus_name=corpus_name,
    path=path,
    display_name=display_name,
    description=description,
)
print(rag_file)
# RagFile(name='projects/[PROJECT_ID]/locations/us-central1/ragCorpora/1234567890/ragFiles/09876543',
#  display_name='file_display_name', description='file description')

RAG ファイルをインポートする

ファイルとフォルダは、ドライブまたは Cloud Storage からインポートできます。

REST

response.metadata を使用して、SDK の response オブジェクトで部分的な失敗、リクエスト時間、レスポンス時間を確認します。

リクエストのデータを使用する前に、次のように置き換えます。

PROJECT_ID: 。
LOCATION: リクエストを処理するリージョン。
RAG_CORPUS_ID: RagCorpus リソースの ID。
GCS_URIS: Cloud Storage ロケーションのリスト。例: gs://my-bucket1, gs://my-bucket2
DRIVE_RESOURCE_ID: ドライブリソースの ID。例:

https://drive.google.com/file/d/ABCDE
https://drive.google.com/corp/drive/u/0/folders/ABCDEFG

DRIVE_RESOURCE_TYPE: ドライブリソースのタイプ。オプション:

RESOURCE_TYPE_FILE - ファイル
RESOURCE_TYPE_FOLDER - フォルダ

CHUNK_SIZE: 省略可。各チャンクに必要なトークンの数。
CHUNK_OVERLAP: 省略可: チャンク間で重複するトークンの数。

HTTP メソッドと URL:

POST https://LOCATION-aiplatform.googleapis.com/upload/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles:import

リクエストの本文（JSON）:

{
  "import_rag_files_config": {
    "gcs_source": {
      "uris": GCS_URIS
    },
    "google_drive_source": {
      "resource_ids": {
        "resource_id": DRIVE_RESOURCE_ID,
        "resource_type": DRIVE_RESOURCE_TYPE
      },
    }
  }
}

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

リクエスト本文を request.json という名前のファイルに保存して、次のコマンドを実行します。

curl -X POST \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @request.json \
     "https://LOCATION-aiplatform.googleapis.com/upload/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles:import"

PowerShell

リクエスト本文を request.json という名前のファイルに保存して、次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method POST `
    -Headers $headers `
    -ContentType: "application/json; charset=utf-8" `
    -InFile request.json `
    -Uri "https://LOCATION-aiplatform.googleapis.com/upload/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles:import" | Select-Object -Expand Content

成功すると、ImportRagFilesOperationMetadata リソースが返されます。

次のサンプルは、Cloud Storage からファイルをインポートする方法を示しています。max_embedding_requests_per_min 制御フィールドを使用して、RAG Engine が ImportRagFiles インデックス登録プロセス中にエンベディングモデルを呼び出すレートを制限します。このフィールドのデフォルト値は、1 分あたり 1000 回の呼び出しです。

// Cloud Storage bucket/file location.
// Such as "gs://rag-e2e-test/"
GCS_URIS=YOUR_GCS_LOCATION

// Enter the QPM rate to limit RAG's access to your embedding model
// Example: 1000
EMBEDDING_MODEL_QPM_RATE=MAX_EMBEDDING_REQUESTS_PER_MIN_LIMIT

// ImportRagFiles
// Import a single Cloud Storage file or all files in a Cloud Storage bucket.
// Input: ENDPOINT, PROJECT_ID, RAG_CORPUS_ID, GCS_URIS
// Output: ImportRagFilesOperationMetadataNumber
// Use ListRagFiles to find the server-generated rag_file_id.
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://${ENDPOINT}/v1beta1/projects/${PROJECT_ID}/locations/${LOCATION}/ragCorpora/${RAG_CORPUS_ID}/ragFiles:import \
-d '{
  "import_rag_files_config": {
    "gcs_source": {
      "uris": '\""${GCS_URIS}"\"'
    },
    "rag_file_chunking_config": {
      "chunk_size": 512
    },
    "max_embedding_requests_per_min": '"${EMBEDDING_MODEL_QPM_RATE}"'
  }
}'

// Poll the operation status.
// The response contains the number of files imported.
OPERATION_ID=OPERATION_ID
poll_op_wait ${OPERATION_ID}

次のサンプルは、ドライブからファイルをインポートする方法を示しています。max_embedding_requests_per_min 制御フィールドを使用して、RAG Engine が ImportRagFiles インデックス登録プロセス中にエンベディングモデルを呼び出すレートを制限します。このフィールドのデフォルト値は、1 分あたり 1000 回の呼び出しです。

// Google Drive folder location.
FOLDER_RESOURCE_ID=YOUR_GOOGLE_DRIVE_FOLDER_RESOURCE_ID

// Enter the QPM rate to limit RAG's access to your embedding model
// Example: 1000
EMBEDDING_MODEL_QPM_RATE=MAX_EMBEDDING_REQUESTS_PER_MIN_LIMIT

// ImportRagFiles
// Import all files in a Google Drive folder.
// Input: ENDPOINT, PROJECT_ID, RAG_CORPUS_ID, FOLDER_RESOURCE_ID
// Output: ImportRagFilesOperationMetadataNumber
// Use ListRagFiles to find the server-generated rag_file_id.
curl -X POST \
-H "Authorization: Bearer $(gcloud auth print-access-token)" \
-H "Content-Type: application/json" \
https://${ENDPOINT}/v1beta1/projects/${PROJECT_ID}/locations/${LOCATION}/ragCorpora/${RAG_CORPUS_ID}/ragFiles:import \
-d '{
  "import_rag_files_config": {
    "google_drive_source": {
      "resource_ids": {
        "resource_id": '\""${FOLDER_RESOURCE_ID}"\"',
        "resource_type": "RESOURCE_TYPE_FOLDER"
      }
    },
    "max_embedding_requests_per_min": '"${EMBEDDING_MODEL_QPM_RATE}"'
  }
}'

// Poll the operation status.
// The response contains the number of files imported.
OPERATION_ID=OPERATION_ID
poll_op_wait ${OPERATION_ID}

Python


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}"
# paths = ["https://drive.google.com/file/123", "gs://my_bucket/my_files_dir"]  # Supports Google Cloud Storage and Google Drive Links

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

response = rag.import_files(
    corpus_name=corpus_name,
    paths=paths,
    transformation_config=rag.TransformationConfig(
        rag.ChunkingConfig(chunk_size=512, chunk_overlap=100)
    ),
    import_result_sink="gs://sample-existing-folder/sample_import_result_unique.ndjson",  # Optional, this has to be an existing storage bucket folder, and file name has to be unique (non-existent).
    max_embedding_requests_per_min=900,  # Optional
)
print(f"Imported {response.imported_rag_files_count} files.")
# Example response:
# Imported 2 files.

RAG ファイルを取得する

REST

リクエストのデータを使用する前に、次のように置き換えます。

PROJECT_ID: 。
LOCATION: リクエストを処理するリージョン。
RAG_CORPUS_ID: RagCorpus リソースの ID。
RAG_FILE_ID: RagFile リソースの ID。

HTTP メソッドと URL:

GET https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles/RAG_FILE_ID

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

次のコマンドを実行します。

curl -X GET \
     "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles/RAG_FILE_ID"

PowerShell

次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method GET `
    -Headers $headers `
    -Uri "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles/RAG_FILE_ID" | Select-Object -Expand Content

成功すると、RagFile リソースが返されます。

Python


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# file_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}/ragFiles/{rag_file_id}"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

rag_file = rag.get_file(name=file_name)
print(rag_file)
# Example response:
# RagFile(name='projects/1234567890/locations/us-central1/ragCorpora/11111111111/ragFiles/22222222222',
# display_name='file_display_name', description='file description')

RAG ファイルの一覧を取得する

REST

リクエストのデータを使用する前に、次のように置き換えます。

PROJECT_ID: 。
LOCATION: リクエストを処理するリージョン。
RAG_CORPUS_ID: RagCorpus リソースの ID。
PAGE_SIZE: 標準的なリストのページサイズ。page_size パラメータを更新して、ページごとに返される RagFiles の数を調整できます。
PAGE_TOKEN: 標準的なリストのページトークン。通常は、前の VertexRagDataService.ListRagFiles 呼び出しの ListRagFilesResponse.next_page_token を使用して取得します。

HTTP メソッドと URL:

GET https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles?page_size=PAGE_SIZE&page_token=PAGE_TOKEN

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

次のコマンドを実行します。

curl -X GET \
     "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles?page_size=PAGE_SIZE&page_token=PAGE_TOKEN"

PowerShell

次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method GET `
    -Headers $headers `
    -Uri "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles?page_size=PAGE_SIZE&page_token=PAGE_TOKEN" | Select-Object -Expand Content

指定された RAG_CORPUS_ID の下で、成功ステータスコード（2xx）と RagFiles のリストが返されます。

Python


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

files = rag.list_files(corpus_name=corpus_name)
for file in files:
    print(file.display_name)
    print(file.name)
# Example response:
# g-drive_file.txt
# projects/1234567890/locations/us-central1/ragCorpora/111111111111/ragFiles/222222222222
# g_cloud_file.txt
# projects/1234567890/locations/us-central1/ragCorpora/111111111111/ragFiles/333333333333

RAG ファイルを削除する

REST

リクエストのデータを使用する前に、次のように置き換えます。

PROJECT_ID: 。
LOCATION: リクエストを処理するリージョン。
RAG_CORPUS_ID: RagCorpus リソースの ID。
RAG_FILE_ID: RagFile リソースの ID。形式: projects/{project}/locations/{location}/ragCorpora/{rag_corpus}/ragFiles/{rag_file_id}。

HTTP メソッドと URL:

DELETE https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles/RAG_FILE_ID

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

次のコマンドを実行します。

curl -X DELETE \
     "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles/RAG_FILE_ID"

PowerShell

次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method DELETE `
    -Headers $headers `
    -Uri "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/ragCorpora/RAG_CORPUS_ID/ragFiles/RAG_FILE_ID" | Select-Object -Expand Content

成功すると、DeleteOperationMetadata リソースが返されます。

Python


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# file_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}/ragFiles/{rag_file_id}"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

rag.delete_file(name=file_name)
print(f"File {file_name} deleted.")
# Example response:
# Successfully deleted the RagFile.
# File projects/1234567890/locations/us-central1/ragCorpora/1111111111/ragFiles/2222222222 deleted.

コンテキストを取得する

ユーザーが質問をするか、プロンプトを指定すると、RAG の検索コンポーネントがナレッジベースを検索し、クエリに関連する情報を探します。

REST

リクエストのデータを使用する前に、次のように置き換えます。

LOCATION: リクエストを処理するリージョン。
PROJECT_ID: 。
RAG_CORPUS_RESOURCE: RagCorpus リソースの名前。形式: projects/{project}/locations/{location}/ragCorpora/{rag_corpus}。
VECTOR_DISTANCE_THRESHOLD: ベクトル距離がしきい値より小さいコンテキストのみが返されます。
TEXT: 関連するコンテキストを取得するクエリテキスト。
SIMILARITY_TOP_K: 取得する上位コンテキストの数。

HTTP メソッドと URL:

POST https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION:retrieveContexts

リクエストの本文（JSON）:

{
 "vertex_rag_store": {
    "rag_resources": {
      "rag_corpus": "RAG_CORPUS_RESOURCE",
    },
    "vector_distance_threshold": 0.8
  },
  "query": {
   "text": "TEXT",
   "similarity_top_k": SIMILARITY_TOP_K
  }
 }

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

リクエスト本文を request.json という名前のファイルに保存して、次のコマンドを実行します。

curl -X POST \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @request.json \
     "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION:retrieveContexts"

PowerShell

リクエスト本文を request.json という名前のファイルに保存して、次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method POST `
    -Headers $headers `
    -ContentType: "application/json; charset=utf-8" `
    -InFile request.json `
    -Uri "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION:retrieveContexts" | Select-Object -Expand Content

成功ステータスコード（2xx）と関連する RagFiles のリストが返されます。

Python


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/[PROJECT_ID]/locations/us-central1/ragCorpora/[rag_corpus_id]"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

response = rag.retrieval_query(
    rag_resources=[
        rag.RagResource(
            rag_corpus=corpus_name,
            # Optional: supply IDs from `rag.list_files()`.
            # rag_file_ids=["rag-file-1", "rag-file-2", ...],
        )
    ],
    text="Hello World!",
    rag_retrieval_config=rag.RagRetrievalConfig(
        top_k=10,
        filter=rag.utils.resources.Filter(vector_distance_threshold=0.5),
    ),
)
print(response)
# Example response:
# contexts {
#   contexts {
#     source_uri: "gs://your-bucket-name/file.txt"
#     text: "....
#   ....

コンテンツを生成する

予測は、コンテンツを生成する LLM メソッドを制御します。

REST

リクエストのデータを使用する前に、次のように置き換えます。

PROJECT_ID: 。
LOCATION: リクエストを処理するリージョン。
MODEL_ID: コンテンツ生成用の LLM モデル。例: gemini-2.5-flash
GENERATION_METHOD: コンテンツ生成の LLM メソッド。オプション: generateContent、streamGenerateContent
INPUT_PROMPT: コンテンツ生成のために LLM に送信されるテキスト。アップロードされた Rag ファイルに関連するプロンプトを使用するようにしてください。
RAG_CORPUS_RESOURCE: RagCorpus リソースの名前。形式: projects/{project}/locations/{location}/ragCorpora/{rag_corpus}。
SIMILARITY_TOP_K: 省略可。取得する上位コンテキストの数。
VECTOR_DISTANCE_THRESHOLD: 省略可。ベクトル距離がしきい値より小さいコンテキストが返されます。

HTTP メソッドと URL:

POST https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:GENERATION_METHOD

リクエストの本文（JSON）:

{
 "contents": {
  "role": "user",
  "parts": {
    "text": "INPUT_PROMPT"
  }
 },
 "tools": {
  "retrieval": {
   "disable_attribution": false,
   "vertex_rag_store": {
    "rag_resources": {
      "rag_corpus": "RAG_CORPUS_RESOURCE",
    },
    "similarity_top_k": SIMILARITY_TOP_K,
    "vector_distance_threshold": VECTOR_DISTANCE_THRESHOLD
   }
  }
 }
}

リクエストを送信するには、次のいずれかのオプションを選択します。

curl

リクエスト本文を request.json という名前のファイルに保存して、次のコマンドを実行します。

curl -X POST \
     -H "Content-Type: application/json; charset=utf-8" \
     -d @request.json \
     "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:GENERATION_METHOD"

PowerShell

リクエスト本文を request.json という名前のファイルに保存して、次のコマンドを実行します。

$headers = @{  }

Invoke-WebRequest `
    -Method POST `
    -Headers $headers `
    -ContentType: "application/json; charset=utf-8" `
    -InFile request.json `
    -Uri "https://LOCATION-aiplatform.googleapis.com/v1beta1/projects/PROJECT_ID/locations/LOCATION/publishers/google/models/MODEL_ID:GENERATION_METHOD" | Select-Object -Expand Content

成功すると、生成されたコンテンツと引用が返されます。

Python


from vertexai import rag
from vertexai.generative_models import GenerativeModel, Tool
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

rag_retrieval_tool = Tool.from_retrieval(
    retrieval=rag.Retrieval(
        source=rag.VertexRagStore(
            rag_resources=[
                rag.RagResource(
                    rag_corpus=corpus_name,
                    # Optional: supply IDs from `rag.list_files()`.
                    # rag_file_ids=["rag-file-1", "rag-file-2", ...],
                )
            ],
            rag_retrieval_config=rag.RagRetrievalConfig(
                top_k=10,
                filter=rag.utils.resources.Filter(vector_distance_threshold=0.5),
            ),
        ),
    )
)

rag_model = GenerativeModel(
    model_name="gemini-2.0-flash-001", tools=[rag_retrieval_tool]
)
response = rag_model.generate_content("Why is the sky blue?")
print(response.text)
# Example response:
#   The sky appears blue due to a phenomenon called Rayleigh scattering.
#   Sunlight, which contains all colors of the rainbow, is scattered
#   by the tiny particles in the Earth's atmosphere....
#   ...

ハイブリッド検索

ハイブリッド検索は Weaviate データベースでサポートされています。Weaviate データベースは、セマンティック検索とキーワード検索の両方を組み合わせて、検索結果の関連性を高めます。検索結果の取得時に、セマンティック（密ベクトル）とキーワードマッチング（スパースベクトル）の類似性スコアを組み合わせて、最終的なランキング結果が生成されます。

RAG Engine 検索 API を使用したハイブリッド検索

これは、RAG Engine 検索 API を使用してハイブリッド検索を有効にする方法の例です。

REST

  # TODO(developer): Update the variables.
  PROJECT_ID = "YOUR_PROJECT_ID"
  # The HTTPS/HTTP Weaviate endpoint you created during provisioning.
  HTTP_ENDPOINT_NAME="https://your.weaviate.endpoint.com"

  # Your Weaviate collection name, which roughly corresponds to a Vertex AI Knowledge Engine Corpus.
  # For example, "MyCollectionName"
  # Note that the first letter needs to be capitalized.
  # Otherwise, Weaviate will capitalize it for you.
  WEAVIATE_COLLECTION_NAME="MyCollectionName"

  # The resource name of your Weaviate API Key your Secret.
  SECRET_NAME="MyWeaviateApiKeySecret"
  # The Secret Manager resource name containing the API Key for your Weaviate endpoint.
  # For example, projects/{project}/secrets/{secret}/versions/latest
  APIKEY_SECRET_VERSION="projects/${PROJECT_ID}/secrets/${SECRET_NAME}/versions/latest"

  # Select a Corpus display name.
  CORPUS_DISPLAY_NAME="SpecialCorpus"

  # Call CreateRagCorpus API and set all Vector DB Config parameters for Weaviate to create a new corpus associated to your selected Weaviate collection.
  curl -X POST \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
  -H "Content-Type: application/json" \
  https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/ragCorpora \
  -d '{
        "display_name" : '\""${CORPUS_DISPLAY_NAME}"\"',
        "rag_vector_db_config" : {
                "weaviate": {
                      "http_endpoint": '\""${HTTP_ENDPOINT_NAME}"\"',
                      "collection_name": '\""${WEAVIATE_COLLECTION_NAME}"\"'
                },
          "api_auth" : {
                  "api_key_config": {
                        "api_key_secret_version": '\""${APIKEY_SECRET_VERSION}"\"'
                  }
          }
        }
    }'

  # TODO(developer): Update the variables.
  # Get operation_id returned in CreateRagCorpus.
  OPERATION_ID="your-operation-id"

  # Poll Operation status until done = true in the response.
  curl -X GET \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    -H "Content-Type: application/json" \
  https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/operations/${OPERATION_ID}

  # Call ListRagCorpora API to verify the RAG corpus is created successfully.
  curl -sS -X GET \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    "https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/ragCorpora"

Python


from vertexai import rag
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/[PROJECT_ID]/locations/us-central1/ragCorpora/[rag_corpus_id]"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

response = rag.retrieval_query(
    rag_resources=[
        rag.RagResource(
            rag_corpus=corpus_name,
            # Optional: supply IDs from `rag.list_files()`.
            # rag_file_ids=["rag-file-1", "rag-file-2", ...],
        )
    ],
    text="Hello World!",
    rag_retrieval_config=rag.RagRetrievalConfig(
        top_k=10,
        filter=rag.utils.resources.Filter(vector_distance_threshold=0.5),
    ),
)
print(response)
# Example response:
# contexts {
#   contexts {
#     source_uri: "gs://your-bucket-name/file.txt"
#     text: "....
#   ....

ハイブリッド検索と RAG Engine を使用してグラウンディングされた結果を生成する

これは、ハイブリッド検索と RAG Engine を使用して、グラウンディングされた結果を生成する方法の例です。

REST

  # TODO(developer): Update the variables.
  PROJECT_ID = "YOUR_PROJECT_ID"
  # The HTTPS/HTTP Weaviate endpoint you created during provisioning.
  HTTP_ENDPOINT_NAME="https://your.weaviate.endpoint.com"

  # Your Weaviate collection name, which roughly corresponds to a Vertex AI Knowledge Engine Corpus.
  # For example, "MyCollectionName"
  # Note that the first letter needs to be capitalized.
  # Otherwise, Weaviate will capitalize it for you.
  WEAVIATE_COLLECTION_NAME="MyCollectionName"

  # The resource name of your Weaviate API Key your Secret.
  SECRET_NAME="MyWeaviateApiKeySecret"
  # The Secret Manager resource name containing the API Key for your Weaviate endpoint.
  # For example, projects/{project}/secrets/{secret}/versions/latest
  APIKEY_SECRET_VERSION="projects/${PROJECT_ID}/secrets/${SECRET_NAME}/versions/latest"

  # Select a Corpus display name.
  CORPUS_DISPLAY_NAME="SpecialCorpus"

  # Call CreateRagCorpus API and set all Vector DB Config parameters for Weaviate to create a new corpus associated to your selected Weaviate collection.
  curl -X POST \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
  -H "Content-Type: application/json" \
  https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/ragCorpora \
  -d '{
        "display_name" : '\""${CORPUS_DISPLAY_NAME}"\"',
        "rag_vector_db_config" : {
                "weaviate": {
                      "http_endpoint": '\""${HTTP_ENDPOINT_NAME}"\"',
                      "collection_name": '\""${WEAVIATE_COLLECTION_NAME}"\"'
                },
          "api_auth" : {
                  "api_key_config": {
                        "api_key_secret_version": '\""${APIKEY_SECRET_VERSION}"\"'
                  }
          }
        }
    }'

  # TODO(developer): Update the variables.
  # Get operation_id returned in CreateRagCorpus.
  OPERATION_ID="your-operation-id"

  # Poll Operation status until done = true in the response.
  curl -X GET \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    -H "Content-Type: application/json" \
  https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/operations/${OPERATION_ID}

  # Call ListRagCorpora API to verify the RAG corpus is created successfully.
  curl -sS -X GET \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $(gcloud auth print-access-token)" \
    "https://us-central1-aiplatform.googleapis.com/v1beta1/projects/${PROJECT_ID}/locations/us-central1/ragCorpora"

Python


from vertexai import rag
from vertexai.generative_models import GenerativeModel, Tool
import vertexai

# TODO(developer): Update and un-comment below lines
# PROJECT_ID = "your-project-id"
# corpus_name = "projects/{PROJECT_ID}/locations/us-central1/ragCorpora/{rag_corpus_id}"

# Initialize Vertex AI API once per session
vertexai.init(project=PROJECT_ID, location="us-central1")

rag_retrieval_tool = Tool.from_retrieval(
    retrieval=rag.Retrieval(
        source=rag.VertexRagStore(
            rag_resources=[
                rag.RagResource(
                    rag_corpus=corpus_name,
                    # Optional: supply IDs from `rag.list_files()`.
                    # rag_file_ids=["rag-file-1", "rag-file-2", ...],
                )
            ],
            rag_retrieval_config=rag.RagRetrievalConfig(
                top_k=10,
                filter=rag.utils.resources.Filter(vector_distance_threshold=0.5),
            ),
        ),
    )
)

rag_model = GenerativeModel(
    model_name="gemini-2.0-flash-001", tools=[rag_retrieval_tool]
)
response = rag_model.generate_content("Why is the sky blue?")
print(response.text)
# Example response:
#   The sky appears blue due to a phenomenon called Rayleigh scattering.
#   Sunlight, which contains all colors of the rainbow, is scattered
#   by the tiny particles in the Earth's atmosphere....
#   ...

次のステップ

Vertex AI RAG Engine で Pinecone を使用する

Vertex AI RAG Engine で Weaviate データベースを使用する コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

考慮事項

Weaviate データベースをプロビジョニングする

Weaviate データベース インスタンスを構成してデプロイする

HTTPS エンドポイントを準備する

Weaviate コレクションを作成する

API キーを使用し、AuthN と AuthZ を使用して Weaviate をプロビジョニングする。

API キーを作成する

API キーを Secret Manager に保存する

RAG Engine サービス アカウントをプロビジョニングする

Google Cloud コンソール環境をセットアップする

環境の設定方法については、こちらをクリックしてください

Python

Node.js

Java

BOM ありの Maven

BOM なしの Maven

Gradle without BOM

Go

C#

REST

RAG コーパスを準備する

CreateRagCorpus API と UpdateRagCorpus API を使用する

最初の RAG コーパスを作成する

別の RAG コーパスを作成する

例

Weaviate データベースを設定する

REST

Secret Manager を設定する

シークレットの作成

コンソール

REST

Python

権限を設定

コンソール

Python

Secret のバージョンを追加する

REST

Python

Llama 3 で Weaviate を使用する

RAG コーパスを作成する

REST

Python

RAG ファイルを使用する

REST

curl

PowerShell

Python

RAG ファイルをインポートする

REST

curl

PowerShell

Python

RAG ファイルを取得する

REST

curl

PowerShell

Python

RAG ファイルの一覧を取得する

REST

curl

PowerShell

Python

RAG ファイルを削除する

REST

curl

PowerShell

Python

コンテキストを取得する

REST

curl

PowerShell

Python

コンテンツを生成する

REST

curl

PowerShell

Python

ハイブリッド検索

RAG Engine 検索 API を使用したハイブリッド検索

Vertex AI RAG Engine で Weaviate データベースを使用する

Weaviate データベースインスタンスを構成してデプロイする

API キーを使用し、`AuthN` と `AuthZ` を使用して Weaviate をプロビジョニングする。

RAG Engine サービスアカウントをプロビジョニングする

`CreateRagCorpus` API と `UpdateRagCorpus` API を使用する