Manage data assets using Data Catalog
Data Catalog is a feature of Dataplex that automatically catalogs metadata about Bigtable resources. This feature helps your organization search for and manage your Bigtable data assets.
Metadata is automatically synced from Bigtable to Data Catalog for the following Bigtable resources:
- Instances
- Clusters
- Tables, including column family details
Data Catalog is a fully managed metadata management service within Dataplex. It provides unified view and tagging mechanisms for technical and business metadata. To learn more about Data Catalog and why you should use it, see What is Data Catalog.
Benefits of using Data Catalog with Bigtable
Data Catalog information about your data can help facilitate analysis, data reuse, application development, and data management.
You can search entries in Data Catalog by resource name or by details. For example, you can search for an instance name to find the entry for an instance or search for a column family to find the entry for a table. You can also use a faceted search, applying filters to narrow down your search. For more information, see Search and view data assets with Data Catalog.
Tagging data assets can help you understand and catalog the data that is available in your Bigtable tables to make the data more discoverable and searchable by other Data Catalog users, and as part of your larger data estate. You use tags to add your own metadata to your cataloged metadata, and then you can use those tags to conduct searches.
Get started with Data Catalog
To get started using Data Catalog to manage your Bigtable data assets, complete the following tasks:
If you are new to Data Catalog, enable the API.
Grant IAM roles to principals who require access to Bigtable entries in Data Catalog.
Learn how to search for data assets by reading How to search with Data Catalog.
Review Data Catalog search syntax to understand how to find your metadata.
Read the concepts on tagging Data Catalog entries at Tags and tag templates
Ways to search for Bigtable metadata
You can search synced Bigtable metadata using Data Catalog.
In addition to standard qualifiers, you can search for Bigtable resources with the following:
Instance
- Bigtable instance name
- Bigtable instance ID
- Cluster location
- Cluster ID
Table
- Table ID
- Column family name
- Bigtable instance name
- Bigtable instance ID
Required roles
Viewing, tagging, and searching Bigtable resources in Data Catalog have different permission requirements. In general, grant principals in your organization the minimum role required to accomplish a task.
Permission to view public and private tags
To get the permissions that you need to view public and private tags on Bigtable resources, ask your administrator to grant you the following IAM roles:
These predefined roles contain the permissions required to view public and private tags. The exact permissions that are required are the following:
bigtable.instances.get
bigtable.clusters.get
bigtable.tables.get
datacatalog.tagTemplates.getTag
Permission to search Bigtable resources
To get the permissions that you need to search Bigtable resources in Data Catalog, ask your administrator to grant you the following IAM roles:
These predefined roles contain the permissions required to search Bigtable resources. The exact permissions that are required are the following:
bigtable.instances.get
bigtable.clusters.get
bigtable.tables.get
Permission to attach tags to Bigtable resources
To get the permissions that you need to attach tags to Bigtable resources in Data Catalog, ask your administrator to grant you the following IAM roles:
These predefined roles contain the permissions required to add public and private tags. The exact permissions that are required are the following:
bigtable.instances.update
bigtable.clusters.update
bigtable.tables.update
datacatalog.tagTemplates.use
datacatalog.entries.updateTag
(Optional) Create a custom role
You can create a custom role to grant to users who require complete access to all Bigtable metadata in Data Catalog across your project or organization. Then you can make any user or service account that needs access a principal of the custom role. To learn more about custom roles, see Roles and permissions.
Choose a meaningful name for the role, such as Bigtable Data Catalog Editor, and then assign only the principals who need access to the role. For setup instructions, see Creating a custom role.
Grant the following permissions to the custom role:
bigtable.instances.get
bigtable.instances.update
bigtable.clusters.get
bigtable.clusters.update
bigtable.tables.get
bigtable.tables.update
datacatalog.tagTemplates.getTag
datacatalog.tagTemplates.use
For details on Data Catalog permissions, see Roles to search Google Cloud resources. For descriptions of Bigtable permissions, see Permissions.
Known constraints
The following section lists some constraints that you must note before using Data Catalog to manage your Bigtable metadata.
Location constraints
If your organization has set up location constraints, metadata entries for your Bigtable resources might not be synced to Data Catalog. Any entries created before enforcing the location constraints are retained by Data Catalog.
To allow Data Catalog to load resources, we recommend adding the global
multi-region to your location policy.
Column family name exception
The Data Catalog API rejects requests where the column family
name in your request includes a period (.
). For example, if the column family
name is personal.info
.