This page describes serving configs. A serving config is a serving entity that associates a model or a set of controls that are used to generate your recommendation or search results.
Relationship with models and controls
When you create a serving config, you select a model (for recommendations) or controls (for search) to attach. Serving configs are invoked by your site when surfacing recommendations or search results. At serving time, the serving config's associated model or controls are referenced to determine what the recommendations or search results generate.
Models
A recommendations serving config can have a single model associated with it. However, any model can be associated with multiple serving configurations, enabling you to deploy the same model on different pages via different serving configs.
Serving controls
Search serving configs have a multi-to-multi relationship with serving controls. You can add multiple serving controls to a serving config, and a single serving control can be associated with multiple serving configs.
You can create serving controls and then add or swap them into a live search serving config.
API resource and permissions
A serving config is passed into the API using the
placements
resource:
- Recommendations uses the URL
projects/PROJECT_ID/locations/global/catalogs/default_catalog/placements/SERVING_CONFIG_ID:predict
. - Search uses the URL
projects/PROJECT_ID/locations/global/catalogs/default_catalog/placements/SERVING_CONFIG_ID:search
.
The permissions used on these resources are the placements.search
and
placements.predict
permissions.
Support for placements in Vertex AI Search for commerce
Serving configurations are available as of Recommendations AI v2 and Retail Search v2alpha.
The servingConfig
resource is available in the API versions
v2beta and v2alpha. You can use this resource to create, view, edit, and remove
serving configurations.
If you have existing placements, or create new placements, serving configurations associated with each placement are automatically created. Creating a serving configuration does not create a corresponding placement.
Deleting a serving configuration deletes its corresponding placement, and deleting a placement deletes its corresponding serving configuration.
Serving configurations allow you to edit diversity and price reranking options and have them take effect in near real-time. With placements, diversity and pricing settings can only be changed from the recommendation model that the placement points to.
Placements are still supported, but using serving configurations instead is recommended.
Recommendations options for serving configs
The following options allow you to change the behavior of a recommendations serving config.
These options were previously available when creating models; they are now associated with serving configs instead.
Price reranking
Price reranking causes recommended catalog items with a similar recommendation probability to be ordered by price, with the highest-priced items first. Price reranking is disabled by default.
Enabling price reranking helps balance conversion rates and average order values. Because relevance is also used to order the items returned, enabling price reranking is not the same as sorting by price.
This option can be edited after creating a serving config.
Diversification
If you want to ensure that results returned from a single prediction request are more diversified rather than looking similar, you can enable diversification. Generally, diversification reduces the likelihood that similar products are shown in a recommendation panel, at the risk of removing some good recommendations. Diversification settings can be edited after creating a serving configuration. It is disabled by default.
Two types of diversification are available: rule-based diversity and data-driven diversity.
Rule-based diversity
Rule-based diversity relies on categories of your product catalog. Use rule-based diversity to recommend products from a variety of categories. Diversification is configured by level, with higher levels of diversification causing fewer items to be displayed per category. This diversification type works best if your catalog provides high-quality product categories.
Diversification level | Max items per category |
---|---|
None | Unlimited |
Low | 3 |
Medium | 2 |
High | 1 |
Auto | Depends on catalog |
Data-driven diversity
Use data-driven diversity to produce recommendations results that balance relevance and diversity. Data-driven diversity learns from product catalog metadata, such as titles or categories. Instead of relying on the title or category words, data-driven diversity captures semantic similarity to produce better-performing diversification.
Diversification level | Max similar items |
---|---|
None | Unlimited |
Low | 3 |
Medium | 2 |
High | 1 |
Auto | Depends on catalog |
Category matching
If the serving config includes a Similar Items model for recommendations, you can enable category matching. When category matching is used, the only product results returned are those that share at least one category with the context product.
If your categories have deeply nested hierarchies, they are
truncated using heuristics to improve the possibility of a match. For
example, if the context product's categories are a > b > c > d > e > f
, the
returned results are those whose categories are a > b > c
.
Category matching can be used in combination with other filtering options, such
as price, availability, and filter tags. For example, if you use the filter tag
fall_Sale
and have category match enabled, the returned results are those that
have both the required tag and a category match.
This option can be edited after creating a serving config. By default, category matching is disabled.
Search options for serving configs
Dynamic faceting
You can turn on dynamic faceting when you create or edit a serving configuration.
When dynamic faceting is enabled for a serving config, search can automatically use attributes as dynamic facets in search results for this configuration, based on past user behavior such as facet clicks and views. Whether a given attribute can be used as a facet is by default defined by the product-level attribute configuration. Dynamic faceting settings in the API can be overwritten by site-wide attribute controls in the Search for commerce console. See Managing site-wide controls.
Note that dynamic facets can be created based just on accurate product catalog data.
However, for the feature to work optimally for your site, the facet models need
to learn from activity on your site. For this, you need to set query, category
and filter fields in your search
event uploads accurately.
Next steps
- Request predictions from your new placement.