Class ConversationModelEvaluation (2.30.0)

ConversationModelEvaluation(mapping=None, *, ignore_unknown_fields=False, **kwargs)

Represents evaluation result of a conversation model.

.. _oneof:


name str
The resource name of the evaluation. Format: projects/
display_name str
Optional. The display name of the model evaluation. At most 64 bytes long.
Optional. The configuration of the evaluation task.
create_time google.protobuf.timestamp_pb2.Timestamp
Output only. Creation time of this model.
Output only. Only available when model is for smart reply. This field is a member of oneof_ metrics.
raw_human_eval_template_csv str
Output only. Human eval template in csv format. It tooks real-world conversations provided through input dataset, generates example suggestions for customer to verify quality of the model. For Smart Reply, the generated csv file contains columns of Context, (Suggestions,Q1,Q2)*3, Actual reply. Context contains at most 10 latest messages in the conversation prior to the current suggestion. Q1: "Would you send it as the next message of agent?" Evaluated based on whether the suggest is appropriate to be sent by agent in current context. Q2: "Does the suggestion move the conversation closer to resolution?" Evaluated based on whether the suggestion provide solutions, or answers customer's question or collect information from customer to resolve the customer's issue. Actual reply column contains the actual agent reply sent in the context.