Skip to main content
Feedback

AWS Bedrock: Monitoring agents

The metrics for the AWS Bedrock agents are currently in Beta mode and can currently display only certain items. You can monitor and access your agents through the Monitoring and Compliance screen.

You can filter out your AWS Bedrock agents based on Provider, accounts, models, and time frames. By default, the time frame selected is the Last 60 minutes UTC. For more information, read Monitoring and compliance.

Prerequisites

To view the metrics for your AWS Bedrock agents, ensure you have Enabled Bedrock Agent metric data access.

Dimensions and metrics

The metric ingestion into Agent Control tower from AWS Bedrock is split into two categories:

  • by Agent ARN and Alias ARN
  • by ModelIDs

The following table depicts the AWS metrics based on Alias ARN(Amazon Resource Name) used to calculate the metrics depicted in the Agent Control Tower:

AWS metrics based on Alias ARN(Amazon Resource Name)
Metric nameCalculated byDescription
TotalTimeOperationOperation, AgentArnOperation, AgentAliasArnTotal time for an operation to complete.
Time to first token (TTFT)OperationOperation, AgentArnOperation, AgentAliasArnTTFT for a given Operation, agent ARN across all agents and all model IDs
InvocationThrottlesOperationOperation, AgentArnOperation, AgentAliasArnAPI Level Throttling for a given Operation across all agents and all model IDs and for a given operation, given agent, given model ID.
InvocationServerErrorsOperationOperation, AgentArnOperation, AgentAliasArnInvocationServerErrors for a given Operation across all agents and all model IDs, for a given operation, given agent, across all model IDs and for a given operation, given agent, given model ID.
InvocationClientErrorsOperationOperation, AgentArnOperation, AgentAliasArnInvocationClientErrors for a given Operation across all agents and all model IDs, for a given operation, given agent, across all model IDs and for a given operation, given agent, given model ID.
ModelLatencyOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeModel invocation latency for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.
ModelInvocationCountOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeModel invocation count for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.
ModelInvocationThrottlesOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeModel invocation throttles for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.
ModelInvocationClientErrorsOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeModel invocation client errors for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.
ModelInvocationServerErrorsOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeModel invocation server errors for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.
InputTokenCountOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeInput token count for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.
OutputTokenCountOperationOperation, AgentArnOperation, AgentArn, PromptTypeOperation, AgentAliasArnOperation, AgentAliasArn, PromptTypeOutput token count for a given operation across all agents and all model IDs, for a given operation across all model IDs, all prompts and a given agent ARN and for a given operation across all models for a given agent ARN and given promptType.

The following metrics are calculated on basis of the AWS Model ID:

AWS metrics based on Model ID
Metric nameCalculated byDescription
TotalTimeOperation, ModelIdAPI total processing time for a given Operation and a specific model ID (across all agents).
ModelLatencyOperation, ModelIdOperation, ModelId, PromptTypeModel invocation latency for a given operation across all agents for a given model ID and Model invocation latency for a given operation, given modelID, given promptType across all agents.
ModelInvocationCountOperation, ModelIdOperation, ModelId, PromptTypeModel invocation count for a given operation across all agents for a given model ID and Model invocation count for a given operation, given modelID, given promptType across all agents.
ModelInvocationThrottlesOperation, ModelIdOperation, ModelId, PromptTypeModel invocation throttles for a given operation across all agents for a given model ID and Model invocation throttles for a given operation, given modelID, given promptType across all agents.
ModelInvocationClientErrorsOperation, ModelIdOperation, ModelId, PromptTypeModel invocation client errors for a given operation across all agents for a given model ID and Model invocation client errors for a given operation, given modelID, given promptType across all agents.
ModelInvocationServerErrorsOperation, ModelIdOperation, ModelId, PromptTypeModel invocation server errors for a given operation across all agents for a given model ID and Model invocation server errors for a given operation, given modelID, given promptType across all agents.
InputTokenCountOperation, ModelIdOperation, ModelId, PromptTypeInput Token Count for a given operation across all agents for a given model ID and Input token count for a given operation, given modelID, given promptType across all agents.
OutputTokenCountOperation, ModelIdOperation, ModelId, PromptTypeOutput token for a given operation across all agents for a given model ID and Output Token Count for a given operation, given modelID, given promptType across all agents.
On this Page