1. Oracle Cloud Infrastructure Documentation
"Generative AI - Pricing": This official pricing guide explicitly states that the service is billed based on the number of input and output tokens. This directly links the "tokens" metric to the "financial cost of running the agent".
Source: Oracle Cloud Infrastructure Official Documentation.
Section: Generative AI > Pricing.
2. Oracle Digital Assistant Documentation
"The Insights User Interface - Conversation-Quality Metrics": This document details the built-in analytics for Oracle Digital Assistant, which include "Average Skill Response Time (ms)". This metric is a direct measurement of latency and is used to confirm that the "agent response wait time is acceptable".
Source: Oracle Help Center, Oracle Digital Assistant Documentation.
Section: Use > Monitor and Analyze Your Digital Assistant > The Insights User Interface > Conversation-Quality Metrics.
3. Oracle Digital Assistant Documentation
"Use the Invoke Large Language Model Component": This section explains how the digital assistant integrates with LLMs. The interaction described is the process that consumes tokens and incurs latency. Testing this component involves evaluating its performance (latency) and resource consumption (tokens), which translates to cost.
Source: Oracle Help Center, Oracle Digital Assistant Documentation.
Section: Develop > Integrate with Large Language Models > Use the Invoke Large Language Model Component.
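
The relationship between the two metrics cited above (tokens driving cost, response time indicating latency) can be illustrated with a minimal Python sketch. It is not an Oracle API: the `AgentCall` record, the function names, the per-1,000-token prices, and the latency threshold are all hypothetical placeholders chosen for illustration, and real rates and billing units should be taken from the official pricing page.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class AgentCall:
    """One recorded agent interaction (hypothetical record, not an Oracle type)."""
    input_tokens: int
    output_tokens: int
    latency_ms: float


def estimate_cost(calls: Sequence[AgentCall],
                  price_in_per_1k: float,
                  price_out_per_1k: float) -> float:
    """Sum token-based cost over all calls.

    Prices are assumed to be quoted per 1,000 tokens; this unit is an
    assumption for the sketch, not a statement about actual billing.
    """
    return sum(
        c.input_tokens / 1000 * price_in_per_1k
        + c.output_tokens / 1000 * price_out_per_1k
        for c in calls
    )


def average_latency_ms(calls: Sequence[AgentCall]) -> float:
    """Mean response time in milliseconds across the recorded calls."""
    if not calls:
        raise ValueError("at least one call is required")
    return sum(c.latency_ms for c in calls) / len(calls)


def latency_acceptable(calls: Sequence[AgentCall], threshold_ms: float) -> bool:
    """True when the mean response time is at or below the chosen threshold."""
    return average_latency_ms(calls) <= threshold_ms
```

In a test run, the token counts and response times would come from whatever logging or analytics the deployment exposes. The sketch only shows that tokens map to a cost figure and that response time is compared against a threshold the team chooses.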