Q: 17
A regulated claims agent must meet an accuracy SLO and operate under a monthly credits
cap, and it cannot use public internet grounding. The team has a daily labeled evaluation
set approved by compliance, and changes must be validated across dev/test/prod with no
downtime. Leadership wants one primary “health metric” that will catch early regressions
before callers notice.
What is the best primary metric to monitor under these constraints?
Options
Discussion
No comments yet. Be the first to comment.
Be respectful. No spam.