Runtime protection

CCC.MARefArc.CP22

Monitors agent actions and model outputs during execution to detect unsafe, non-compliant, or anomalous behavior, and responds by enforcing constraints, blocking disallowed actions, or triggering escalation.
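
As a rough illustration of the capability described above, the following minimal Python sketch checks an agent action and a model output against a policy and returns one of three verdicts: allow, block, or escalate. Every name in it (`Verdict`, `AgentAction`, `check_action`, `check_output`, the tool names, and the regular expression) is hypothetical and chosen for illustration; the catalog does not prescribe an implementation, and a real deployment would load its policy from governed configuration rather than hard-coded sets.

```python
import re
from dataclasses import dataclass
from enum import Enum


class Verdict(Enum):
    ALLOW = "allow"
    BLOCK = "block"
    ESCALATE = "escalate"


@dataclass(frozen=True)
class AgentAction:
    tool: str      # hypothetical name of the tool the agent wants to call
    argument: str  # hypothetical free-text argument passed to the tool


# Illustrative policy only (assumption): not taken from the catalog.
ALLOWED_TOOLS = {"search", "summarize"}
ESCALATE_TOOLS = {"send_email"}  # permitted only after human review
BLOCKED_OUTPUT_PATTERNS = [
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),  # text shaped like a US SSN
]


def check_action(action: AgentAction) -> Verdict:
    """Decide whether an agent action may run before it is executed."""
    if action.tool in ALLOWED_TOOLS:
        return Verdict.ALLOW
    if action.tool in ESCALATE_TOOLS:
        return Verdict.ESCALATE
    # Default-deny: any tool not explicitly listed is blocked.
    return Verdict.BLOCK


def check_output(text: str) -> Verdict:
    """Decide whether a model output may be released to the caller."""
    if any(pattern.search(text) for pattern in BLOCKED_OUTPUT_PATTERNS):
        return Verdict.BLOCK
    return Verdict.ALLOW
```

The default-deny branch in `check_action` is a design choice of this sketch: unknown tools are blocked rather than allowed, so a newly added tool has no effect until the policy is updated to permit it.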

Related Threats

| ID | Title | Description |
| --- | --- | --- |
| CCC.MARefArc.TH15 | Reputational harm from offensive or misleading outputs | The system generates offensive, misleading, or inappropriate outputs, or is manipulated into doing so, that are attributed to the organization, with reputational and regulatory impact when output filtering and human review are insufficient. |
| CCC.MARefArc.TH16 | Confident hallucination and fabricated facts | Lacking ground truth and faced with ambiguous prompts or helpfulness-biased tuning, the model fabricates plausible but false facts, figures, or citations, presented with high fluency that makes errors hard to catch and likely to be acted upon. |