Skip to main content

AI/ML / Multi Agent Refarch / Controls / DEV

Human Feedback Loop for AI Systems

CCC.MARefArc.CN19 · DET

Capture human feedback on agent outputs through the Feedback Engine and Human Supervision capabilities and feed it into evaluation and improvement of agents and models.

Related Capabilities

IDTitleDescription
CCC.MARefArc.CP14Approved-model registry and lifecycleCatalog of approved models with metadata, version information, configuration parameters, and usage constraints, ensuring agents access only models meeting organizational, regulatory, and security standards.
CCC.MARefArc.CP12Authoritative knowledge source basesInternal and external repositories of structured data, unstructured documents, and graph-based representations that provide authoritative information for grounding.
CCC.MARefArc.CP20Feedback engineCollects and aggregates structured and unstructured feedback from users, evaluators, and automated systems, including correctness assessments, preference signals, and quality ratings, to inform system improvement.
CCC.MARefArc.CP21Human supervision and oversightMechanisms for human reviewers to inspect, approve, correct, or override agent outputs, supporting human-in-the-loop and human-over-the-loop workflows for sensitive or high-impact tasks.
CCC.MARefArc.CP15LLM inference gateway routingValidates inference requests and routes each to the correct model instance, abstracting model hosting behind a consistent interface.
CCC.MARefArc.CP13Vector-based semantic retrievalVector databases providing semantic search and grounding so agents can find relevant information from large text corpora.
CCC.MARefArc.CP16Model-interaction zero-trust guardrailsEnforces authentication and authorization for every inference request and applies input validation against prompt injection, output filtering and redaction, access control, rate limits, and cost management before and after model execution.
CCC.MARefArc.CP22Runtime protectionMonitors agent actions and model outputs during execution to detect unsafe, non-compliant, or anomalous behavior, enforcing constraints, blocking disallowed actions, or triggering escalation.
CCC.MARefArc.CP02Human-in-the-loop output reviewApplication-embedded controls that allow users to review, approve, or modify agent outputs before they are executed or shared.

Related Threats

IDTitleDescription
CCC.MARefArc.TH23Discriminatory outputs from biasBiased training data, architectural and feature choices, proxy variables such as postal codes, and uncorrected feedback loops cause systematically discriminatory outcomes against protected groups, with legal and reputational exposure.
CCC.MARefArc.TH19Silent model version, prompt, and deployment driftProviders silently retrain, re-prompt, or re-architect models, or change deployment and API defaults, shifting behaviour even when inputs are unchanged; without version pinning in the model registry this breaks reproducibility and validated behaviour.
CCC.MARefArc.TH17Non-deterministic and non-reproducible outputsProbabilistic sampling, internal-state variation, context sensitivity, and decoding parameters cause identical inputs to yield different outputs across runs, undermining testing, reproducibility, and reliable evaluation.
CCC.MARefArc.TH18RAG grounding failuresEven with retrieval, responses may contradict retrieved documents, drop caveats truncated by the context window, fill gaps with incorrect general knowledge, exceed authorized advisory scope, or adopt an inappropriate tone or certainty for the domain.
CCC.MARefArc.TH15Reputational harm from offensive or misleading outputsThe system generates offensive, misleading, or inappropriate outputs, or is manipulated into doing so, that are attributed to the organization, with reputational and regulatory impact when output filtering and human review are insufficient.

Assessment Requirements

IDTextApplicability
CCC.MARefArc.CN19.AR01The system MUST provide a mechanism for humans to rate, correct, or reject agent outputs, and MUST capture that feedback in the Feedback Engine.tlp-clear, tlp-green, tlp-amber, tlp-red
CCC.MARefArc.CN19.AR02Captured feedback MUST be reviewable and MUST inform retraining, prompt, or configuration changes.tlp-clear, tlp-green, tlp-amber, tlp-red

Guideline Mappings

FrameworkIDRemarks
finos-airAIR-DET-011