Skip to main content

14 docs tagged with "ai-intelligence"

View all tags

Eval Plane ①: Input

How to evaluate the Input plane — parsing, intent, injection resistance, and PII handling before inference begins.

Eval Plane ②: Data

How to evaluate the Data plane — source freshness, lineage, access boundaries, and factual correctness of underlying knowledge.

Eval Plane ③: Context

How to evaluate the Context plane — retrieval precision, ranking, scope, packing, and abstention when evidence is thin.

Eval Plane ④: Reasoning

How to evaluate the Reasoning plane — faithfulness to context, conclusion quality, tool selection, and multi-step logic.

Eval Plane ⑤: Tool

How to evaluate the Tool plane — selection, arguments, idempotency, error handling, and schema compliance for agent tool calls.

Eval Plane ⑥: Memory

How to evaluate the Memory plane — session scope, TTL, consistency, and cross-session leakage in agent and copilot systems.

Eval Plane ⑦: Action

How to evaluate the Action plane — policy enforcement, authorization, side effects, and auditability before irreversible operations execute.

Eval Plane ⑧: Outcome

How to evaluate the Outcome plane — end-user task success, clarity, usefulness, and trust in the final delivered response.