Evaluator

Evaluator is the release confidence layer.

It tracks important user journeys and shows whether the project still satisfies the flows that matter before work is considered ready to ship.

Why It Matters

Task completion is not the same as product readiness.

An agent can finish a local task while a critical journey remains broken. Evaluator exists to keep delivery focused on outcomes:

This turns "tests passed" into a stronger question: did the user journey pass?

Evaluator is designed around journeys, not files.

Examples:

Each journey can have checkpoints, latest run status, evidence, and human review state.

Evaluator should not replace the Board. It complements it:

Together they make agentic development observable from plan to release.

Need	Entry point
Inspect delivery health	`sinaris hub`
Review blocked journeys	Evaluator view
Connect evaluation to implementation	Board + Activity
Understand release confidence	Plan + Evaluator