AI Utilization, Projects, and Reliability Scorecard

Source: live a02 evidence under /opt/veritize-docker/ops. Window: last 7-14 days.

Scores are now tied to AGE evidence: check-in, WO/APAR/PTF/VIRA/checkout/consensus, rejects, holds, false-complete signals, alerts, and pickup timing.

Private dashboard path; no HL private endpoint data is shown here.

Assigned Work
CC: 39
SC: 36
SGPT: 23
SGrok: 22
SG: 14

Outputs / Evidence
SC: 130
CGPT: 41
SGPT: 35
SGrok: 32
SG: 26

Reliability Scorecard

| Agent | Score | Grade | Basis | Remediation | Evidence |
| --- | --- | --- | --- | --- | --- |
| SGPT | 92 | A- | Fast pickup on WO/PTF reviews; reliable relay; one R1 rejection found. | Keep on implementation/risk review. Require concise evidence links in every answer. | PTF-317 review, KB rejection |
| VIRA | 90 | A- | Consistent gate/contact reports; low scope, low breakage risk. | Keep as gate/report lane; do not overload with execution work. | VIRA report |
| CGPT | 88 | B+ | High completion volume and deployment records; current operator lane. Risk is direct mutation velocity. | Continue requiring backup, smoke test, and final evidence file for every deployment. | AI wakeup, HL dashboard |
| SG | 84 | B | Strong risk detection; can over-block when dispatch wording is broad. Good for security review. | Give SG exact review boundaries and explicit no-probe/no-mutation scope. | SG wakeup, Sales Ops arch |
| CC | 80 | B- | Highest assigned-work count, low output count. Coordination lane, not evidence producer. Found SC pickup broken. | Use as dispatcher/sequencer; assign output ownership to SC/SGPT/SG/SGrok. | SC pickup broken, control plane fix |
| SC | 76 | C+ | Highest output and evidence volume, but repeated session/watcher alerts, one dispatch pickup broken finding, and HOLD records. | Reduce workload. Add queue watchdog, pickup SLA, and auto-escalate stale dispatches. | watcher alert, session alert, HOLD |
| SGrok | 72 | C | Useful for UX/value/messaging, but rejects generic ops/status dispatches under current prompt and had an R2 rejection. | Retask only for press, UX, market narrative, and value-filter work or update its system prompt. | R2 reject, wakeup rejection |

The scoring formula follows the AGE control model: each agent starts from a base of 100, with penalties for REJECT, VIOLATION, FALSE/FALSE-COMPLETE, BROKEN pickup, ALERT, HOLD, and OVERDUE evidence. Credits come from check-in/checkout evidence, timely pickup, usable output, and successful remediation. “Hallucination/lying” is counted only where a reject, violation, false, or broken-pickup file exists, not by subjective judgment.
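The formula above can be sketched roughly as follows. The source names the penalty and credit categories and the base of 100, but not the weights, so every number in PENALTIES and CREDITS is a placeholder assumption, as are the function name, the event-type strings, and the clamping to 0-100.

```python
from collections import Counter

BASE_SCORE = 100  # stated in the source

# Hypothetical weights: the source lists these categories but not their values.
PENALTIES = {
    "REJECT": 5,
    "VIOLATION": 10,
    "FALSE_COMPLETE": 10,
    "BROKEN_PICKUP": 8,
    "ALERT": 3,
    "HOLD": 2,
    "OVERDUE": 4,
}
CREDITS = {
    "CHECKIN_CHECKOUT": 1,
    "TIMELY_PICKUP": 1,
    "USABLE_OUTPUT": 1,
    "REMEDIATION": 2,
}


def score_agent(events):
    """Score one agent from a list of evidence-type strings.

    Each string stands for one evidence file on record. Unknown types are
    ignored. Clamping the result to [0, 100] is an assumption.
    """
    counts = Counter(events)
    penalty = sum(PENALTIES[k] * n for k, n in counts.items() if k in PENALTIES)
    credit = sum(CREDITS[k] * n for k, n in counts.items() if k in CREDITS)
    return max(0, min(100, BASE_SCORE - penalty + credit))


# Made-up example: one reject, two alerts, one timely pickup, one usable output.
example = ["REJECT", "ALERT", "ALERT", "TIMELY_PICKUP", "USABLE_OUTPUT"]
print(score_agent(example))  # 100 - (5 + 3 + 3) + (1 + 1) = 91
```

Because the input is a list of evidence types backed by files, a subjective impression with no file behind it has no way to lower a score in this sketch.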

Current 14-Day vs Cumulative Totals

| Agent | 14D Assigned | 14D Outputs | 14D Score | Total Assigned | Total Outputs | Total Score | Top Total Projects |
| --- | --- | --- | --- | --- | --- | --- | --- |
| SC | 62 | 226 | 67 D | 150 | 124 | 50 D | DataCert, WK/MIUSA, KB Governance, AGE Recovery |
| SG | 15 | 41 | 90 A- | 34 | 153 | 94 A | KB Governance, Sales Ops, PTF-317, AGE Recovery |
| SGPT | 25 | 39 | 59 D | 128 | 321 | 59 D | KB Governance, PTF-317, WK/MIUSA, Sales Ops |
| SGrok | 23 | 33 | 67 D | 65 | 124 | 72 C | KB Governance, AGE Recovery, Sales Ops, PTF-317 |
| CC | 39 | 18 | 56 D | 47 | 54 | 60 D | DataCert, PTF-317, AGE Recovery, WK/MIUSA |
| VIRA | 0 | 15 | 83 B | 0 | 8 | 20 D | Gate/contact queue, WK/MIUSA references |
| CGPT | 0 | 47 | 96 A | 0 | 52 | 97 A | DataCert, WK/MIUSA, Bitvision, AGE Recovery |

Important: cumulative total score intentionally preserves all historical penalties, so long-running agents with old restart/session alerts can show low lifetime scores. For daily operations, use the 14-day score. For governance trend reports, use the total JSON and history JSONL.
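The difference between the 14-day and cumulative views can be illustrated with a small sketch over a history JSONL. The record layout is an assumption: the keys "agent", "ts", and "penalty" are hypothetical, the sample records are made up, and the real history file may be structured differently.

```python
import json
from datetime import datetime, timedelta, timezone


def count_penalties(jsonl_lines, now, window_days=14):
    """Return {agent: (recent_count, cumulative_count)} of penalty events.

    Assumes each line is a JSON object with the hypothetical keys
    "agent", "ts" (ISO 8601 with UTC offset), and "penalty" (bool).
    """
    cutoff = now - timedelta(days=window_days)
    totals = {}
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines in the history file
        rec = json.loads(line)
        if not rec.get("penalty"):
            continue
        recent, cumulative = totals.get(rec["agent"], (0, 0))
        cumulative += 1  # cumulative view keeps every historical penalty
        if datetime.fromisoformat(rec["ts"]) >= cutoff:
            recent += 1  # windowed view keeps only the last window_days
        totals[rec["agent"]] = (recent, cumulative)
    return totals


# Made-up sample history, not real evidence.
history = [
    '{"agent": "SC", "ts": "2026-03-01T09:00:00+00:00", "penalty": true}',
    '{"agent": "SC", "ts": "2026-05-10T09:00:00+00:00", "penalty": true}',
    '{"agent": "SG", "ts": "2026-05-12T09:00:00+00:00", "penalty": false}',
]
now = datetime(2026, 5, 17, tzinfo=timezone.utc)
print(count_penalties(history, now))  # {'SC': (1, 2)}
```

In this sample the old March penalty still counts against the cumulative figure but falls outside the 14-day window, which is the effect the note above describes.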

Current 14-day JSON · Cumulative total JSON · History JSONL

Projects Worked On

| Agent | Projects / Workstreams Found | Recent Evidence |
| --- | --- | --- |
| SC | DataCert, GemWallet, PTF-317, Sales Ops ERP probe, Platform health, APAR fixes, HL status sweep | DataCert APAR audit, status sweep |
| SG | Sales Ops architecture, PTF-317 reviews, KB system map, AGE governance, Security/risk | Sales Ops arch, PTF-317 confirm |
| SGPT | PTF-317 mail template, Sales Ops risk, KB governance, WO Austin Manager, NYC SSD reviews | PTF-317 review, Sales Ops risk |
| SGrok | Sales Ops UX, KB value filter, Scorecard UX, Social media lead engine, ADA compliance messaging | Sales Ops UX R3, scorecard UX |
| CC | Control plane, PTF sequencing, DataCert wallet UI, HL dashboard link, GemWallet queue | control plane, false wallet UI |
| CGPT | DataCert, HL dashboard Docker, Bitvision, WK/MIUSA, AI utilization report | utilization report, HL dashboard Docker |

Pickup-Time Examples

| Task | Agent | Approx Pickup/Response | Evidence |
| --- | --- | --- | --- |
| WO NYC SSD production review | SGPT | ~7 seconds | inbox / outbox |
| PTF-317 mail template review | SGPT | ~15 seconds | inbox / outbox |
| PTF-317 mail template review | SG | ~18 seconds | outbox |
| PTF-317 confirmation | SGrok | ~6 seconds | outbox |
| SC dispatch pickup control plane | CC/SC | Broken pickup found, then remediated | finding / fix |

Pickup times are inferred from file mtimes of matched inbox/outbox evidence. They are operational indicators, not API latency.
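A minimal sketch of that inference, assuming a pickup time is simply the outbox file's mtime minus the matched inbox file's mtime. The function name and file names are hypothetical, and the demo uses temporary files with explicitly set mtimes instead of real evidence paths.

```python
import os
import tempfile


def pickup_seconds(inbox_path, outbox_path):
    """Approximate pickup time as outbox mtime minus inbox mtime, in seconds."""
    return os.path.getmtime(outbox_path) - os.path.getmtime(inbox_path)


# Demo with temporary files whose mtimes are set 7 seconds apart.
with tempfile.TemporaryDirectory() as tmp:
    inbox = os.path.join(tmp, "inbox_task.md")
    outbox = os.path.join(tmp, "outbox_task.md")
    for path in (inbox, outbox):
        with open(path, "w") as fh:
            fh.write("placeholder\n")
    os.utime(inbox, (1_000_000_000, 1_000_000_000))   # (atime, mtime)
    os.utime(outbox, (1_000_000_007, 1_000_000_007))
    demo_delta = pickup_seconds(inbox, outbox)

print(f"~{demo_delta:.0f} seconds")
```

Since mtimes change whenever a file is rewritten or copied without preserving timestamps, a value computed this way is only as trustworthy as the file handling around it.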


Top Handoffs
SC -> CC: 83
SC -> SGPT: 40
SGPT -> SC: 31
SGrok -> SC: 28
CC -> SC: 27
SC -> VIRA: 27
SGrok -> CC: 26

Evidence Index

Primary data links are copied into this private dashboard under /evidence/. Full raw inventory remains on a02 at /tmp/ai_utilization_inventory_2026_05_17.json.

Markdown utilization report · Raw utilization JSON · 14-day reliability file index

AGE mandatory directive · AGE briefing · AGE recovery WO · AI team AGE refresh