Documentation Index
Fetch the complete documentation index at: https://na-36-docs-v2.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Research Skill Workflow
This is the canonical operator runbook for the experimental page-content research workflow that landed indocs-v2-dev.
Use this page for:
- operational usage
- source-of-truth boundaries
- readiness status
- maintenance and improvement workflow
tests/README.md as the canonical narrative source.
What Is Canonical
Canonical sources:- skill behavior:
ai-tools/ai-skills/templates/*.template.md - fact storage:
workspace/research/claims/ - adjudication ledger:
workspace/research/adjudication/page-content-research-outcomes.json - registry validation:
operations/scripts/validators/content/veracity/docs-fact-registry.js - manual fact runner:
operations/scripts/audits/content/veracity/docs-page-research.js - PR advisory runner:
operations/scripts/dispatch/content/veracity/docs-page-research-pr-report.js - packet runner:
operations/scripts/dispatch/content/veracity/docs-research-packet.js - adjudication workflow:
operations/scripts/audits/content/veracity/docs-research-adjudication.js - research-to-plan template:
docs-guide/tooling/research-to-implementation-plan-template.md - local/manual PR prep integration:
operations/scripts/dispatch/ai/codex/create-codex-pr.js --advisory-research - packet planning template:
docs-guide/tooling/research-review-packet-plan-template.md - forward plan:
workspace/plan/future/page-content-research-trust-roadmap.md
- local installed Codex skills under
$CODEX_HOME/skills - saved advisory reports and validation artifacts
- rollout evidence and exploratory pilots under
workspace/plan/repo-ops-reports/
- route-centric claim-ledger advisory helpers are retained only as legacy comparison tooling and are not the active PR advisory path
Workflow Model
The workflow is claim-led rather than route-led. It is responsible for:- extracting material factual claims
- checking evidence sources
- detecting contradictions across related pages
- classifying claims by confidence and freshness risk
- producing a propagation queue for dependent pages
- MDX syntax validation
- style-guide compliance
- link and import integrity
- generic navigation cleanup
Readiness Status
Current status:- Codex-ready: yes
- Cross-agent-ready: portable with minor work
- Operating mode: experimental and advisory-first
- Codex skills can sync from the canonical template bundle immediately.
- The public and internal docs now explain how to use the workflow.
- Cross-agent portability exists structurally, but broader packaging and operator guidance still need hardening before claiming equal readiness across all agents.
Operator Commands
Validate the registry:- the request covers a full nav section or several logical tranches
- the findings need reusable packet artifacts for later fix execution
- you want page-run and PR-run views preserved together across a larger scope
- the request is limited to one page or one tight claim family
- a packet root would add more operational overhead than value
- the next action is immediate page editing rather than section-wide reporting
- the research output is complete but the fixes span content, registry, and runner behavior
- another agent needs a decision-complete implementation plan before execution
- the next task is sequencing, not more source verification
docs-research-to-implementation-plan. It consumes page reports, PR advisory reports, or research packets and turns them into a planning-only implementation artifact.
Expected Outputs
Every substantive run should surface some combination of:- verified claims
- conflicted claims
- time-sensitive claims
- unresolved or historical-only claims
- cross-page contradictions
- propagation queue items
- explicit evidence sources
- trust summary counts
- if evidence is weak, treat the claim as unresolved
- if wording is stronger than the evidence, downgrade the wording
- if the same claim appears elsewhere, queue propagation work instead of fixing one page in isolation
Discovery Boundaries
The runner can now discover supporting evidence beyond explicitevidence_refs, but the ranking stays strict:
- active repo files and official pages remain the highest default sources for current-state claims
v1/**is a historical lineage lane, not a silent current-state override_contextData/**,_plans-and-research/**,_workspace/research/**, andv2/x-archived/**are context lanes only- GitHub discovery is strongest for implementation-status and support-status families
- DeepWiki is corroboration only and should not become primary evidence for current product truth
Trust Summary
Each report now includes a compact trust summary with:unresolved_claims: how many tracked claims still lack strong enough evidencecontradiction_groups: how many factual collisions the run found across reviewed pagesevidence_sources: how many evidence records were actually checkedexplicit_page_targets: how many propagation targets came from explicit registry ownership or dependenciesinferred_page_targets: how many propagation targets came from IA/path inference
- higher
contradiction_groupsusually means a real review problem, not report noise - higher
unresolved_claimsmeans the registry or evidence adapters still need work before trusting wording changes - higher
inferred_page_targetsis acceptable when path inference is covering current siblings, but it should not dominate stable high-confidence families - low
evidence_sourceson a broad review usually means claim-family coverage or source mapping is still too thin
Source-of-Truth Boundaries
Use this split consistently:- public contributor usage:
v2/resources/documentation-guide/ - internal operator workflow: this runbook in
docs-guide/frameworks/ - rollout/adoption record:
workspace/plan/repo-ops-reports/ - future hardening plan:
workspace/plan/future/ - executable behavior: scripts, templates, tests, and claim registries
- scripts and tests define runtime behavior
- template bundles define skill behavior
- this runbook defines operator workflow and readiness
- public documentation guide pages summarize contributor usage
Maintenance Workflow
When improving the research skill:- expand claim-family coverage in
workspace/research/claims/ - improve evidence matching and classification logic in the runner
- validate on real orchestrator and gateway page clusters
- run PR advisory on tracked factual docs pages
- update this runbook when the operator contract changes
Operator Review Rubric
Use this rubric when deciding whether a run was useful enough to trust:- useful:
- primary evidence is current and from the right source class
- contradiction groups are concrete and explainable
- propagation queue points at pages that really repeat or depend on the claim
- noisy:
- weak sources outrank stronger official or GitHub evidence
- contradiction groups collapse unrelated wording into one family
- propagation is mostly speculative sibling fan-out
- expand a claim family when:
- the same fact keeps recurring across active pages
- reviewers repeatedly need to verify the same current-state claim manually
- source classes and canonical ownership are clear enough to defend
- narrow a claim family when:
- wording overlap keeps producing false contradictions
- the claim is really style guidance, not factual truth
- evidence quality is too weak to classify reliably
Adjudication Workflow
Adjudicate runs when:- a report is used to make or block a real content decision
- a contradiction group looks noisy or unexpectedly broad
- a reviewer had to manually rediscover facts that should have been tracked
- a gateway status claim is being considered for stronger PR-time trust
true_positive: the report surfaced a real issue or useful current-state warningfalse_positive: the report surfaced a claim family that was not actually useful or was misleadingly groupedfalse_negative: the reviewer had to manually verify a factual claim that the system failed to trackneeds_split: one family is collapsing multiple concepts and needs to be dividedneeds_narrowing: the family exists but its matching or propagation logic is too broadneeds_more_sources: the family is valid but current source coverage is too weak
stable: repeated adjudications show the family is useful and low-noiseadvisory-only: keep reporting, but do not move toward stronger PR behavior yetneeds-split: separate mixed concepts before trusting the family furtherneeds-narrowing: reduce matching or inference breadth before trusting the family furtherneeds-more-sources: expand or improve source coverage before trusting the family further
Trust Tiers
Trust tiers are metadata only in the current phase:experimental: not enough adjudicated evidence yetadvisory: usable, but still too noisy or under-evidenced for stronger handlingadvisory-high-confidence: a narrow family with low noise and strong current source fitnot-eligible: outside the current trust-candidate slice
clearinghouse-public-readinessremote-signer-current-scopeprogramme-availabilitycommunity-signer-testing-surfacegateway-support-contact-channel
- widen the workflow back into generic navigation QA
- let
tests/README.mdbecome the primary narrative home again - treat exploratory reports as canonical instructions
Related Docs
- Public contributor page:
/v2/resources/documentation-guide/research-and-fact-checking - AI tools index:
/docs-guide/tooling/ai-tools - Source of truth policy:
/docs-guide/policies/source-of-truth-policy - Trust roadmap:
workspace/plan/future/page-content-research-trust-roadmap.md