Skip to content

CongressionalInsights/theworkshop

TheWorkshop

Latest Release License: MIT

TheWorkshop Open Source Edition is a skill for Codex and Claude Code that runs non-coding work in a structured, auditable way.

It turns ambiguous requests into a living execution workflow:

Project -> Workstreams -> Jobs

with explicit gates, orchestration, monitoring, lessons learned, and spend visibility.

Plain-English Definition

When this repo says "project OS", it means:

  • a repeatable workflow system for the agent
  • not an operating system
  • implemented as a skill the agent runs inside Codex/Claude Code

Systems Architecture

The diagram below shows TheWorkshop's control plane (planning, gating, orchestration) and execution plane (work execution, outputs, monitoring).

TheWorkshop Systems Architecture

What It Is

  • A Codex/Claude Code skill, not a standalone app
  • A structured runtime for non-coding projects
  • Agreement-gated before execution starts
  • Truth-gated and reward-gated before completion claims
  • Parallel-orchestration aware (sub-agents when independent jobs exist)
  • Dashboard-first monitoring with token/spend telemetry

What It Is Not

  • A replacement for human strategic ownership
  • A generic code framework or web app product
  • A system that marks work complete on artifact presence alone

Core Model

  • Project: top-level outcome and success definition
  • Workstream: coherent thread in support of project goal
  • Job (Work Item): smallest executable/verifiable unit
  • Wave (optional): timeboxed grouping across workstreams

Completion promises are explicit:

  • <promise>{ID}-DONE</promise>

Gate Model

A job can only transition to done when all gates pass:

  1. Agreement gate (scope accepted before execution)
  2. Dependency/freshness gate (inputs are current)
  3. TruthGate (verification of correctness)
  4. Reward gate (meets reward_target)

Execution quality defaults:

  • job_start.py auto-applies ranked lessons into # Relevant Lessons Learned (override: --no-apply-lessons).
  • plan_check.py warns on weak placeholder content for planned jobs and hard-fails weak content for in_progress/done jobs.

Install

# One command from the repo root
git clone https://github.com/CongressionalInsights/theworkshop.git
mkdir -p "$CODEX_HOME/skills"
cp -R theworkshop "$CODEX_HOME/skills/theworkshop"

Typical destination:

  • $CODEX_HOME/skills/theworkshop
  • usually ~/.codex/skills/theworkshop

To update later:

cd "$CODEX_HOME/skills/theworkshop" && git pull origin main

Reproducible Quick Start

# create project
python3 scripts/project_new.py --name "Workshop Demo"

# add workstream + job
python3 scripts/workstream_add.py --project /path/to/project --title "Research"
python3 scripts/job_add.py --project /path/to/project --workstream WS-YYYYMMDD-001 --title "Draft options memo"
python3 scripts/job_add.py --project /path/to/project --workstream WS-YYYYMMDD-001 --title "Attribution sweep" --job-profile investigation_attribution
python3 scripts/job_add.py --project /path/to/project --workstream WS-YYYYMMDD-001 --title "Entity resolution" --job-profile identity_resolution
python3 scripts/discuss.py --project /path/to/project --work-item-id WI-YYYYMMDD-001 --decision "Use concise format" --required --no-interactive

# validate and orchestrate
python3 scripts/plan_check.py --project /path/to/project
python3 scripts/schema_validate.py --project /path/to/project
python3 scripts/optimize_plan.py --project /path/to/project
python3 scripts/orchestrate_plan.py --project /path/to/project
python3 scripts/dispatch_orchestration.py --project /path/to/project --dry-run
python3 scripts/council_plan.py --project /path/to/project --dry-run

# execute one job
python3 scripts/job_start.py --project /path/to/project --work-item-id WI-YYYYMMDD-001
python3 scripts/job_start.py --project /path/to/project --work-item-id WI-YYYYMMDD-001 --lessons-limit 5 --lessons-include-global
python3 scripts/verify_work.py --project /path/to/project --work-item-id WI-YYYYMMDD-001
python3 scripts/job_complete.py --project /path/to/project --work-item-id WI-YYYYMMDD-001 --cascade

# optional utility lanes
python3 scripts/health.py --project /path/to/project --repair
python3 scripts/quick.py --project /path/to/project --title "One-off patch" --command "echo done"
python3 scripts/dashboard_server.py --project /path/to/project --open

Expected core outputs:

  • outputs/dashboard.html
  • outputs/dashboard.json
  • outputs/dashboard.md
  • outputs/<date>-task-tracker.csv
  • logs/execution.jsonl
  • artifacts/truth-report.json
  • notes/context/<WS-or-WI>-CONTEXT.md
  • outputs/uat/<run-id>-UAT.md
  • outputs/uat/<run-id>-UAT.json
  • outputs/health.json
  • quick/<id>-<slug>/plan.md
  • quick/<id>-<slug>/summary.md

Monitoring + Spend Semantics

  • Dashboard auto-opens best-effort at execution start (unless disabled)
  • Auto-refresh supports stale detection and pause/resume
  • Optional local live transport: python3 scripts/dashboard_server.py --project /path/to/project
    • serves dashboard.html over http://127.0.0.1:*
    • publishes /events SSE updates so the page can switch from poll mode to live mode
  • Cost display is billing-aware:
    • subscription_auth: billed cost shown as $0 marginal, API-equivalent shown secondarily
    • metered_api: billed cost from exact telemetry when available
    • unknown: estimate-first fallback

Image Generation Path

Use work-item scoped image generation:

python3 scripts/imagegen_job.py --project /path/to/project --work-item-id WI-YYYYMMDD-002
python3 scripts/imagegen_job.py --project /path/to/project --work-item-id WI-YYYYMMDD-002 --credential-provider env
python3 scripts/imagegen_job.py --project /path/to/project --work-item-id WI-YYYYMMDD-002 --credential-provider keychain --approve ttl:1h

Set one provider before first run:

export THEWORKSHOP_IMAGEGEN_API_KEY=...

Compatibility for existing local setups:

export OPENAI_API_KEY=...

Optional legacy keychain flow:

export THEWORKSHOP_IMAGEGEN_CREDENTIAL_SOURCE=keychain
export THEWORKSHOP_KEYCHAIN_SERVICE=OPENAI_KEY

The apple-keychain skill remains optional and cross-platform fallback behavior is env-first.

Reliability Checks

python3 scripts/doctor.py
cd scripts && for t in *_test.py; do python3 "$t"; done

One-Command Install

./scripts/install_skill.sh --force

Use --link for a symlinked dev install.

Open-Source Workflow

Roadmap

Now

  • Stable v0.1.0 baseline for Project -> Workstreams -> Jobs control plane
  • TruthGate + stale invalidation + orchestration artifacts
  • Billing-aware spend in dashboard

Next

  • More robust synthetic scenario suite for document-quality outcomes
  • Additional dashboard drilldowns for truth/reward failure analysis
  • GitHub mirror ergonomics and dry-run diagnostics

Later

  • Optional docs site for deeper operators manual
  • Broader template library for non-coding domains
  • Extended export/report bundles for stakeholder handoff

Repository Layout

theworkshop/
  README.md
  SKILL.md
  CHANGELOG.md
  scripts/
  references/
  examples/
  docs/assets/
  .github/

License

MIT. See LICENSE.

About

Codex and Claude Code skill for structured non-coding work with TruthGate, orchestration, monitoring, and spend visibility.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors