Design Your AI Workflow
Part of: AI Workflow Framework
What This Is
Section titled “What This Is”The Design phase is where you decide how your workflow should be built — before you build it. You take the Workflow Requirements from the Deconstruct step (the what) and produce a Design Spec (the how) — the architectural blueprint Build consumes to generate skills, agents, and prompts.
The Workflow Requirements is the canonical source of truth. The Design Spec references it, never restates it. Build reads both together.
Design has three layers
Section titled “Design has three layers”Design walks through three layers of decisions that build on each other. Each layer is approved before the next begins, so strategic errors get caught before you’ve invested time in detailed specs:
| Layer | What it decides | Why it’s separate |
|---|---|---|
| Layer 1 — Architecture | Platform, mechanism (Prompt/Skill-Powered Prompt/Agent), autonomy level, packaging, model class, integration options | Strategic. Cheap to revisit. Wrong call here cascades everywhere. |
| Layer 2 — Decomposition | For each step (or capability domain), what AI building block delivers it: a new skill, an existing skill, an inline prompt block, an agent, or a human action | Structural. Decides what gets built and what gets reused. |
| Layer 3 — Component Blueprints | Field-level specs for each new skill and agent — name, description, inputs/outputs, decision logic, failure modes, tools, deployment | Detailed. Most expensive to redo. Build uses these to generate artifacts. |
The Design skill walks you through these layers in order, with a lightweight confirmation moment between each. The final Spec Approval Gate at the end is the only hard gate.
| What you’ll do | Confirm your platform, work through three layers of design decisions with lightweight confirmation at each handoff, and approve the final spec |
| What you’ll get | A Design Spec — three-layer architecture and component blueprints, with frontmatter, stable IDs, and a Self-Test Summary |
| Time | ~15–25 minutes |
Why This Matters
Section titled “Why This Matters”Not every workflow needs the same level of AI infrastructure. A weekly status report might need a single well-crafted prompt. A multi-department content pipeline might need specialized agents coordinating across stages. Choosing the wrong mechanism means either over-engineering (building agents when a prompt would do) or under-building (forcing a prompt to do agent-level work).
Design also makes the workflow portable. Because the Design Spec follows the agentskills.io standard and uses stable IDs for every component, the same spec can be built on Claude Code, Claude.ai, Cowork, Codex, ChatGPT, or Gemini CLI — with only the platform-specific bits (Packaging, Deployment Plan) adjusted at the handoff.
Layer 1 — Architecture
Section titled “Layer 1 — Architecture”Strategic decisions that shape everything downstream. You start here.
Architecture Approach
Section titled “Architecture Approach”| Approach | What it means | Build in |
|---|---|---|
| No-code | Build entirely in a platform’s UI — projects, workspace agents, gems, notebooks | Claude Projects, ChatGPT Workspace Agents, Gemini Gems, M365 Copilot |
| Code-first | Build with APIs and SDKs — programmatic model access, code-based agents, version-controlled workflows | Claude Agent SDK, OpenAI Agents SDK, Google ADK, LangChain, etc. |
The model analyzes your workflow and recommends an approach based on signals like integration needs, deployment scale, and developer experience. Most workflows start no-code and graduate to code-first when scale or integration demands it.
Platform decision
Section titled “Platform decision”You name your AI platform — at whatever level of specificity you have. “Claude Code,” “ChatGPT,” “Google Gemini,” “Claude” are all fine. The ecosystem is enough for Design decisions; the specific offering (Claude Code vs. Claude.ai vs. Cowork) is resolved when Build generates artifacts.
Other Architecture Decisions extracted from the Workflow Requirements
Section titled “Other Architecture Decisions extracted from the Workflow Requirements”Rather than walking through a checklist, the Design skill uses an extract-then-confirm approach: ask one question (platform), extract everything else from the Workflow Requirements, present the analysis for confirmation.
- Tool integrations — pulled from per-step Inputs, Context Needed, and the Context Inventory
- Trigger/schedule — pulled from the Metadata table; time-based triggers imply scheduled execution
- Context readiness flags — pulled from the Context Inventory’s AI Accessible column; flags items needing resolution before Build
Autonomy Assessment
Section titled “Autonomy Assessment”Where the whole workflow sits on the autonomy spectrum:
Human ———— Deterministic ———————— Guided ———————— Autonomous(human-performed) (fixed path) (bounded decisions) (context-driven path)| Level | Signals | Orchestration implications |
|---|---|---|
| Human | Step requires human judgment, creativity, or physical action; AI cannot perform | No AI artifact — captured as Human step in the Decomposition table |
| Deterministic | Steps execute in fixed order, no branching on quality, failure = stop or retry | Prompt or Skill-Powered Prompt likely sufficient |
| Guided | Bounded AI judgment at steps, human steers at checkpoints, mostly fixed sequence | Skill-Powered Prompt or Agent |
| Autonomous | Executor backtracks, re-invokes, adjusts on failure, checkpoints can redirect | Agent required |
Orchestration Mechanism
Section titled “Orchestration Mechanism”Based on the autonomy assessment and architecture decisions, the model recommends who drives the workflow:
| Mechanism | Description | Signals |
|---|---|---|
| Prompt | Human follows structured instructions step by step, all logic inline | Sequential steps, human provides inputs and makes decisions |
| Skill-Powered Prompt | Human invokes reusable skills in a defined sequence | Repeatable sub-routines, moderate complexity |
| Agent | Agent orchestrates the flow, invoking skills and making sequencing decisions | Tool use required, autonomous decisions, multi-step reasoning |
Plus an involvement mode:
| Mode | Description | Determined by |
|---|---|---|
| Augmented | Human in the loop — reviews, steers, or decides at key points during the run | Web/desktop deployment, no scheduled execution |
| Automated | AI runs solo — executes end-to-end without human involvement | Scheduled/unattended execution, CLI |
Packaging
Section titled “Packaging”How the artifacts ship together:
| Value | When to use |
|---|---|
| Plugin | Multiple related artifacts shipped together as a marketplace plugin (e.g., handsonai-plugins layout) |
| Standalone Skill | A single skill, uploaded directly (zip for Claude.ai, single SKILL.md for code-mode platforms, single skill for ChatGPT) |
| Workspace Agent | A ChatGPT Workspace Agent bundling orchestration + skills + tools (the current ChatGPT primitive; Custom GPTs are deprecated) |
| Loose Files | Individual files in a project directory, no distribution wrapper |
Model Recommendation
Section titled “Model Recommendation”A capability tier (reasoning-heavy / fast / vision) with per-step overrides if needed. The spec includes a per-platform mapping (Claude opus/sonnet/haiku, ChatGPT gpt-5/gpt-4o/gpt-4o-mini, Gemini 2.5-pro/2.5-flash) so Build knows which concrete model to use on the target platform.
Integration Options
Section titled “Integration Options”For each tool the workflow needs, the model researches available integration options — curated MCP servers, APIs, SDKs, and CLIs — with source URLs and trade-offs. The output is platform-agnostic; Build does the per-platform setup research.
End of Layer 1. Lightweight confirmation: “Architecture confirmed: [summary]. Moving to Decomposition. Confirm to proceed.”
Layer 2 — Decomposition
Section titled “Layer 2 — Decomposition”For each step (or capability domain), decide which AI building block delivers it.
Step-by-Step Decomposition
Section titled “Step-by-Step Decomposition”Steps are defined in the Workflow Requirements. This table adds the building-block classification and the concrete Build output for each:
| Column | Values |
|---|---|
| Step | Step ID from Workflow Requirements |
| Autonomy | Human / Deterministic / Guided / Autonomous |
| Orchestration | Prompt / Skill / Agent |
| Integration | Block + tool + use/build tag (e.g., “MCP: Notion (use)“) |
| Intelligence | Model class + context source IDs + memory flag |
| Build Output | One of: New skill: S1 / Use existing: [name] / New agent: A1 / Inline prompt → Workflow Requirements Step N / MCP server: [name] / Human (no artifact) |
| Human Gate? | Yes / No (from Workflow Requirements Human Gates) |
For outcome-driven workflows, this is replaced by a Capability Domain Mapping — capability domains are derived during Design (not present in the Workflow Requirements), each with integration needs, intelligence requirements, and Build Output.
Orchestrator Prompt Outline
Section titled “Orchestrator Prompt Outline”When mechanism is Prompt or Skill-Powered Prompt, the spec includes an Orchestrator Prompt Outline — the structural skeleton of the workflow’s main prompt. It names which step invokes which skill, where PAUSE points sit (from Human Gates), and what the user provides at each gate. Build expands the outline into the full orchestrator using Step Details from the Workflow Requirements.
Omitted for Agent mechanism — the agent itself is the orchestrator (see Agent Configuration in Layer 3).
Data Readiness Summary
Section titled “Data Readiness Summary”Context items from the Workflow Requirements’ Context Inventory flagged as Partial or No for AI Accessible — with the action needed to make each one accessible before Build runs.
Recommended Implementation Order
Section titled “Recommended Implementation Order”Quick Wins → Core → Future Enhancement. Within each tier, dependencies follow the Depends On field of each skill.
End of Layer 2. Lightweight confirmation: “Decomposition confirmed: [counts]. Moving to Component Blueprints. Confirm to proceed.”
Layer 3 — Component Blueprints
Section titled “Layer 3 — Component Blueprints”Field-level specs Build uses to generate each new skill and agent.
Skill Candidates (12 fields)
Section titled “Skill Candidates (12 fields)”For each step tagged New skill: SN:
| Field | Purpose |
|---|---|
| ID, Name, Description | Identity. Name must be lowercase-hyphenated, ≤64 chars. Description must start with “This skill should be used when…” and be ≤1024 chars — this is the verbatim text that goes into the SKILL.md frontmatter and drives auto-activation. |
| Purpose, Covers Steps/Domains | Internal context for spec readers |
| Inputs, Outputs | The skill’s contract |
| Decision Logic, Failure Modes | What the skill does and how it handles edge cases |
| Required Tools, Depends On | Runtime dependencies on tools and other skills |
| Stateful? | Memory building-block flag |
Agent Configuration (13 fields, mandatory when mechanism = Agent)
Section titled “Agent Configuration (13 fields, mandatory when mechanism = Agent)”For each step tagged New agent: AN:
| Field | Purpose |
|---|---|
| ID, Name, Description | Identity. Description must start with “Use this agent when…” and is the verbatim text for the agent file frontmatter. |
| Mission, Responsibilities, Output Format, Tone & Style, Constraints | The agent’s behavior — decomposed into structured sub-fields, not jammed into a single “Instructions” cell |
| Model, Memory Scope | Capability and persistence |
| Tools, Skills | Cross-references to Integration Options entries and Skill IDs |
| Trigger Examples | 2-3 structured examples (context → user message → expected behavior → invocation) Build uses verbatim to construct <example> blocks in the agent’s description |
Multi-Agent Configuration
Section titled “Multi-Agent Configuration”When more than one agent is defined, this section captures the orchestration pattern (Supervisor / Pipeline / Parallel / Network), the coordinator, the Handoff Contracts table (data passed between agents), and the Aggregation Strategy.
Prerequisites and Deployment Plan
Section titled “Prerequisites and Deployment Plan”Prerequisites lists platform setup, accounts, credentials, and plugin installs needed before the workflow can run. The Deployment Plan documents where each artifact lives and how it gets deployed — with a Packaging note explaining how artifacts ship together.
End of Layer 3. Final Spec Approval Gate — the only hard gate. Review the full spec; approve to move to Build.
Cross-Layer Sections
Section titled “Cross-Layer Sections”These sections sit outside the three layers:
- Evaluation Inputs — pointer to the Workflow Requirements’ Acceptance Criteria and Example Scenarios (not duplicated). Step 5 (Test) reads them from the Workflow Requirements directly.
- Deferred to Build — explicit list of decisions intentionally left for Build (specific platform offering, exact model version, integration setup specifics)
- Stakeholders (optional, organizational lens) — role swimlane and stakeholder details
- Self-Test Summary — populated by the Design skill after running the Build Skill Needs Checklist; each item ✓ (passed) or ⚠️ (issue described inline). Lets you see what was verified before approving.
How the Skill Works
Section titled “How the Skill Works”The three layers above are the conceptual structure of the Design Spec. In practice, the skill walks them in this chronological order:
- Load Workflow Requirements — Read the Workflow Requirements file from
outputs/. - Confirm understanding — Summarize the workflow and ask you to confirm.
- Architecture decisions (Layer 1) — Confirm platform (the one question), then extract tool integrations, trigger/schedule, and constraints from the Workflow Requirements and present a confirmation block.
- Autonomy assessment — Assess where the whole workflow sits on the autonomy spectrum (Deterministic, Guided, Autonomous).
- Orchestration mechanism — Recommend a mechanism (Prompt, Skill-Powered Prompt, or Agent) with an involvement mode (Augmented or Automated).
- Classify each step (Layer 2) — Per-step autonomy level, AI building blocks, tools, human review gates.
- Identify skill candidates — Steps tagged for skill creation with generation-ready detail.
- Agent configuration (Layer 3) — When applicable, generate a platform-agnostic agent blueprint.
- Generate Design Spec — Write the complete design document.
- Spec Approval Gate — Present the spec for approval. No artifacts are generated until you confirm.
How to Use This
Section titled “How to Use This”This step is facilitated by the design AI Workflow Framework skill. How you get it depends on your platform — see Set Up the Skills for installation.
Command: /handsonai:design (Claude Code) — or invoke by name on any other platform.
Platform compatibility: Claude Code ✓ | Claude.ai ✓ | Claude Cowork ✓ | ChatGPT ✓ | Gemini ✓ | M365 Copilot ✓ | Cursor / Codex / Antigravity ✓
Start with this prompt:
Design the AI workflow from my Workflow Requirements.Assess the autonomy level, recommend an orchestration mechanism, and map building blocks.Upload or paste your Workflow Requirements file ([workflow-name]-requirements.md) from the Deconstruct step. The skill runs the three layers in order, with lightweight confirmation between each, and produces a Design Spec.
Example prompts
Section titled “Example prompts”"Design the AI workflow from my Workflow Requirements"→ Reads the most recent Workflow Requirements, runs Design, produces the Design Spec for approval
"Design the expense-reporting workflow"→ Reads outputs/expense-reporting-requirements.md, recommends an orchestration mechanism, and generates the specPlan mode and the three layers
Section titled “Plan mode and the three layers”The Design skill works best in plan mode. The layered flow maps naturally onto how you’ll work with the model:
| Layer | Mode |
|---|---|
| L1 Architecture | Conversational — discuss platform, mechanism, packaging. Quick back-and-forth. |
| L2 Decomposition | The natural plan-mode entry point — start planning here. The model classifies each step and proposes Build Outputs. |
| L3 Component Blueprints | Stays in plan mode — detailed field-level spec generation. |
How to activate plan mode on your platform:
| Platform | How to activate plan mode |
|---|---|
| Claude Code | Press Shift+Tab twice, or type /plan |
| Cursor | Select “Plan” in the composer mode |
| Codex CLI | Run with the --plan flag |
| Other AI tools | Ask the model: “Switch to plan mode. Walk me through Layers 2 and 3 of the Design — classify each step, identify skill candidates, write the component blueprints.” |
After the model produces the spec, review and approve at the Spec Approval Gate before moving to Build.
What This Produces
Section titled “What This Produces”The Design Spec is organized into the three layers above, plus cross-layer sections. The spec opens with YAML frontmatter so Build can summarize it in one read.
Layer 1 — Architecture sections: Execution Pattern, Architecture Decisions (with Packaging), Autonomy Spectrum Summary, Integration Options (with Source URLs), Model Recommendation (with per-platform mapping).
Layer 2 — Decomposition sections: Step-by-Step Decomposition (or Capability Domain Mapping for outcome-driven), Orchestrator Prompt Outline (when mechanism is Prompt or Skill-Powered Prompt), Data Readiness Summary, Recommended Implementation Order.
Layer 3 — Component Blueprint sections: Skill Candidates (12 fields each), Agent Configuration (13 fields each; mandatory for outcome-driven), Multi-Agent Configuration (when applicable), Prerequisites, Deployment Plan.
Cross-layer sections: Evaluation Inputs (pointer), Deferred to Build, Stakeholders (optional), Self-Test Summary (verification results).
Build reads both the Design Spec and the Workflow Requirements together — the Design Spec is the architectural blueprint; the Workflow Requirements remains the canonical source for per-step content, acceptance criteria, and human gates.
Consuming the Design Spec
Section titled “Consuming the Design Spec”The Design Spec is structured for machine consumption — by the Build skill, by other AI agents, or by engineers building tooling around the spec.
Schema
Section titled “Schema”| Element | Format |
|---|---|
| Frontmatter | YAML. Required fields: workflow, requirements_file, spec_version, definition_type, mechanism, involvement, platform, platform_mode, packaging, counts |
| Stable IDs | S1, S2, … for skills; A1, A2, … for agents; C1, C2, … for context items (in the Workflow Requirements); E1, E2, … for example scenarios (in the Workflow Requirements) |
| Canonical vocabulary | Autonomy: Human / Deterministic / Guided / Autonomous. Mechanism: Prompt / Skill-Powered Prompt / Agent. Packaging: Plugin / Standalone Skill / Workspace Agent / Loose Files. Build Output: New skill: SN / Use existing: [name] / New agent: AN / Inline prompt → Workflow Requirements Step N / MCP server: [name] / Human (no artifact) |
| Section ordering | Layer 1 → Layer 2 → Layer 3 → Cross-layer. Section names within each layer are fixed (consumers can locate any section by name). |
| Self-Test Summary | Each Build Skill Needs Checklist item marked ✓ or ⚠️ with inline description. Lets consumers see what was verified. |
Consuming the spec standalone (without the Build skill)
Section titled “Consuming the spec standalone (without the Build skill)”A capable agent (Claude Code, Codex, ChatGPT with skill support, Gemini CLI) can build the artifacts from the Design Spec alone, given access to:
- The Workflow Requirements file — referenced from frontmatter
requirements_file: - The agentskills.io specification — the open standard skill format (agentskills.io/specification)
- The target platform’s agent format docs — Claude Code’s sub-agents docs, Codex’s subagents docs, or equivalent
For best results, run Build via the framework’s Build skill — it auto-resolves these via the platform registry and handles deployment.
Cross-platform portability
Section titled “Cross-platform portability”The Design Spec is portable across platforms because every major platform (Claude Code, Claude.ai, Cowork, Codex, ChatGPT, Gemini CLI) now follows the agentskills.io standard for skills. Three practical workflows:
- Same platform throughout (most common) — Design on Claude Code → Build on Claude Code. Platform in the spec matches Build’s runtime platform.
- Design once, Build elsewhere — Skill Candidates and Agent Configuration carry over unchanged because they’re agentskills.io-compatible. Update Architecture Decisions (Platform, Packaging) and Deployment Plan when moving. Example: design on Claude Code at home, build on Codex at work.
- Plan for a different target platform — Design on Claude Code targeting ChatGPT Workspace Agents (or any other platform). Build resolves the platform-specific translation at generation time.
agentskills.io is the portability layer. Every major platform converges on the standard, so Skill Candidates from one platform’s Design Spec build cleanly on another with minimal rewriting.
Related
Section titled “Related”- Previous step: Deconstruct Workflows (Step 2) — produces the Workflow Requirements that Design consumes
- Next step: Build the Workflow (Step 4) — generates platform artifacts from the Design Spec
- AI Workflow Framework — the full seven-step methodology
- Workflow Design Matrix — how autonomy and involvement combine into six workflow archetypes
- Workflow Architecture Patterns — implementation blueprints (prompt chaining, routing, orchestrator-workers, evaluator-optimizer, autonomous agents)
- Agents, Skills, Prompts