Skip to content

Design Your AI Workflow

Part of: AI Workflow Framework

The Design phase is where you decide how your workflow should be built — before you build it. You take the Workflow Requirements from the Deconstruct step (the what) and produce a Design Spec (the how) — the architectural blueprint Build consumes to generate skills, agents, and prompts.

The Workflow Requirements is the canonical source of truth. The Design Spec references it, never restates it. Build reads both together.

Design walks through three layers of decisions that build on each other. Each layer is approved before the next begins, so strategic errors get caught before you’ve invested time in detailed specs:

LayerWhat it decidesWhy it’s separate
Layer 1 — ArchitecturePlatform, mechanism (Prompt/Skill-Powered Prompt/Agent), autonomy level, packaging, model class, integration optionsStrategic. Cheap to revisit. Wrong call here cascades everywhere.
Layer 2 — DecompositionFor each step (or capability domain), what AI building block delivers it: a new skill, an existing skill, an inline prompt block, an agent, or a human actionStructural. Decides what gets built and what gets reused.
Layer 3 — Component BlueprintsField-level specs for each new skill and agent — name, description, inputs/outputs, decision logic, failure modes, tools, deploymentDetailed. Most expensive to redo. Build uses these to generate artifacts.

The Design skill walks you through these layers in order, with a lightweight confirmation moment between each. The final Spec Approval Gate at the end is the only hard gate.

What you’ll doConfirm your platform, work through three layers of design decisions with lightweight confirmation at each handoff, and approve the final spec
What you’ll getA Design Spec — three-layer architecture and component blueprints, with frontmatter, stable IDs, and a Self-Test Summary
Time~15–25 minutes

Not every workflow needs the same level of AI infrastructure. A weekly status report might need a single well-crafted prompt. A multi-department content pipeline might need specialized agents coordinating across stages. Choosing the wrong mechanism means either over-engineering (building agents when a prompt would do) or under-building (forcing a prompt to do agent-level work).

Design also makes the workflow portable. Because the Design Spec follows the agentskills.io standard and uses stable IDs for every component, the same spec can be built on Claude Code, Claude.ai, Cowork, Codex, ChatGPT, or Gemini CLI — with only the platform-specific bits (Packaging, Deployment Plan) adjusted at the handoff.


Strategic decisions that shape everything downstream. You start here.

ApproachWhat it meansBuild in
No-codeBuild entirely in a platform’s UI — projects, workspace agents, gems, notebooksClaude Projects, ChatGPT Workspace Agents, Gemini Gems, M365 Copilot
Code-firstBuild with APIs and SDKs — programmatic model access, code-based agents, version-controlled workflowsClaude Agent SDK, OpenAI Agents SDK, Google ADK, LangChain, etc.

The model analyzes your workflow and recommends an approach based on signals like integration needs, deployment scale, and developer experience. Most workflows start no-code and graduate to code-first when scale or integration demands it.

You name your AI platform — at whatever level of specificity you have. “Claude Code,” “ChatGPT,” “Google Gemini,” “Claude” are all fine. The ecosystem is enough for Design decisions; the specific offering (Claude Code vs. Claude.ai vs. Cowork) is resolved when Build generates artifacts.

Other Architecture Decisions extracted from the Workflow Requirements

Section titled “Other Architecture Decisions extracted from the Workflow Requirements”

Rather than walking through a checklist, the Design skill uses an extract-then-confirm approach: ask one question (platform), extract everything else from the Workflow Requirements, present the analysis for confirmation.

  • Tool integrations — pulled from per-step Inputs, Context Needed, and the Context Inventory
  • Trigger/schedule — pulled from the Metadata table; time-based triggers imply scheduled execution
  • Context readiness flags — pulled from the Context Inventory’s AI Accessible column; flags items needing resolution before Build

Where the whole workflow sits on the autonomy spectrum:

Human ———— Deterministic ———————— Guided ———————— Autonomous
(human-performed) (fixed path) (bounded decisions) (context-driven path)
LevelSignalsOrchestration implications
HumanStep requires human judgment, creativity, or physical action; AI cannot performNo AI artifact — captured as Human step in the Decomposition table
DeterministicSteps execute in fixed order, no branching on quality, failure = stop or retryPrompt or Skill-Powered Prompt likely sufficient
GuidedBounded AI judgment at steps, human steers at checkpoints, mostly fixed sequenceSkill-Powered Prompt or Agent
AutonomousExecutor backtracks, re-invokes, adjusts on failure, checkpoints can redirectAgent required

Based on the autonomy assessment and architecture decisions, the model recommends who drives the workflow:

MechanismDescriptionSignals
PromptHuman follows structured instructions step by step, all logic inlineSequential steps, human provides inputs and makes decisions
Skill-Powered PromptHuman invokes reusable skills in a defined sequenceRepeatable sub-routines, moderate complexity
AgentAgent orchestrates the flow, invoking skills and making sequencing decisionsTool use required, autonomous decisions, multi-step reasoning

Plus an involvement mode:

ModeDescriptionDetermined by
AugmentedHuman in the loop — reviews, steers, or decides at key points during the runWeb/desktop deployment, no scheduled execution
AutomatedAI runs solo — executes end-to-end without human involvementScheduled/unattended execution, CLI

How the artifacts ship together:

ValueWhen to use
PluginMultiple related artifacts shipped together as a marketplace plugin (e.g., handsonai-plugins layout)
Standalone SkillA single skill, uploaded directly (zip for Claude.ai, single SKILL.md for code-mode platforms, single skill for ChatGPT)
Workspace AgentA ChatGPT Workspace Agent bundling orchestration + skills + tools (the current ChatGPT primitive; Custom GPTs are deprecated)
Loose FilesIndividual files in a project directory, no distribution wrapper

A capability tier (reasoning-heavy / fast / vision) with per-step overrides if needed. The spec includes a per-platform mapping (Claude opus/sonnet/haiku, ChatGPT gpt-5/gpt-4o/gpt-4o-mini, Gemini 2.5-pro/2.5-flash) so Build knows which concrete model to use on the target platform.

For each tool the workflow needs, the model researches available integration options — curated MCP servers, APIs, SDKs, and CLIs — with source URLs and trade-offs. The output is platform-agnostic; Build does the per-platform setup research.

End of Layer 1. Lightweight confirmation: “Architecture confirmed: [summary]. Moving to Decomposition. Confirm to proceed.”


For each step (or capability domain), decide which AI building block delivers it.

Steps are defined in the Workflow Requirements. This table adds the building-block classification and the concrete Build output for each:

ColumnValues
StepStep ID from Workflow Requirements
AutonomyHuman / Deterministic / Guided / Autonomous
OrchestrationPrompt / Skill / Agent
IntegrationBlock + tool + use/build tag (e.g., “MCP: Notion (use)“)
IntelligenceModel class + context source IDs + memory flag
Build OutputOne of: New skill: S1 / Use existing: [name] / New agent: A1 / Inline prompt → Workflow Requirements Step N / MCP server: [name] / Human (no artifact)
Human Gate?Yes / No (from Workflow Requirements Human Gates)

For outcome-driven workflows, this is replaced by a Capability Domain Mapping — capability domains are derived during Design (not present in the Workflow Requirements), each with integration needs, intelligence requirements, and Build Output.

When mechanism is Prompt or Skill-Powered Prompt, the spec includes an Orchestrator Prompt Outline — the structural skeleton of the workflow’s main prompt. It names which step invokes which skill, where PAUSE points sit (from Human Gates), and what the user provides at each gate. Build expands the outline into the full orchestrator using Step Details from the Workflow Requirements.

Omitted for Agent mechanism — the agent itself is the orchestrator (see Agent Configuration in Layer 3).

Context items from the Workflow Requirements’ Context Inventory flagged as Partial or No for AI Accessible — with the action needed to make each one accessible before Build runs.

Quick Wins → Core → Future Enhancement. Within each tier, dependencies follow the Depends On field of each skill.

End of Layer 2. Lightweight confirmation: “Decomposition confirmed: [counts]. Moving to Component Blueprints. Confirm to proceed.”


Field-level specs Build uses to generate each new skill and agent.

For each step tagged New skill: SN:

FieldPurpose
ID, Name, DescriptionIdentity. Name must be lowercase-hyphenated, ≤64 chars. Description must start with “This skill should be used when…” and be ≤1024 chars — this is the verbatim text that goes into the SKILL.md frontmatter and drives auto-activation.
Purpose, Covers Steps/DomainsInternal context for spec readers
Inputs, OutputsThe skill’s contract
Decision Logic, Failure ModesWhat the skill does and how it handles edge cases
Required Tools, Depends OnRuntime dependencies on tools and other skills
Stateful?Memory building-block flag

Agent Configuration (13 fields, mandatory when mechanism = Agent)

Section titled “Agent Configuration (13 fields, mandatory when mechanism = Agent)”

For each step tagged New agent: AN:

FieldPurpose
ID, Name, DescriptionIdentity. Description must start with “Use this agent when…” and is the verbatim text for the agent file frontmatter.
Mission, Responsibilities, Output Format, Tone & Style, ConstraintsThe agent’s behavior — decomposed into structured sub-fields, not jammed into a single “Instructions” cell
Model, Memory ScopeCapability and persistence
Tools, SkillsCross-references to Integration Options entries and Skill IDs
Trigger Examples2-3 structured examples (context → user message → expected behavior → invocation) Build uses verbatim to construct <example> blocks in the agent’s description

When more than one agent is defined, this section captures the orchestration pattern (Supervisor / Pipeline / Parallel / Network), the coordinator, the Handoff Contracts table (data passed between agents), and the Aggregation Strategy.

Prerequisites lists platform setup, accounts, credentials, and plugin installs needed before the workflow can run. The Deployment Plan documents where each artifact lives and how it gets deployed — with a Packaging note explaining how artifacts ship together.

End of Layer 3. Final Spec Approval Gate — the only hard gate. Review the full spec; approve to move to Build.


These sections sit outside the three layers:

  • Evaluation Inputs — pointer to the Workflow Requirements’ Acceptance Criteria and Example Scenarios (not duplicated). Step 5 (Test) reads them from the Workflow Requirements directly.
  • Deferred to Build — explicit list of decisions intentionally left for Build (specific platform offering, exact model version, integration setup specifics)
  • Stakeholders (optional, organizational lens) — role swimlane and stakeholder details
  • Self-Test Summary — populated by the Design skill after running the Build Skill Needs Checklist; each item ✓ (passed) or ⚠️ (issue described inline). Lets you see what was verified before approving.

The three layers above are the conceptual structure of the Design Spec. In practice, the skill walks them in this chronological order:

  1. Load Workflow Requirements — Read the Workflow Requirements file from outputs/.
  2. Confirm understanding — Summarize the workflow and ask you to confirm.
  3. Architecture decisions (Layer 1) — Confirm platform (the one question), then extract tool integrations, trigger/schedule, and constraints from the Workflow Requirements and present a confirmation block.
  4. Autonomy assessment — Assess where the whole workflow sits on the autonomy spectrum (Deterministic, Guided, Autonomous).
  5. Orchestration mechanism — Recommend a mechanism (Prompt, Skill-Powered Prompt, or Agent) with an involvement mode (Augmented or Automated).
  6. Classify each step (Layer 2) — Per-step autonomy level, AI building blocks, tools, human review gates.
  7. Identify skill candidates — Steps tagged for skill creation with generation-ready detail.
  8. Agent configuration (Layer 3) — When applicable, generate a platform-agnostic agent blueprint.
  9. Generate Design Spec — Write the complete design document.
  10. Spec Approval Gate — Present the spec for approval. No artifacts are generated until you confirm.

This step is facilitated by the design AI Workflow Framework skill. How you get it depends on your platform — see Set Up the Skills for installation.

Command: /handsonai:design (Claude Code) — or invoke by name on any other platform.

Platform compatibility: Claude Code ✓  |  Claude.ai ✓  |  Claude Cowork ✓  |  ChatGPT ✓  |  Gemini ✓  |  M365 Copilot ✓  |  Cursor / Codex / Antigravity ✓

Start with this prompt:

Design the AI workflow from my Workflow Requirements.
Assess the autonomy level, recommend an orchestration mechanism, and map building blocks.

Upload or paste your Workflow Requirements file ([workflow-name]-requirements.md) from the Deconstruct step. The skill runs the three layers in order, with lightweight confirmation between each, and produces a Design Spec.

"Design the AI workflow from my Workflow Requirements"
→ Reads the most recent Workflow Requirements, runs Design,
produces the Design Spec for approval
"Design the expense-reporting workflow"
→ Reads outputs/expense-reporting-requirements.md, recommends
an orchestration mechanism, and generates the spec

The Design skill works best in plan mode. The layered flow maps naturally onto how you’ll work with the model:

LayerMode
L1 ArchitectureConversational — discuss platform, mechanism, packaging. Quick back-and-forth.
L2 DecompositionThe natural plan-mode entry point — start planning here. The model classifies each step and proposes Build Outputs.
L3 Component BlueprintsStays in plan mode — detailed field-level spec generation.

How to activate plan mode on your platform:

PlatformHow to activate plan mode
Claude CodePress Shift+Tab twice, or type /plan
CursorSelect “Plan” in the composer mode
Codex CLIRun with the --plan flag
Other AI toolsAsk the model: “Switch to plan mode. Walk me through Layers 2 and 3 of the Design — classify each step, identify skill candidates, write the component blueprints.”

After the model produces the spec, review and approve at the Spec Approval Gate before moving to Build.


The Design Spec is organized into the three layers above, plus cross-layer sections. The spec opens with YAML frontmatter so Build can summarize it in one read.

Layer 1 — Architecture sections: Execution Pattern, Architecture Decisions (with Packaging), Autonomy Spectrum Summary, Integration Options (with Source URLs), Model Recommendation (with per-platform mapping).

Layer 2 — Decomposition sections: Step-by-Step Decomposition (or Capability Domain Mapping for outcome-driven), Orchestrator Prompt Outline (when mechanism is Prompt or Skill-Powered Prompt), Data Readiness Summary, Recommended Implementation Order.

Layer 3 — Component Blueprint sections: Skill Candidates (12 fields each), Agent Configuration (13 fields each; mandatory for outcome-driven), Multi-Agent Configuration (when applicable), Prerequisites, Deployment Plan.

Cross-layer sections: Evaluation Inputs (pointer), Deferred to Build, Stakeholders (optional), Self-Test Summary (verification results).

Build reads both the Design Spec and the Workflow Requirements together — the Design Spec is the architectural blueprint; the Workflow Requirements remains the canonical source for per-step content, acceptance criteria, and human gates.


The Design Spec is structured for machine consumption — by the Build skill, by other AI agents, or by engineers building tooling around the spec.

ElementFormat
FrontmatterYAML. Required fields: workflow, requirements_file, spec_version, definition_type, mechanism, involvement, platform, platform_mode, packaging, counts
Stable IDsS1, S2, … for skills; A1, A2, … for agents; C1, C2, … for context items (in the Workflow Requirements); E1, E2, … for example scenarios (in the Workflow Requirements)
Canonical vocabularyAutonomy: Human / Deterministic / Guided / Autonomous. Mechanism: Prompt / Skill-Powered Prompt / Agent. Packaging: Plugin / Standalone Skill / Workspace Agent / Loose Files. Build Output: New skill: SN / Use existing: [name] / New agent: AN / Inline prompt → Workflow Requirements Step N / MCP server: [name] / Human (no artifact)
Section orderingLayer 1 → Layer 2 → Layer 3 → Cross-layer. Section names within each layer are fixed (consumers can locate any section by name).
Self-Test SummaryEach Build Skill Needs Checklist item marked ✓ or ⚠️ with inline description. Lets consumers see what was verified.

Consuming the spec standalone (without the Build skill)

Section titled “Consuming the spec standalone (without the Build skill)”

A capable agent (Claude Code, Codex, ChatGPT with skill support, Gemini CLI) can build the artifacts from the Design Spec alone, given access to:

  1. The Workflow Requirements file — referenced from frontmatter requirements_file:
  2. The agentskills.io specification — the open standard skill format (agentskills.io/specification)
  3. The target platform’s agent format docs — Claude Code’s sub-agents docs, Codex’s subagents docs, or equivalent

For best results, run Build via the framework’s Build skill — it auto-resolves these via the platform registry and handles deployment.

The Design Spec is portable across platforms because every major platform (Claude Code, Claude.ai, Cowork, Codex, ChatGPT, Gemini CLI) now follows the agentskills.io standard for skills. Three practical workflows:

  1. Same platform throughout (most common) — Design on Claude Code → Build on Claude Code. Platform in the spec matches Build’s runtime platform.
  2. Design once, Build elsewhere — Skill Candidates and Agent Configuration carry over unchanged because they’re agentskills.io-compatible. Update Architecture Decisions (Platform, Packaging) and Deployment Plan when moving. Example: design on Claude Code at home, build on Codex at work.
  3. Plan for a different target platform — Design on Claude Code targeting ChatGPT Workspace Agents (or any other platform). Build resolves the platform-specific translation at generation time.

agentskills.io is the portability layer. Every major platform converges on the standard, so Skill Candidates from one platform’s Design Spec build cleanly on another with minimal rewriting.