3.3: Run Workflows¶
You've worked with the model to build your building blocks — context artifacts, a prompt, maybe skills, agents, or MCP connections. Now run the workflow.
Start with your Run Guide. The model generates a Run Guide ([name]-run-guide.md) at the end of Construct — it tells you exactly how to set up and run everything on your specific platform. If you have one, follow it.
The sections below cover the general execution patterns if you need to decide independently or want to understand the options.
Choose Your Execution Pattern¶
| What you built | Execution pattern | Where to run it |
|---|---|---|
| Prompt + context files | Paste and run in a normal chat | Any AI tool (Claude, ChatGPT, Gemini, M365 Copilot) |
| Prompt + context + project | Run in a project — context is pre-loaded, paste the prompt each run | Claude Projects, ChatGPT Projects, Gemini Gems, M365 Copilot Notebooks |
| Agent + skills + MCP | Command an agent — give a goal, it plans and executes | Claude Code subagents, OpenAI Agents, Google Agents, M365 Copilot Agents |
| API script + context files | Run a script from the terminal | Any environment with API access |
| SDK agent app + tools | Deploy an agent app locally or to cloud | Local terminal, Docker, cloud service |
| AI calls in an existing app | Integrate into your pipeline | CI/CD, existing application, scheduled job |
| Any of the above + a schedule | Automate — recurring trigger, runs without you | Claude Code + launchd / Task Scheduler |
Paste and Run¶
The simplest execution pattern. You have a prompt and maybe some context files. Open a chat, provide what the prompt needs, and get the output.
How to run it:
- Open any AI tool — Claude, ChatGPT, Gemini, M365 Copilot, or any chat interface
- If you have context files (reference materials, style guides, example data), attach or upload them to the conversation
- Paste the prompt generated during the Construct phase
- Provide the inputs the prompt asks for — the input requirements section tells you exactly what's needed
- Review the output
This is where most workflows start, and many stay here permanently. If the workflow runs well as a paste-and-run prompt, there's no reason to add complexity.
Run in a Project¶
When you're running the same workflow frequently and attaching the same context files every time, move it into a project. A project pre-loads your context so you just paste the prompt and go.
Normal chat vs. project¶
| | Normal chat | Project |
|---|---|---|
| Use when | Infrequent workflow (monthly or less), few or no context files, one-off or experimental runs | Frequent workflow (weekly+), 3+ context files, team members need to run the same workflow |
| Context | Attached fresh each run | Pre-loaded — available in every conversation |
| History | Single conversation | Multiple conversations share the same context |
| Setup effort | None | A few minutes to create the project and upload files |
How to run it:¶
- Create the project — name it after your workflow
- Upload your context artifacts — the documents, data, rules, and examples you gathered during the Construct step
- Add custom instructions (optional) — a brief orientation for the AI about this workflow's purpose
- Start a new conversation within the project
- Paste the workflow prompt — the prompt is always pasted fresh each run, never embedded in project instructions. The prompt IS the workflow.
- Provide inputs and review the output
Each time you run the workflow, start a new conversation in the project. The context is already there — you just paste the prompt and provide the inputs.
Where to create a project¶
| Platform | Feature name | How to create |
|---|---|---|
| Claude | Claude Projects | Sidebar > Projects > New Project |
| ChatGPT | ChatGPT Projects | Sidebar > Projects > New Project |
| Gemini | Gems | Gems tab > Create Gem |
| M365 Copilot | Notebooks | Notebooks in Copilot sidebar |
For platform-specific setup details, see Agentic Building Blocks > Projects.
Command an Agent¶
When your workflow includes steps where the AI needs to plan its own approach, decide which tools to call, and adapt based on what it finds, you run it through an agent system. You give a goal — the agent figures out the path.
How it's different from a prompt¶
With a prompt, you define every step and the AI follows your script. With an agent, you define the outcome and the agent decides how to get there. The agent plans, calls tools, evaluates results, adjusts its approach, and executes multi-step work autonomously. You review at human review gates, not at every step.
How to run it:¶
- Give the agent a goal in natural language — describe what you want done, not how to do it step by step
- The agent plans and executes — it reads context, calls tools (via MCP), invokes skills, and adapts based on intermediate results
- Review at defined gates — your agent definition specifies where the agent pauses for human approval
- Accept or redirect — approve the output or give the agent further direction
Where to run agents¶
| Platform | Agent system | How it works |
|---|---|---|
| Claude | Claude Code subagents | Define agents as `.md` files with system prompts, skills, and tool access. Run with `claude --agent` or via the framework-orchestrator pattern. |
| OpenAI | OpenAI Agents | Build agents with the Agents API or in the Assistants playground. Agents have tools, instructions, and persistent threads. |
| Google | Google Agents | Build with Vertex AI Agent Builder or Gemini's agent capabilities. Agents connect to Google Workspace tools. |
| M365 Copilot | Copilot Agents | Build declarative or custom agents in Copilot Studio. Agents connect to Microsoft Graph and business data. |
For Claude-specific agent setup, see Agentic Building Blocks > Agents.
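As one concrete illustration, a Claude Code subagent is a markdown file (typically under `.claude/agents/`) with YAML frontmatter followed by the system prompt. The name, description, tool list, and prompt below are a hypothetical sketch, not a prescribed schema; check the Claude Code documentation for the exact fields:

```markdown
---
name: weekly-report
description: Drafts the weekly metrics report from the files in ./data
tools: Read, Write, Bash
---
You are a reporting agent. Read the CSV files in ./data, summarize the key
metrics, and write the report to reports/weekly-report.md. Pause for human
approval before overwriting any existing report.
```

The frontmatter declares what the agent may touch; the body is the goal-level instruction you would otherwise give in chat.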
Run a Code-First Workflow¶
When your architecture approach is code-first, you run your workflow from the terminal or deploy it as an application. The same four execution patterns apply, but execution happens through scripts and SDK apps rather than platform UIs.
How to run it:¶
API scripts (Prompt and Skill-Powered Prompt patterns):
- Set environment variables — export your API key and any configuration values
- Install dependencies — `pip install -r requirements.txt` or `npm install`
- Run the script — `python main.py`, `npx tsx workflow.ts`, or equivalent
- Review the output — check the terminal output or output files
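As an illustration of such an API script, here is a hedged sketch assuming the Anthropic Python SDK; the model name, file paths, and prompt text are placeholder assumptions, not requirements:

```python
from pathlib import Path

def build_messages(prompt: str, context_paths: list) -> list:
    """Bundle context files and the workflow prompt into one user message."""
    parts = []
    for path in context_paths:
        text = Path(path).read_text(encoding="utf-8")
        parts.append(f"<context file='{path}'>\n{text}\n</context>")
    parts.append(prompt)
    return [{"role": "user", "content": "\n\n".join(parts)}]

def run_workflow(prompt: str, context_paths: list) -> str:
    # Imported here so build_messages stays testable without the SDK installed.
    import anthropic  # pip install anthropic
    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
    response = client.messages.create(
        model="claude-sonnet-4-5",  # assumption: use the model your Run Guide names
        max_tokens=2048,
        messages=build_messages(prompt, context_paths),
    )
    return response.content[0].text
```

Calling `run_workflow(...)` from a `__main__` block gives you the `python main.py` step above; `build_messages` is plain Python you can test without an API key.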
SDK agent apps (Single Agent and Multi-Agent patterns):
- Set environment variables — export your API key and any configuration values
- Install dependencies — install the SDK and any tool dependencies
- Run the agent — `python agent.py`, `npx tsx agent.ts`, or use the SDK's CLI runner
- Review at defined gates — the agent pauses for human approval at configured checkpoints
- Accept or redirect — approve the output or give the agent further direction
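The loop these steps describe can be sketched in framework-agnostic Python. Nothing here is a real SDK API: the planner, tool registry, and approval gate are stand-ins for what your agent framework provides.

```python
from typing import Callable

def run_agent(
    plan_step: Callable,        # stand-in planner: returns (action, argument)
    tools: dict,                # tool name -> callable, e.g. what MCP exposes
    goal: str,
    approve: Callable = lambda out: True,  # the human review gate
    max_iterations: int = 10,   # stop condition against infinite loops
) -> str:
    observation = goal
    for _ in range(max_iterations):
        action, arg = plan_step(observation)
        if action == "finish":
            if approve(arg):                  # pause at the gate: accept...
                return arg
            observation = f"Revise: {arg}"    # ...or redirect with feedback
        elif action in tools:
            observation = tools[action](arg)  # tool result feeds the next planning step
        else:
            observation = f"Unknown tool: {action}"
    raise TimeoutError("Agent hit max_iterations without an approved result")
```

The `max_iterations` guard is the kind of stop condition that prevents the agent-loop timeouts covered under first-run issues later in this page.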
Where to run code-first workflows¶
| What you built | How to run | Where |
|---|---|---|
| API call script | Run from terminal | Local machine, CI/CD pipeline, cloud function |
| SDK agent app | Run from terminal or deploy | Local machine, Docker container, cloud service |
| Integration into existing app | Deploy with the app | Whatever environment hosts your application |
Code-first workflows can also be scheduled (see Automate on a Schedule below) — the same scheduling patterns work for scripts and agent apps.
Automate on a Schedule¶
Use this pattern when you want a workflow to run automatically — daily, weekly, or in response to triggers — without a human initiating each run. This is the highest level of the autonomy spectrum (the range from fully human-driven to fully AI-driven).
Scheduling requires an agent system with CLI (command-line interface) access — the ability to run commands from a terminal. Currently, Claude Code subagents (specialized AI assistants that run as part of a larger agent system) are the most straightforward option for this.
How it works:
- Define the agent and its workflow (same as "Command an Agent" above)
- Create a wrapper script that invokes the agent with a prompt
- Register the script with a scheduler (launchd on macOS, Task Scheduler on Windows)
- The workflow runs at the scheduled time, saves outputs, and logs results
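Step 2's wrapper can be a small Python script that the scheduler invokes. This is a sketch under assumptions: `claude -p` for a non-interactive run and an output directory of your choosing; your Run Guide has the exact command.

```python
import datetime
import pathlib
import subprocess

def run_scheduled(prompt: str, output_dir: pathlib.Path,
                  command: tuple = ("claude", "-p")) -> pathlib.Path:
    """Invoke the agent CLI, save output to a timestamped file, return the path."""
    output_dir.mkdir(parents=True, exist_ok=True)
    stamp = datetime.datetime.now().strftime("%Y%m%d-%H%M%S")
    out_file = output_dir / f"run-{stamp}.md"
    result = subprocess.run(
        [*command, prompt],             # assumption: `claude -p` is print mode
        capture_output=True, text=True, timeout=1800,
    )
    # Save stdout on success, stderr otherwise, so failed runs leave a log too.
    out_file.write_text(result.stdout or result.stderr, encoding="utf-8")
    return out_file
```

Register the script itself with launchd or Task Scheduler (step 3); the `command` parameter exists only so the file-handling logic can be exercised without the real CLI.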
When to automate vs. when to keep manual:
- Automate when the workflow is stable (you've run it several times and trust the output), the trigger is time-based or event-based, and the cost of a bad run is low
- Keep manual when you're still iterating on the prompt or agent, the workflow requires judgment calls you haven't codified, or the output goes directly to customers or stakeholders without review
For the full setup guide, see Scheduling Subagents.
Your First Run¶
The first run is a test, not a production run. You're testing whether the building blocks you constructed in 3.2 Construct actually produce the output you need.
What to evaluate:
- Did the output match what you expected?
- Were any steps skipped or misunderstood?
- Did the AI ask for something it should have had (missing context)?
- Was the output quality good enough, or generic?
Common first-run issues:
- Generic output — the AI doesn't have enough context. Add reference materials, examples, or style guides.
- Steps skipped or wrong — the prompt instructions are too vague. Make the steps more explicit.
- AI asks for information it should know — a context file is missing. Check your Context Inventory from the AI Building Block Spec.
Code-first first-run issues:
| Problem | Root cause | Fix |
|---|---|---|
| API authentication error | Invalid or missing API key | Check environment variables, verify key in platform dashboard |
| SDK tool not found | Tool schema mismatch | Verify tool definition matches the expected format for your SDK |
| Agent loop timeout | No stop condition or infinite delegation | Add max iterations, check handoff logic between agents |
| Rate limit errors | Too many API calls too fast | Add retry logic with exponential backoff |
| Module not found | Missing dependencies | Run `pip install -r requirements.txt` or `npm install` |
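For the rate-limit row, retry logic with exponential backoff can be a small wrapper around any API call. A sketch (which exception types to catch depends on your SDK):

```python
import random
import time

def with_backoff(call, max_retries=5, base_delay=1.0, retryable=(Exception,)):
    """Retry `call` with exponential backoff plus jitter."""
    for attempt in range(max_retries):
        try:
            return call()
        except retryable:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the error
            # Delays of 1s, 2s, 4s, ... plus jitter so parallel runs
            # don't retry in lockstep.
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))
```

Wrap a call as `with_backoff(lambda: client.messages.create(...), retryable=(...))`, substituting your SDK's rate-limit exception class.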
Test, Iterate, Repeat¶
Building an AI workflow is a cycle: run → evaluate → adjust → run again. Each run is a test. When the output isn't right, go back to 3.2 Construct, adjust the building block that's responsible, and test again.
Most workflows need 2-4 iterations before they produce reliably good results. This is expected — you're tuning the building blocks to your specific business context, and that takes a few passes.
Use this table to figure out which building block to adjust based on what went wrong:
| Problem | What to fix |
|---|---|
| Output quality is generic or off-brand | Add more context — reference materials, examples, style guides |
| Steps are skipped or misunderstood | Refine the prompt — make instructions more explicit, add examples of expected output |
| A step needs domain expertise the AI doesn't have | Build a skill for that step — codify the expertise so the prompt can invoke it |
| The AI needs to make decisions you can't predict in advance | Convert from prompt to agent — let the AI plan its own approach |
| The workflow should run without you triggering it | Add scheduling — automate with a recurring trigger |
This maps directly back to the Build overview — iterating is climbing the building block ladder based on what the workflow actually needs.
Related¶
- Build overview — the full build process including design, skills, agents, and MCP
- Worked Examples — Deterministic Automation, AI Collaborative, Autonomous Agent
- Scheduling Subagents — full automation setup guide
- Agentic Building Blocks — platform-specific implementation details