Welcome to AI Orchestration Mastery - the techniques that separate hobby projects from production-scale applications. You've learned to work with AI for single features. Now it's time to orchestrate AI collaboration for complex, multi-faceted projects that demand strategic resource management, specialized expertise, and systematic quality assurance.
Small-Scale AI Usage:
Production-Scale AI Orchestration:
By mastering these advanced techniques, you'll build scalable, maintainable, and production-ready applications that leverage AI's full potential while maintaining control over costs and quality.
This 55-minute lesson brings together four critical orchestration skills: token economics, professional project foundations, multi-agent workflows, and external tool integration.
By the end of this lesson, you will:
At small scale (1-10 requests):
At production scale (100-1000+ requests):
Real-world scenario:
Your app makes 500 AI requests/day for users:
❌ Inefficient: 20,000 tokens/request × 500 × $3/1M = $30/day = $900/month
✅ Optimized: 5,000 tokens/request × 500 × $3/1M = $7.50/day = $225/month
Savings: $675/month (75% reduction!)
Understanding token economics enables cost-effective AI collaboration at scale.
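The arithmetic above can be sketched as a small helper (a sketch; the $3/1M rate, token counts, and request volumes are the example's assumptions, not fixed prices):

```typescript
// Daily cost for a given average request size and volume.
// The rate is the example's assumed input price, not a quoted price.
const PRICE_PER_MILLION_TOKENS = 3; // $3 per 1M tokens (example rate)

function dailyCost(tokensPerRequest: number, requestsPerDay: number): number {
  return (tokensPerRequest * requestsPerDay * PRICE_PER_MILLION_TOKENS) / 1_000_000;
}

const inefficient = dailyCost(20_000, 500); // 30 ($/day) -> $900/month
const optimized = dailyCost(5_000, 500);    // 7.5 ($/day) -> $225/month
const monthlySavings = (inefficient - optimized) * 30; // 675 ($/month)
```

The lever is `tokensPerRequest`: request volume is driven by users, but context size per request is under your control.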
Tokens are fundamental units AI models use to process text. Estimation: 1 token ~= 4 characters, 1 word ~= 1.3 tokens, 1 page (500 words) ~= 650 tokens.
Key terms:
💡 Tip: Use OpenAI's tokenizer or Anthropic's token counter to estimate costs.
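The rules of thumb above can be wrapped in quick estimator helpers (approximations only; use the provider's tokenizer for accurate counts):

```typescript
// Rough token estimates from the rules of thumb:
// ~4 characters per token, ~1.3 tokens per word.
function estimateTokensFromChars(chars: number): number {
  return Math.round(chars / 4);
}

function estimateTokensFromWords(words: number): number {
  return Math.round(words * 1.3);
}

estimateTokensFromWords(500); // one ~500-word page -> ~650 tokens
```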
| Model | Input/Output (per 1M) | Context | Best For | Cost Example |
|---|---|---|---|---|
| Haiku | $0.25/$1.25 | 200K | Simple tasks, docs | $0.00137 (code explanation) |
| Sonnet 4.5 | $3/$15 | 200K | Complex features | $0.0165 (12x more expensive) |
| GPT-4o | $5/$15 | 128K | Creative work | Similar to Sonnet |
| Opus | $15/$75 | 200K | Novel algorithms | 5x Sonnet cost |
Key insight (Oct 2024 pricing): Haiku handles 80% of tasks at 1/12th Sonnet's cost. Use Sonnet strategically for complex problems.
| Strategy | Impact | Implementation |
|---|---|---|
| Minimize Context | 75% savings | Send only relevant files (2-3 vs. 20). Use error traces, grep to identify needed files. |
| Strategic Models | 60-70% savings | Haiku (60-70%): simple tasks, docs. Sonnet (25-30%): complex logic. Opus (5-10%): novel algorithms. |
| Reuse Conversations | 61% savings | Continue related questions in same chat. Start fresh only for unrelated topics. |
| Prompt Caching | 5x cheaper after first request | Cache system prompts, docs, large codebases. Providers cache for ~5 minutes. |
Example cost reduction:
Task: Build authentication system
Haiku (form) $0.001 + Sonnet (logic) $0.045 + Haiku (tests+docs) $0.003 = $0.049
vs. All Sonnet: $0.18 → 73% savings
💡 Tip: Use Haiku for 80% of tasks, Sonnet for complex problems. This single strategy can reduce AI costs by 60-70%.
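The worked auth-system example can be sketched as a routing table (the per-task dollar figures are the example's illustrative numbers, not real quotes):

```typescript
// Route each subtask to the cheapest model that can handle it.
// Costs are the illustrative per-task figures from the example above.
type Task = { name: string; model: "haiku" | "sonnet"; cost: number };

const tiered: Task[] = [
  { name: "login form UI", model: "haiku", cost: 0.001 },
  { name: "auth logic", model: "sonnet", cost: 0.045 },
  { name: "tests + docs", model: "haiku", cost: 0.003 },
];

const tieredCost = tiered.reduce((sum, t) => sum + t.cost, 0); // ~0.049
const allSonnetCost = 0.18;
const savings = 1 - tieredCost / allSonnetCost; // ~0.73 -> ~73% cheaper
```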
Amateur approach: jump into coding, no docs, random organization -> AI generates inconsistent code, hours wasted clarifying.
Professional approach: agent.md (15 min) + npm (5 min) + Git (5 min) -> AI generates exactly what you need, first try.
Impact: Without agent.md: 5-10 prompts per feature. With agent.md: 1-2 prompts. Time saved: 20-30 min per feature = 3-5 hours across 10 features.
agent.md is your project's AI instruction manual. Core principle: Document decisions ONCE, reference with @agent.md.
💡 Complete Template: See Concept 10: Communication with AI, Part 4 for all 10 sections (Project Description, Tech Stack, File Structure, Code Style, Data Structures, Key Functions, Constraints, Goals, Context, Workflow).
Usage:
@agent.md Add user authentication -> AI uses correct tech stack (e.g., Clerk not custom auth)
git commit -m "docs: Update agent.md with Zustand state"
💡 Tip: 15-20 min upfront saves 3-5 hours of clarification prompts.
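A minimal agent.md along these lines (a sketch; the tech choices shown are the examples used in this lesson, not requirements):

```markdown
# agent.md

## Project Description
Task manager web app for small teams.

## Tech Stack
React 18 + TypeScript, Vite, Zustand (state), Clerk (auth) - do NOT write custom auth.

## Code Style
Functional components, named exports, ESLint + Prettier defaults.

## Constraints
- No new dependencies without approval
- All features need tests (Vitest)
```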
Why npm: Dependency tracking, custom scripts, collaboration-ready, future-proof, professional structure.
Setup: npm init (interactive) or npm init -y (quick, edit manually).
| Section | Purpose | Example |
|---|---|---|
| scripts | Common commands | dev: vite, build: tsc && vite build, validate: lint + type-check + build |
| dependencies | Runtime packages | react, zustand, openai |
| devDependencies | Build tools | typescript, vite, eslint, vitest |
| metadata | Project info | name, version, description, author, license |
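A package.json matching the table might look like this (a sketch; script contents, versions, and package choices follow the table's examples and should be adapted to your project):

```json
{
  "name": "my-app",
  "version": "0.1.0",
  "description": "Example project scaffold",
  "license": "MIT",
  "scripts": {
    "dev": "vite",
    "build": "tsc && vite build",
    "lint": "eslint .",
    "type-check": "tsc --noEmit",
    "validate": "npm run lint && npm run type-check && npm run build"
  },
  "dependencies": {
    "react": "^18.0.0",
    "zustand": "^4.0.0",
    "openai": "^4.0.0"
  },
  "devDependencies": {
    "typescript": "^5.0.0",
    "vite": "^5.0.0",
    "eslint": "^8.0.0",
    "vitest": "^1.0.0"
  }
}
```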
Why Git: Track changes, experiment fearlessly (revert anytime), collaborate with AI safely (commit before trying suggestions), build portfolio.
Setup: git init -> create .gitignore (node_modules, .env, dist, .vscode, .DS_Store, *.log, coverage) -> git add . -> git commit -m "Initial setup"
AI-Assisted Commit Strategy:
git reset --hard HEAD (revert)
Frequency: ✅ After features, before AI suggestions, after refactors. ❌ Don't wait for "perfect" code.
💡 Tip: Commits = save points. Save often, experiment fearlessly.
Single agent: One generalist does everything -> generic solutions, lacks depth, sequential (slow), inconsistent quality.
Multi-agent orchestration: Specialized agents (Architect, Frontend, Backend, QA, DevOps) -> expertise per domain, higher quality, parallel execution (faster), systematic coverage.
When to use:
Use case: Quality assurance - each agent builds on previous output.
Structure: Generator -> Reviewer -> Fixer -> Test Writer -> Docs Writer
5-Agent Quality Pipeline (30-40 minutes total):
Result: Production-ready auth with code + tests + docs (vs. 2-3 hours manually).
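The Generator -> Reviewer -> Fixer -> Test Writer -> Docs Writer chain reduces to a simple sequential fold, where each stage consumes the previous stage's output (a sketch; the agent functions here are hypothetical stubs standing in for real model calls through your IDE or an API client):

```typescript
// Sequential pipeline: order matters, each stage builds on the last.
type Agent = (input: string) => string;

// Hypothetical stubs standing in for real agent calls.
const pipeline: { role: string; agent: Agent }[] = [
  { role: "Generator", agent: (spec) => `code for: ${spec}` },
  { role: "Reviewer", agent: (code) => `${code} [reviewed]` },
  { role: "Fixer", agent: (review) => `${review} [fixed]` },
  { role: "Test Writer", agent: (code) => `${code} [tests]` },
  { role: "Docs Writer", agent: (code) => `${code} [docs]` },
];

function runPipeline(spec: string): string {
  return pipeline.reduce((output, stage) => stage.agent(output), spec);
}

runPipeline("auth system");
// -> "code for: auth system [reviewed] [fixed] [tests] [docs]"
```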
Use case: Independent tasks run simultaneously.
Structure: Frontend + Backend + Database (parallel) -> Integration Agent (connects all)
3 Agents Run Simultaneously (15 min), Then Integration (10 min):
Integration Agent: Connects frontend fetch -> backend endpoints -> database queries. Adds CORS, error handling.
Result: Full-stack app in 25 minutes (vs. 50 minutes sequential - half the time).
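The parallel pattern is ordinary concurrent execution followed by one join step, which in TypeScript is `Promise.all` (a sketch; the build functions are hypothetical stubs for real agent calls):

```typescript
// Independent subtasks start simultaneously; total wait time is the
// slowest task, not the sum of all three.
async function buildFrontend(): Promise<string> { return "frontend"; }
async function buildBackend(): Promise<string> { return "backend"; }
async function buildDatabase(): Promise<string> { return "database"; }

async function buildFeature(): Promise<string> {
  const [fe, be, db] = await Promise.all([
    buildFrontend(),
    buildBackend(),
    buildDatabase(),
  ]);
  // Integration step: combine the three outputs.
  return `integrated: ${fe} + ${be} + ${db}`;
}
```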
Use case: Route questions to domain-specific experts.
Structure: General -> Generalist, React -> React Expert, Database -> DB Expert, Security -> Security Expert, Performance -> Performance Expert
3 Specialists Work Simultaneously on Different Issues:
Result: 3 complex issues fixed in 30 minutes (vs. 2+ hours sequentially).
| Pattern | How It Works | When To Use |
|---|---|---|
| Sub-agent delegation | Main agent spawns specialists (UI, backend, state) -> integrates | Advanced IDEs (Trae, Cursor) |
| Shared memory | Agents read previous context (Agent 1: "Use JWT" -> Agent 2: auto-uses JWT) | Team consistency |
| Explicit orchestration | You manually: identify subtasks -> route to agents -> integrate -> verify | Most IDEs (Activity 15) |
💡 Tip: For complex features, spend 5 minutes planning the agent workflow before starting. "Who does what, in what order?" Clear orchestration prevents duplicate work and inconsistencies.
MCP standardizes AI interaction with external tools. Without MCP: AI can't get real-time data. With MCP: AI calls Weather API, gets "72°F, partly cloudy".
MCP enables: Read/write files, call APIs, query databases, automate browser, fetch docs, run terminal commands.
Toolcall = AI executing a function. Example: "Get latest React docs. use context7" -> AI calls mcp__context7__get-library-docs("/facebook/react") -> fetches React v18.2.0 docs -> synthesizes answer.
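Conceptually, a toolcall is the model naming a function and the host looking it up and executing it. A minimal sketch of that dispatch loop (the tool name, registry shape, and canned weather data here are invented for illustration; real MCP servers define their own tool schemas):

```typescript
// A toolcall pairs a tool name with arguments; the host dispatches to it.
type Tool = (args: Record<string, string>) => string;

const tools: Record<string, Tool> = {
  // Hypothetical tool returning canned data for illustration.
  "get-weather": (args) => `72°F, partly cloudy in ${args.city}`,
};

function dispatchToolcall(name: string, args: Record<string, string>): string {
  const tool = tools[name];
  if (!tool) throw new Error(`Unknown tool: ${name}`);
  return tool(args);
}

dispatchToolcall("get-weather", { city: "Austin" });
// -> "72°F, partly cloudy in Austin"
```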
Problem: AI's outdated training data -> old API patterns -> bugs. Solution: Context7 fetches real-time docs from official sources.
| Use Case | Example | Why It Matters |
|---|---|---|
| Prevent hallucination | "Latest Expo Camera API. use context7" | Gets v14.1.0 docs (not outdated v12) |
| Version-specific docs | "React Router v6 nested routes. use context7" | Shows current <Outlet /> pattern |
| Breaking changes | "Firebase v8->v9 auth changes. use context7" | Detects modular SDK migration |
Usage format: "<question>. use context7" -> AI fetches real-time docs from official sources.
Problem: Manual testing is slow, error-prone, misses edge cases. Solution: Playwright MCP automates browser testing.
9 Core Toolcalls: navigate (load URL), snapshot (page structure), click, type, console_messages (errors), take_screenshot, fill_form, evaluate (run JS), wait_for (elements/conditions).
4-Step Process:
npm run dev -> localhost:5173
5-Step Validation Pipeline:
Result: Production-validated app with verified APIs, comprehensive tests, performance optimization - all automated!
In this lesson, you mastered advanced AI orchestration techniques for production-scale projects:
Token Management & Pricing:
Professional Project Setup:
Multi-Agent Workflows:
MCP & External Tools:
Now that you understand production-scale AI orchestration:
In Activity 15, you'll apply all these techniques by:
This is where you transition from individual AI user to AI team orchestrator - a critical skill for building production systems at scale.
💡 Remember: Production-scale AI development isn't about using AI for everything - it's about strategic orchestration, cost optimization, and systematic validation.
The AI Orchestration Mindset:
Before pushing code:
# Verify implementations
"Check APIs against latest docs. use context7"
# Test functionality
"Test all features and edge cases. use playwright"
# Validate performance
"Run Lighthouse audit. Optimize if score less than 90."
The 3 Pillars of Production AI Development:
Ready to orchestrate AI at production scale? Let's move to Activity 15 where you'll build a complete feature using these advanced techniques! 🚀