M09: AI-Assisted Code Review — Workshop Guide

Self-directed | 45–60 min | Requires: M09 study guide read beforehand

Before You Start

Prerequisites

M09 study guide read (theory + AutoCommenter research)
M08 completion (security review patterns)
1–2 weeks of Claude Code usage
Experience reviewing code in your team’s workflow
Familiarity with your team’s code style guide
A Git repository where you can commit .claude/ files

What you’ll build

The theory explains what automated review catches and misses; this workshop makes it operational. You will build a /review skill with a structured checklist, create a code-reviewer subagent, and practice the Writer/Reviewer pattern. By the end, you’ll have a team-aligned code review workflow and direct experience of how a fresh perspective catches things the original writer missed.

What You’ll Do

Build the /review skill with a structured checklist
Create the code-reviewer subagent
Implement the Writer/Reviewer pattern skill
Review intentionally flawed code, fix it, and re-review until approved

Part 1 — Review Philosophy

The pre-work covered two critical insights:

Style is automatable — linters handle naming, formatting, obvious bugs
Design judgment is human — architecture fit, tradeoffs, context

The Google AutoCommenter research is consistent with this split: automated systems are effective at flagging style and best-practice violations, but subtle design problems still call for human judgment.

Part 2 — Build the /review Skill and Code-Reviewer Subagent

Step 1: Create /review Skill with Checklist

Create .claude/skills/review.md:

---
name: review
description: Structured code review using team checklist
disable-model-invocation: false
allowed-tools: Read, Grep, Glob
---

# Code Review Checklist

Review the provided code against these criteria. Return structured feedback.

## Format

For each item below, respond:
- **[PASS|WARN|FAIL]** Category - Specific finding or "No issues"

## Checklist

### Design & Architecture (Critical)
- [ ] Code follows team architecture patterns (see ARCHITECTURE.md)
- [ ] No unnecessary coupling to external systems
- [ ] Abstractions are justified (is this complexity needed?)
- [ ] Data structures chosen appropriately
- [ ] APIs are intuitive and well-scoped

### Security & Data Protection
- [ ] No hardcoded secrets or credentials
- [ ] User input validated/sanitized
- [ ] PII not exposed in logs or errors
- [ ] Authentication/authorization checks present (if needed)
- [ ] Dependencies scanned for vulnerabilities

### Correctness & Logic
- [ ] Happy path works correctly
- [ ] Edge cases handled (null, empty, overflow, negative)
- [ ] Error handling present and informative
- [ ] State management correct (no race conditions, stale state)
- [ ] Performance acceptable (no N+1 queries, unbounded loops)

### Testability & Testing
- [ ] Code is testable (low coupling, dependencies injectable)
- [ ] Unit tests cover main logic and edge cases
- [ ] Test names are descriptive ("testValidateEmail" > "test1")
- [ ] Integration tests cover happy path
- [ ] Mocking/stubbing appropriate (not testing dependencies)

### Code Style & Maintainability
- [ ] Names are clear and descriptive
- [ ] Functions focused (single responsibility principle)
- [ ] No excessive nesting (max 3-4 levels)
- [ ] Comments explain "why", not "what" (code is the what)
- [ ] Follows team conventions (see STYLE_GUIDE.md)

### Documentation
- [ ] Function signatures have JSDoc/docstrings
- [ ] Complex logic explained in comments
- [ ] Breaking changes documented
- [ ] README updated (if public API)

## Summary

At the end, summarize:
- [ ] **Blockers:** Must fix before merge (Critical findings)
- [ ] **Warnings:** Should address before merge (Medium findings)
- [ ] **Nice-to-haves:** Consider for next iteration (Low findings)
- [ ] **Approval:** Ready to merge? (Yes/No)

## Example Review Output

```
### Design & Architecture
[PASS] Architecture - Code follows team patterns for service layer

### Security & Data Protection
[FAIL] Secrets - Hardcoded API key found in src/api.js:15
  → Move to environment variable (process.env.API_KEY)

### Correctness & Logic
[WARN] Edge cases - What happens if response.data is undefined?
  → Add null check: if (!response?.data) { ... }

### Testability & Testing
[PASS] Tests - Comprehensive test coverage, good mocking

...

## Summary
- Blockers: 1 (hardcoded secret)
- Warnings: 2 (missing edge case checks)
- Nice-to-haves: 0
- Approval: NO - Fix secret exposure before merge
```
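
Because every finding follows the `[PASS|WARN|FAIL] Category - finding` shape, the reviewer's output can be post-processed outside the skill file. The following is a minimal sketch of that idea, not part of Claude Code: the `Finding` type and the `parseFindings` and `summarize` helpers are hypothetical names, and treating every FAIL as a blocker and every WARN as a warning is an assumption based on the example output above.

```typescript
// Hypothetical helpers for post-processing /review output (not a Claude Code API).
type Status = "PASS" | "WARN" | "FAIL";

interface Finding {
  status: Status;
  category: string;
  detail: string;
}

// Matches lines such as "[FAIL] Secrets - Hardcoded API key found in src/api.js:15".
const FINDING_LINE = /^\s*(?:-\s*)?\[(PASS|WARN|FAIL)\]\s+(.+?)\s+-\s+(.+)$/;

function parseFindings(reviewText: string): Finding[] {
  const findings: Finding[] = [];
  for (const line of reviewText.split("\n")) {
    const match = FINDING_LINE.exec(line);
    if (match) {
      findings.push({
        status: match[1] as Status,
        category: match[2],
        detail: match[3],
      });
    }
  }
  return findings;
}

// Assumption: FAIL counts as a blocker, WARN as a warning.
function summarize(findings: Finding[]) {
  const blockers = findings.filter((f) => f.status === "FAIL").length;
  const warnings = findings.filter((f) => f.status === "WARN").length;
  return { blockers, warnings, approved: blockers === 0 };
}

const sample = [
  "### Security & Data Protection",
  "[FAIL] Secrets - Hardcoded API key found in src/api.js:15",
  "  → Move to environment variable (process.env.API_KEY)",
  "[WARN] Edge cases - What happens if response.data is undefined?",
  "[PASS] Tests - Comprehensive test coverage, good mocking",
].join("\n");

console.log(summarize(parseFindings(sample)));
// { blockers: 1, warnings: 1, approved: false }
```

Headings and suggestion lines (the ones starting with `→`) are skipped because they do not match the pattern, so the tally depends only on the tagged finding lines.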

Step 2: Create Code-Reviewer Subagent

Create .claude/agents/code-reviewer.md:

---
name: code-reviewer
description: Thorough code reviewer focused on design, correctness, and security
model: claude-opus-4-1
tools: Read, Grep, Glob
---

You are a senior code reviewer. Your job is to evaluate code for design,
correctness, security, and maintainability. You review code *others* have written,
not code you wrote, so you carry no authorship bias.

When reviewing:
1. Read the code with fresh eyes. Don't assume intent.
2. Ask: "Would I write this the same way? If not, why?"
3. Check edge cases: What breaks this? Null inputs? Huge scale?
4. Verify security: Any secrets? Input validation? Data exposure?
5. Judge design: Is this abstraction justified? Testable? Team-aligned?

Use the `/review` checklist as your framework. Be specific:
- Don't say "complexity is high"; say "three nested loops with unclear purpose"
- Don't say "missing tests"; say "error case where response is null is untested"
- Don't say "bad naming"; say "variable 'x' should be 'maxRetries'"

Be constructive. Every finding should suggest a fix.

Return findings in the structured format from `/review`.

Part 3 — Implement the Writer/Reviewer Pattern

Create Feature-with-Review Skill

Create .claude/skills/feature-with-review.md:

---
name: feature-with-review
description: Complete feature workflow - write, review, iterate
disable-model-invocation: false
allowed-tools: []
---

# Feature Development with Code Review

This skill orchestrates the Writer/Reviewer pattern.

## Steps

### 1. Clarify Requirements
- What are we building?
- Success criteria?
- Constraints (performance, security, dependencies)?

### 2. Write Code (Writer Mode)
Ask Claude:
> "Implement [feature] following [style guide]. Create [files] with tests."

Claude generates feature code.

### 3. Review Code (Reviewer Mode)
Ask @code-reviewer:
> "@code-reviewer review src/[feature].ts using the /review checklist"

Reviewer subagent evaluates against checklist.

### 4. Address Findings
If blockers:
- Fix hardcoded secrets
- Add missing error handling
- Implement security checks

If warnings:
- Consider refactoring
- Add edge case tests
- Document complex logic

If nice-to-haves:
- Can defer to next iteration

### 5. Re-review (If Major Changes)
If you made significant changes:
> "@code-reviewer re-review src/[feature].ts"

Continue until: Blockers = 0, Warnings acceptable, Approval = YES

### 6. Merge
Once approved, commit and merge to main.

## Example Session

```
User: "I need to add a discount calculation feature. Let's follow the team style guide and ensure it's well-tested."

Claude (Writer): Creates src/discount.ts, src/discount.test.ts

User: "@code-reviewer review src/discount.ts using /review checklist"

Reviewer: Returns structured findings:
  - [FAIL] Missing edge case: discount > 100% should be capped
  - [WARN] Test for negative price missing
  - [PASS] Architecture follows team patterns

User: Fixes the two issues (adds edge case handling, adds test)

User: "@code-reviewer re-review src/discount.ts"

Reviewer: [PASS] All issues resolved. Ready to merge.

User: Commits and merges.
```

## Key Principle

**The writer and reviewer are different actors.** This prevents bias and ensures fresh perspective.
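
The write, review, fix, re-review cycle described in the steps above can be sketched as a small loop. This is an illustration under stated assumptions, not how Claude Code implements anything: `write`, `review`, and `fix` are hypothetical stand-ins for the writer session, the code-reviewer subagent, and your own edits, and they are synchronous here only to keep the sketch short (real calls would be asynchronous).

```typescript
// Sketch only: the three actors are injected so writer and reviewer stay separate.
interface ReviewResult {
  blockers: string[];
  warnings: string[];
}

interface Actors {
  write: (requirements: string) => string;
  // The reviewer sees only the code, never the writer's reasoning.
  review: (code: string) => ReviewResult;
  fix: (code: string, result: ReviewResult) => string;
}

function writeReviewLoop(
  requirements: string,
  actors: Actors,
  maxRounds = 3,
): { code: string; rounds: number; approved: boolean } {
  let code = actors.write(requirements);
  for (let round = 1; round <= maxRounds; round++) {
    const result = actors.review(code);
    if (result.blockers.length === 0) {
      return { code, rounds: round, approved: true };
    }
    code = actors.fix(code, result);
  }
  // The last fix was never re-reviewed, so it is not treated as approved.
  return { code, rounds: maxRounds, approved: false };
}

// Toy actors mirroring the discount example: the reviewer blocks until the cap exists.
const outcome = writeReviewLoop("discount calculation", {
  write: () => "return price * (1 - pct / 100);",
  review: (code) => ({
    blockers: code.includes("Math.min") ? [] : ["discount > 100% should be capped"],
    warnings: [],
  }),
  fix: (code) => code.replace("pct", "Math.min(pct, 100)"),
});

console.log(outcome.approved, outcome.rounds); // true 2
```

The `maxRounds` limit is a design choice for the sketch: if blockers survive several rounds, the loop stops and leaves the decision to a person instead of iterating indefinitely.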

Part 4 — Hands-on: Review Real Code, Fix, Re-review

Step 1: Review Poorly-Written Code

Create src/payment-processor.ts with the following intentional issues:

// Intentional issues for review practice
export class PaymentProcessor {
  private api = 'https://api.stripe.com';
  private key = 'sk_test_1234567890'; // HARDCODED SECRET

  async processPayment(amount, cardToken, userId) {
    // Missing input validation
    if (amount > 999999) {
      throw new Error('Amount too large');
      // What about amount < 0?
      // What about amount = 0?
    }

    // Over-coupling: directly calls external API
    const response = await fetch(this.api + '/charges', {
      method: 'POST',
      body: JSON.stringify({
        amount,
        source: cardToken,
        customer: userId,
      }),
      headers: {
        Authorization: `Bearer ${this.key}`,
      },
    });

    // Insufficient error handling
    const data = await response.json();

    // Logs sensitive data
    console.log(`Payment processed: ${JSON.stringify(data)}`);

    return data;
  }

  // Missing tests
  // Missing documentation
}

Run the review:

@code-reviewer review src/payment-processor.ts using /review checklist

Expected findings:

### Design & Architecture
[WARN] Coupling - Direct Stripe API calls; consider dependency injection
  → Create StripeClient interface, inject it; makes testing easier

### Security & Data Protection
[FAIL] Secrets - Hardcoded API key in src/payment-processor.ts:4
  → Move to environment variable (process.env.STRIPE_SECRET_KEY)
[FAIL] Data Protection - Sensitive payment data logged
  → Remove console.log or log only transaction ID, not full response

### Correctness & Logic
[FAIL] Edge cases - What if amount is 0 or negative?
  → Add validation: if (amount <= 0) throw new Error('...')
[WARN] Error handling - response.json() could fail; not caught
  → Wrap in try-catch: try { const data = await response.json(); } catch (e) { ... }

### Testability & Testing
[FAIL] Tests - No tests present; this is a critical payment feature
  → Add unit tests covering: success path, error cases, edge cases

### Code Style & Maintainability
[WARN] Documentation - No function doc; parameters lack types
  → Add JSDoc: @param {number} amount, @param {string} cardToken, etc.

## Summary
- Blockers: 4 (hardcoded secret, sensitive data logged, no edge case validation, no tests)
- Warnings: 3 (Stripe coupling, missing error handling, poor documentation)
- Nice-to-haves: 0
- Approval: NO - Fix blockers and consider warnings before merge

Step 2: Fix the Code

Address all blockers and warnings found in the review:

import { validatePaymentInput } from './validation';
import { StripeClient } from './stripe-client'; // External API client, injected below

/** Result returned to callers; exposes only the transaction ID. */
export interface PaymentResult {
  success: boolean;
  transactionId: string;
}

/** Processes payments through an injected StripeClient. */
export class PaymentProcessor {
  constructor(private stripe: StripeClient) {}

  /**
   * Charges a card token on behalf of a user.
   * @param {number} amount - Payment amount in cents (must be > 0)
   * @param {string} cardToken - Stripe card token
   * @param {string} userId - User ID for audit trail
   * @returns {Promise<PaymentResult>} Transaction result with ID
   */
  async processPayment(
    amount: number,
    cardToken: string,
    userId: string,
  ): Promise<PaymentResult> {
    // Validate input shape (helper assumed to live in ./validation)
    validatePaymentInput({ amount, cardToken, userId });

    // Edge case: amount must be positive
    if (amount <= 0) {
      throw new Error('Amount must be positive');
    }
    if (amount > 999999) {
      throw new Error('Amount exceeds maximum');
    }

    try {
      // Use injected client, not hardcoded credentials
      const result = await this.stripe.createCharge({
        amount,
        source: cardToken,
        customer: userId,
      });

      // Log safely (no sensitive data)
      console.log(`Payment processed: transaction_id=${result.id}, user_id=${userId}`);

      return {
        success: true,
        transactionId: result.id,
      };
    } catch (error) {
      // Log the cause internally; the thrown message doesn't leak API details
      const reason = error instanceof Error ? error.message : String(error);
      console.error('Payment processing failed:', reason);
      throw new Error('Payment failed. Please try again.');
    }
  }
}

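// Test coverage: src/payment-processor.test.ts (Jest)
import { PaymentProcessor } from './payment-processor';
import { StripeClient } from './stripe-client';

describe('PaymentProcessor', () => {
  let processor: PaymentProcessor;
  let stripeMock: jest.Mocked<StripeClient>;

  beforeEach(() => {
    // Assumes StripeClient is an interface exposing only createCharge
    stripeMock = { createCharge: jest.fn() } as jest.Mocked<StripeClient>;
    processor = new PaymentProcessor(stripeMock);
  });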

  describe('Input validation', () => {
    it('rejects zero amount', async () => {
      await expect(processor.processPayment(0, 'tok_123', 'user_1'))
        .rejects
        .toThrow('Amount must be positive');
    });

    it('rejects negative amount', async () => {
      await expect(processor.processPayment(-100, 'tok_123', 'user_1'))
        .rejects
        .toThrow('Amount must be positive');
    });

    it('rejects amount exceeding limit', async () => {
      await expect(processor.processPayment(1000000, 'tok_123', 'user_1'))
        .rejects
        .toThrow('Amount exceeds maximum');
    });
  });

  describe('Happy path', () => {
    it('processes valid payment and returns transaction ID', async () => {
      stripeMock.createCharge.mockResolvedValueOnce({ id: 'ch_123' });

      const result = await processor.processPayment(1000, 'tok_123', 'user_1');

      expect(result.success).toBe(true);
      expect(result.transactionId).toBe('ch_123');
    });
  });

  describe('Error handling', () => {
    it('handles Stripe API errors gracefully', async () => {
      stripeMock.createCharge.mockRejectedValueOnce(
        new Error('Card declined'),
      );

      await expect(processor.processPayment(1000, 'tok_123', 'user_1'))
        .rejects
        .toThrow('Payment failed');
    });
  });
});
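
The fixed code imports `./validation` and `./stripe-client` without showing them. The sketch below is one guess at what those modules might contain, combined into a single block for brevity. Everything in it is hypothetical: the `ChargeRequest` shape, the `loadStripeKey` helper, and the specific validation rules are assumptions chosen to match the calls and tests above, and `STRIPE_SECRET_KEY` is the variable name suggested in the review findings.

```typescript
// Hypothetical contents of ./stripe-client and ./validation (assumptions throughout).
interface ChargeRequest {
  amount: number; // in cents
  source: string;
  customer: string;
}

// The interface PaymentProcessor depends on; a real implementation would wrap the Stripe API.
interface StripeClient {
  createCharge(request: ChargeRequest): Promise<{ id: string }>;
}

// Reads the secret from an environment-like object instead of from source code.
function loadStripeKey(env: Record<string, string | undefined>): string {
  const key = env["STRIPE_SECRET_KEY"];
  if (!key) {
    throw new Error("STRIPE_SECRET_KEY is not set");
  }
  return key;
}

interface PaymentInput {
  amount: number;
  cardToken: string;
  userId: string;
}

// Checks input shape only; the range checks stay in PaymentProcessor,
// so the 'Amount must be positive' tests above still exercise the class itself.
function validatePaymentInput(input: PaymentInput): void {
  if (!Number.isInteger(input.amount)) {
    throw new Error("Amount must be an integer number of cents");
  }
  if (input.cardToken.trim() === "") {
    throw new Error("cardToken is required");
  }
  if (input.userId.trim() === "") {
    throw new Error("userId is required");
  }
}
```

At application startup you would pass `process.env` to `loadStripeKey` and hand the result to whatever class implements `StripeClient`, so the key never appears in the repository.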

Step 3: Re-review

@code-reviewer re-review src/payment-processor.ts

Expected output:

### Design & Architecture
[PASS] Architecture - Dependency injection makes this testable
[PASS] Coupling - Stripe client is abstracted behind interface

### Security & Data Protection
[PASS] Secrets - No hardcoded credentials; the API key lives behind the injected StripeClient
[PASS] Data Protection - Sensitive data not logged

### Correctness & Logic
[PASS] Edge cases - All validated (zero, negative, max amount)
[PASS] Error handling - Try-catch wraps API call; safe error message

### Testability & Testing
[PASS] Tests - Comprehensive coverage of happy path, edge cases, errors
[PASS] Mocking - Stripe client properly mocked

### Code Style & Maintainability
[PASS] Documentation - JSDoc covers parameters and return
[PASS] Functions focused - Single responsibility

## Summary
- Blockers: 0
- Warnings: 0
- Nice-to-haves: 0
- Approval: YES - Ready to merge

Reflection

Take a moment to consider these questions before moving on — they’ll reinforce what you just experienced:

What did the reviewer catch that you might have missed as the writer? Look for: design coupling, edge cases, error handling assumptions.
Why was the second review faster than the first? The fixes targeted specific findings, so the reviewer confirmed each one was resolved rather than hunting for new issues.
How does this process scale to your team? Code review becomes a checklist. Blockers must be fixed. Fresh perspective is built in.
What’s one thing you’d add to the /review checklist for your team’s specific codebase?

Troubleshooting

Reviewer subagent is too critical or too lenient:

Adjust instructions in .claude/agents/code-reviewer.md
Add examples of what “good design” looks like for your team
Reference your ARCHITECTURE.md file

Review takes too long:

Use linters for style (ESLint, Prettier) before running the review
Focus review on design, correctness, security — not formatting
Create a /review mini version for a quick pass and the full version for a deep dive

Review findings are being ignored:

Make review a blocker for PR merge (CI/CD integration)
Surface findings in PR comments with context for why each issue matters
Connect findings to past production incidents where applicable
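
One way to give findings the context mentioned above is to render them into a PR comment that always pairs the issue with its rationale. The following is a rough sketch under assumptions: the `CommentFinding` shape and `renderPrComment` function are hypothetical, and actually posting the comment is left to whatever your code host or CI system provides.

```typescript
// Hypothetical shape: `why` carries the context that makes a finding persuasive.
interface CommentFinding {
  severity: "blocker" | "warning";
  location: string;
  issue: string;
  why: string;
  incident?: string; // optional ID or link for a related production incident
}

// Builds a Markdown comment body; it does not call any PR API.
function renderPrComment(findings: CommentFinding[]): string {
  const items = findings.map((f) => {
    const parts = [
      `- **${f.severity.toUpperCase()}** \`${f.location}\`: ${f.issue}`,
      `  - Why it matters: ${f.why}`,
    ];
    if (f.incident) {
      parts.push(`  - Related incident: ${f.incident}`);
    }
    return parts.join("\n");
  });
  return ["### Code review findings", ...items].join("\n");
}

const comment = renderPrComment([
  {
    severity: "blocker",
    location: "src/payment-processor.ts",
    issue: "Hardcoded API key",
    why: "Anyone with repository access could use the key",
  },
]);

console.log(comment);
```

Making `why` a required field is deliberate: a finding without a stated reason cannot be rendered, which nudges the reviewer toward explaining impact instead of only naming the rule.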

Reviewer misses obvious issues:

Re-check: is the issue truly obvious, or domain-specific knowledge the reviewer lacks?
Add to reviewer subagent instructions: “Check for [specific issue type]”
Pair with SAST tools to catch patterns the reviewer misses
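
To make the idea of pairing the reviewer with pattern-based tooling concrete, here is a deliberately tiny sketch of a secret scan. It is illustrative only and not a substitute for a real SAST or secret-scanning tool: the `scanForSecrets` function and its two rules are hypothetical, and the only external fact relied on is that Stripe secret keys begin with `sk_test_` or `sk_live_`.

```typescript
// Illustrative only: two regex rules, far short of what a real scanner covers.
interface SecretHit {
  line: number; // 1-based line number
  rule: string;
}

const RULES: { rule: string; pattern: RegExp }[] = [
  { rule: "stripe-secret-key", pattern: /\bsk_(?:test|live)_[0-9A-Za-z]{8,}/ },
  { rule: "hardcoded-bearer-token", pattern: /Bearer\s+[0-9A-Za-z._-]{16,}/ },
];

function scanForSecrets(source: string): SecretHit[] {
  const hits: SecretHit[] = [];
  source.split("\n").forEach((text, index) => {
    for (const { rule, pattern } of RULES) {
      if (pattern.test(text)) {
        hits.push({ line: index + 1, rule });
      }
    }
  });
  return hits;
}

const flawed = [
  "export class PaymentProcessor {",
  "  private api = 'https://api.stripe.com';",
  "  private key = 'sk_test_1234567890';",
  "}",
].join("\n");

console.log(scanForSecrets(flawed));
// [ { line: 3, rule: 'stripe-secret-key' } ]
```

A deterministic check like this catches the same pattern every time, which frees the reviewer subagent to spend its attention on design and correctness questions that no regex can answer.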

Completion Checklist

Before moving on, verify you have:

.claude/skills/review.md committed to your repo
.claude/agents/code-reviewer.md committed to your repo
.claude/skills/feature-with-review.md committed to your repo
Tested the workflow on at least one feature
Verified fixes pass the code-reviewer subagent
Can explain the difference between automated style checking and human design judgment
Plan to use the Writer/Reviewer pattern for your next 2-3 features

References

Google AutoCommenter Research: https://arxiv.org/abs/2405.13565 (“AI-Assisted Assessment of Coding Practices in Modern Code Review”)
Google Code Review Best Practices: https://google.github.io/eng-practices/review/
Trunk Engineering Playbook: https://www.trunkbaseddevelopment.com/code-review/
Code Review Culture: https://engineering.squarespace.com/blog/2020/code-review-best-practices
Security Review Patterns: https://owasp.org/www-project-secure-coding-practices/

Workshop complete. Verify your checklist: blockers = 0, warnings resolved, code approved. Ready to ship.