claude-review-pdca

A closed-loop PDCA (Plan-Do-Check-Act) system that automatically surfaces past code review findings during implementation, enabling Claude Code to learn from its own mistakes.

Best for teams already using Claude Code review skills who want past findings to come back automatically at edit time instead of being forgotten after one review pass.

Status

This repo is now usable in both of these modes:

Claude Code hook mode: findings are injected automatically by hooks
Codex / manual mode: the same context injection can be triggered explicitly with a command

That means the core workflow is no longer "hidden in local glue only" -- the implementation-session bridge now lives in this repo.

At a Glance

Store review findings in SQLite
Re-inject only relevant findings when editing the same file later
Log learned implementation patterns from review-fix sessions
Support both hook-based and command-based runtimes
Keep false-positive learning human-gated

The Problem

Claude Code's review skills (/ifr, /review-fix-loop, etc.) detect bugs, design issues, and anti-patterns -- but findings are stored once and forgotten. The next coding session starts from zero, with no memory of past mistakes. A bulk "you have N pending findings" notification at session start is noise; developers ignore it.

The Solution

Inject the right findings at the right time: when a file is being edited, only past findings for that specific file are surfaced -- keeping signal-to-noise ratio high.

Developer edits a file
     |
     v
PreToolUse hook fires
     |
     v
SQLite query: file_path filter + severity + freshness + repo scope
     |
     v
Relevant findings injected into context (max 8)
     |
     v
Claude implements with awareness of past mistakes
     |
     v
Every 5 edits: batch review trigger
     |
     v
SessionEnd: learned false-positive patterns written to CLAUDE.md

Runtime Modes

This repo now supports two execution styles:

Claude Code hook mode
- hooks/pre-tool-inject-findings.py
- hooks/post-tool-edit-counter.py
- hooks/session-end-learn.py
- hooks/implementation-session-detector.js
- hooks/review-feedback-session-check.js
Codex / manual command mode
- scripts/prepare-implementation-context.py
- scripts/record-rfl-patterns.py

The key idea is: Claude can use hooks, while Codex can call the equivalent command explicitly. That means pinned-repo users can understand the whole workflow from this repo alone, without relying on hidden local-only glue for implementation-session activation.

Quick Start

1) Claude Code hook mode

Register the hooks described in docs/hooks.md.

Main entrypoints:

hooks/pre-tool-inject-findings.py
hooks/post-tool-edit-counter.py
hooks/session-end-learn.py
hooks/implementation-session-detector.js
hooks/review-feedback-session-check.js

2) Codex / manual mode

Before the first implementation edit for a target file:

python scripts/prepare-implementation-context.py \
  --session-id codex-sess-1 \
  --cwd C:/path/to/repo \
  --prompt "sc-rfl この file を修正" \
  --file-path src/app/main.py

This prints the same context block that the Claude hook path would inject.

Codex / manual command example

Before implementation:

python scripts/prepare-implementation-context.py \
  --session-id codex-sess-1 \
  --cwd C:/path/to/repo \
  --prompt "sc-rfl この file を修正" \
  --file-path src/app/main.py

This command:

detects implementation markers such as sc-rfl / /rfl
writes implementation-session.json
invokes the same PreToolUse injection path that Claude hook mode uses
prints the context block to reuse in the current agent turn

What Lives in This Repo

Included here:

hook-side context injection logic
implementation-session detection
learned-pattern logging / summarization
Codex/manual bridge command
tests for hook-equivalent behavior

Still external today:

the main review-feedback.py CLI / DB producer is still expected at the path pointed to by REVIEW_FEEDBACK_SCRIPT
default path: ~/.claude/scripts/review-feedback.py

So this repo now contains the runtime bridge and reinjection logic, but not yet a fully vendored producer stack.

Architecture

                    +------------------+
                    |  review-feedback |
                    |      .db         |  <-- SQLite: findings, sessions, dismissals
                    +--------+---------+
                             |
              +--------------+--------------+
              |              |              |
     +--------v---+  +------v------+  +----v---------+
     | PreToolUse |  | PostToolUse |  |  SessionEnd  |
     |   Hook     |  |    Hook     |  |    Hook      |
     +--------+---+  +------+------+  +----+---------+
              |              |              |
   File-specific      Edit counter    FP pattern
   finding injection  + file tracking  learning
   (Phase A/B)        (batch trigger)  + stale GC
              |              |              |
              v              v              v
     Claude context   batch-review-    CLAUDE.md
     (SNR-optimized)  trigger.py       auto-update

Hook Pipeline

Hook	Trigger	Purpose
PreToolUse	Before Edit/Write	Inject past findings for the target file (Phase A: file-specific, Phase B: project-wide fallback)
PostToolUse	After Edit/Write	Count edits, track modified files, update finding freshness
SessionEnd	Session close	Learn confirmed false-positive patterns, clean up state files, GC stale findings

Key Design Decisions

1. Pinpoint Injection over Bulk Notification

Instead of dumping all pending findings at session start (context pollution), findings are injected only when the relevant file is being edited. This keeps the signal-to-noise ratio high -- Claude sees 3-8 relevant findings, not 50 unrelated ones.

2. Two-Phase Query Strategy

Phase A: File-specific findings filtered by severity, freshness (30-day window OR recent last_relevant_edit), and repo scope
Phase B: If Phase A returns nothing, fall back to project-wide critical-only findings (max 5)

This ensures high-severity issues are never missed, even for files without prior findings.

3. Human-Only Dismissal (Anti-Sycophancy Guard)

Claude cannot dismiss its own findings. The dismissed_by column enforces 'user' -- preventing the "sycophancy trap" where an AI marks its own flagged issues as false positives to appear helpful. Only after a human confirms a pattern 2+ times does it get promoted to CLAUDE.md.

4. Batch Review Trigger

Running /ifr after every single edit would explode costs. Instead, edits are counted silently, and a structured review is triggered every 5 edits with a prioritized report of which files to review.

5. Resolution Lifecycle

Findings follow a state machine: pending -> accepted | rejected_intentional | rejected_wrong | fixed | stale. The stale transition (90 days) is automatic via SessionEnd GC, preventing unbounded growth.

6. Repository Scoping

repo_root column prevents cross-project contamination -- findings from project A never leak into project B's injection context. Path normalization handles Windows backslashes, UNC paths (//server/share), and case differences.

Project Structure

claude-review-pdca/
  CODEX.md                          # Codex-side activation rules
  hooks/
    config.py                       # Shared config (DB path, normalize_git_root)
    implementation-session-detector.js
    pre-tool-inject-findings.py     # PreToolUse: file-specific finding injection
    post-tool-edit-counter.py       # PostToolUse: edit counting + file tracking
    review-feedback-session-check.js
    session-end-learn.py            # SessionEnd: FP learning + cleanup + stale GC
  scripts/
    batch-review-trigger.py         # 5-edit batch review coordinator
    backfill-repo-root.py           # Migration: backfill repo_root for legacy data
    prepare-implementation-context.py
    record-review-outcome.py        # common review outcome -> feedback/pattern producer
    record-rfl-patterns.py          # findings -> review-patterns.db bridge
  tests/
    conftest.py                     # Pytest fixtures (in-memory SQLite)
    test_config.py                  # Config module tests
    test_pre_tool_inject_findings.py
    test_pre_tool_inject_main.py    # Phase A/B injection logic
    test_post_tool_edit_counter.py
    test_batch_review_trigger.py
    test_record_review_outcome.py
    test_session_end_learn.py
  docs/
    auto-pdca-producer-design.md    # sc-rfl / sc-ifr / sc-ir 共通 producer 設計
    design.md                       # System architecture deep-dive
    db-schema.md                    # Full schema + standard queries
    hooks.md                        # Hook implementation specs
    dismissal-policy.md             # Safety rules for FP dismissal
    references.md                   # Karpathy, Boris Cherny, etc.
  CLAUDE.md                         # Project-specific Claude Code instructions

Codex Activation Rule

If an implementation task prompt includes markers such as:

sc-rfl
sc-review-fix-loop
sc-ui
sc-frontend-implementation
sc-tdd
sc-e2e
sc-bt
sc-at
/review-fix-loop
/rfl

then the Codex-side equivalent of hook activation is:

identify the edit target file(s)
run scripts/prepare-implementation-context.py
use the returned injected context while editing

See CODEX.md for the exact rule text.

Database Schema (Key Columns)

CREATE TABLE findings (
    id              INTEGER PRIMARY KEY,
    session_id      TEXT,
    repo_root       TEXT,           -- repository scope isolation
    reviewer        TEXT,
    severity        TEXT,           -- critical / high / warning / info
    category        TEXT,
    file_path       TEXT,
    finding_summary TEXT,
    resolution      TEXT DEFAULT 'pending',
        -- CHECK: pending / accepted / rejected_intentional
        --        / rejected_wrong / fixed / stale
    dismissed       INTEGER DEFAULT 0,
    dismissed_by    TEXT,           -- 'user' only (anti-sycophancy)
    fp_reason       TEXT,
    last_relevant_edit TEXT,        -- freshness signal for dormant findings
    created_at      TEXT,
    ...
);

Indexes: idx_file_path, idx_pending, idx_repo_file(repo_root, file_path)

Inspiration

Andrej Karpathy -- "Vibe coding" and the feedback loop between AI coding assistants and code quality
Boris Cherny -- Self-correcting repositories: every mistake becomes a rule that prevents the next one
PDCA Cycle -- Continuous improvement applied to AI-assisted development: Plan (review rules) -> Do (implement) -> Check (findings) -> Act (learn patterns)

Tech Stack

Python 3.12+ with type hints
SQLite for findings persistence (zero-dependency, single-file DB)
Claude Code Hooks API (PreToolUse / PostToolUse / SessionEnd)
pytest for testing with in-memory SQLite fixtures

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
.github/workflows		.github/workflows
docs		docs
hooks		hooks
scripts		scripts
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CODEX.md		CODEX.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

claude-review-pdca

Status

At a Glance

The Problem

The Solution

Runtime Modes

Quick Start

1) Claude Code hook mode

2) Codex / manual mode

Codex / manual command example

What Lives in This Repo

Architecture

Hook Pipeline

Key Design Decisions

1. Pinpoint Injection over Bulk Notification

2. Two-Phase Query Strategy

3. Human-Only Dismissal (Anti-Sycophancy Guard)

4. Batch Review Trigger

5. Resolution Lifecycle

6. Repository Scoping

Project Structure

Codex Activation Rule

Database Schema (Key Columns)

Inspiration

Tech Stack

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

claude-review-pdca

Status

At a Glance

The Problem

The Solution

Runtime Modes

Quick Start

1) Claude Code hook mode

2) Codex / manual mode

Codex / manual command example

What Lives in This Repo

Architecture

Hook Pipeline

Key Design Decisions

1. Pinpoint Injection over Bulk Notification

2. Two-Phase Query Strategy

3. Human-Only Dismissal (Anti-Sycophancy Guard)

4. Batch Review Trigger

5. Resolution Lifecycle

6. Repository Scoping

Project Structure

Codex Activation Rule

Database Schema (Key Columns)

Inspiration

Tech Stack

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages