Discipline

Trading Kill List: How to Build an Evidence-Based Performance Fix List (With AI)

Eight common leaks with detection methods, Kill List scoring matrix, monthly forensic workflow, 7-day action plan, AI Council multi-agent review, and printable starter kit checklist.

trading kill listAI trading reviewevidence based tradingmulti agent AI tradingtrading performance leaksforensic trade reviewhow to build a trading kill listAI council tradingkill list trading journalAI trading journalautomated trade reviewtrading Kill Listhow to review trading journalcompliance drift tradingspreadsheet trading journalmonthly trade reviewR-multiple reviewtrading journal improvement

You closed the week green. You "reviewed" on Saturday morning and felt productive. Three months later, the same mistake is still showing up on your biggest losers, and you cannot explain why your win rate improved while your account did not.

That gap is rarely about effort. It is about review design. Most traders treat their journal like a diary: what happened, how it felt, what P&L said. Real improvement needs a prioritized list of fixes ranked by evidence, not by which loss hurt the most.

Related guides: how to keep a journal that improves edge , why most journals lie to you , R-multiple discipline , and journal vs spreadsheet .

Traders call that a Kill List. This guide shows how to build one with structured data, an 8-leak detection framework, Kill List scoring, a monthly forensic workflow, a 7-day action plan, and when multi-agent AI (AI Council) accelerates the same process.

Key takeaways: (1) Manual reviews fail at scale because of recency bias, tag inconsistency, and no impact ranking. (2) Eight common leaks cover most performance drag, each has a detection method. (3) Rank fixes by evidence × impact, not by emotional sting. (4) A 45-minute monthly forensic workflow beats daily Kill List edits. (5) AI Council helps when cross-pattern questions exceed what you can run manually in one session.

Written by The Final Tape team, built for traders who measure discipline in data, not stories.

Proven framework: These eight leaks appear consistently across traders who scale beyond 100–200 structured trades, compliance drift after wins, tag clusters on losers, regime mismatch, and size deviations that P&L alone never surfaces.

Terms in this guide: Kill List = ranked queue of leaks; fix Rank 1 before Rank 2. Compliance drift = checklist % sliding lower over time or after wins. Evidence score = High / Medium / Low from sample size and consistency. Impact score = tag frequency × average R lost. Rank 1 = highest evidence + highest impact, written as one testable rule.

Trader reviewing structured trade notes for Kill List ranking
A Kill List turns review from storytelling into measurement.

Why Manual Trade Reviews Stop Working at Scale

Review from memory or a P&L column alone, and predictable biases take over. These are not character flaws. They are limits of human cognition once you pass ~100 structured trades.

BiasWhat happensResult
Recency biasLast big win or loss dominates the storyQuiet patterns across 40+ trades disappear
Survivorship biasWinners analyzed more than losersYou avoid the trades that need the most work
Tag inconsistencyNotes like "exited early" cannot be countedPatterns stay hidden
No impact rankingEvery issue feels equally urgentEffort spreads too thin
Time sinkTwo hours still misses systemic leaksLow return on review time
Non-repeatabilityDifferent questions every monthYou cannot measure improvement

Real example: compliance drift

A futures trader with 180 trades believed discipline was solid. Win rate and monthly P&L looked fine. The picture changed when they bucketed trades into groups of 20 and tracked compliance over time.

TradesAvg complianceKey finding
1–6094%Strong discipline on breakout setup
61–12083%Drop after four-week winning streak
121–18071%Expectancy turned negative on main setup
11 of last 25 losers missed "regime check" (breakouts in chop). Rank 1 rule: no breakout entry without confirmed trend regime.

Quick test: Split last 40 trades into two groups of 20. Compare average compliance. If the second group is 10+ points lower, you likely have drift.

Full forensic schema: why journals lie . Spreadsheet columns: journal vs Excel guide .

The 8 Most Common Trading Leaks (And How to Detect Them)

You cannot rank leaks you never measured. Eight fields are enough to start. These eight leak categories cover most performance drag we see across traders at scale.

FieldWhat to logWhy it matters
Setup nameWhich playbook this trade belongs toSetup-level analysis
Compliance %Checklist rules followed at entryFilters high-quality data
Planned risk ($)Dollar risk if stopped (1R)Accurate R math
Outcome (R)Net P&L ÷ planned riskTrue edge measurement
Exit tagFear exit, target hit, trail, etc.Behavioral patterns
RegimeTrend, chop, news, etc.Regime mismatch
Hold timeMinutes or hours in tradeWhen edge decays
Size vs planMore or less risk than intendedSizing drift
Do not wait for perfect data. Log these eight fields on every trade before building a Kill List.

Leak 1: Compliance drift

Rule-following drops after winning streaks or when confidence rises. Detection: chart compliance % by 20-trade bucket; note downward slope after green weeks.

Leak 2: Tag patterns on losers

Same exit or entry label clusters on losses. Detection: COUNTIF or pivot top negative tags on losers only; flag any tag on 30%+ of recent losers.

Leak 3: Exit quality

Early exits compress average winner R. Detection: compare avg R on "fear exit" vs "target hit" tags; check MFE/MAE ratio if logged.

Leak 4: Regime mismatch

Valid setup in wrong market conditions. Detection: filter by regime tag; compare expectancy in trend vs chop for each setup.

Leak 5: Streak behavior

Size or rules change after win/loss runs. Detection: compare compliance % and size vs plan in trades after 3+ consecutive wins or losses.

Leak 6: Holding time

Edge lives in a window you are not respecting. Detection: bucket hold time; chart avg R by bucket per setup.

Leak 7: R distribution gaps

One setup pays, another bleeds at similar win rate. Detection: expectancy in R per setup on ≥80% compliance trades only.

Leak 8: Size deviations

Actual risk exceeds planned 1R regularly. Detection: flag rows where planned risk > 1.05× target; count frequency per month.

LeakDetection methodRank signal
Compliance driftCompliance % by 20-trade bucket10+ point drop after wins
Tag patternsPivot exit_tag on losersTag on 30%+ of losers
Exit qualityAvg R by exit tagFear exits avg −0.3R+ worse than target
Regime mismatchExpectancy by regime per setupNegative R in one regime only
Streak behaviorPost-streak compliance + sizeRules slip after 3+ win run
Holding timeAvg R by hold-time bucketEdge outside your window
R distributionSetup expectancy on clean tradesWin rate OK, R negative
Size deviationsplanned_risk > 1.05× target3+ oversize rows per month

How to Build and Prioritize Your Kill List

Under an hour in Google Sheets once the trades tab is structured. Three tabs: trades (source data), pivots (candidate leaks), kill_list (ranked fixes).

TabColumns / contentPurpose
tradessetup, compliance_%, planned_risk_$, outcome_r, exit_tag, regimeSource data
pivotsCOUNTIF / pivot on losers by exit_tag; compliance by 20-trade bucketCandidate leaks
kill_listrank, rule, evidence, impact, metric, status, target_dateRanked fixes
One workbook. No parallel master_v3_FINAL copies.

Eight-step workflow:

  1. Step 1

    Export last 60–100 trades with all eight fields

  2. Step 2

    Pivot or COUNTIF top negative tags on losers only

  3. Step 3

    Chart compliance % by 20-trade bucket; note downward slope

  4. Step 4

    Per candidate: sample size, avg R impact, 2–3 trade IDs

  5. Step 5

    Score evidence (see rubric below)

  6. Step 6

    Score impact (see rubric below)

  7. Step 7

    Rank 1 = highest combined score; write one behavioral rule

  8. Step 8

    Re-measure same metric after 20 new trades before Rank 2

ScoreEvidence (step 5)Impact (step 6)
High20+ trades, same pattern in tags + compliance viewTag on 40%+ of losers or avg −0.5R+ per hit
Medium10–19 trades or one view onlyTag on 20–39% of losers or avg −0.25R to −0.5R
LowUnder 10 trades or inconsistentRare tag or avg under −0.25R
Rank 1 needs High evidence or High impact with Medium on the other axis. Do not rank on Low/Low.
Impact × Frequency matrixHigh frequency (30%+ losers)Medium (15–29%)Low (<15%)
High avg R loss (−0.5R+)Rank 1 candidateRank 1–2 candidateMonitor
Medium (−0.25R to −0.5R)Rank 1–2 candidateRank 2–3Backlog
Low (<−0.25R)Rank 2–3BacklogIgnore until sample grows
Use this matrix after scoring evidence. High/High = immediate Rank 1.
TaskSheets example
Count fear exits on losers=COUNTIFS(exit_tag,"fear exit",outcome_r,"<0")
Avg R on fear-exit losers=AVERAGEIF(exit_tag,"fear exit",outcome_r)
Compliance bucket avgAVERAGE of compliance_% for trades 61–80

Example Rank 1: "Missed regime check on 11/25 recent losers (−0.4R avg). Rule: no breakout unless 15m trend filter confirms. Measure: regime-check compliance % on next 20 trades."

Spreadsheet pivots for Kill List evidence and impact scoring
Rank by evidence and impact, not by how recently you lost money.

Monthly Forensic Review Workflow

Run once per month. Daily Kill List edits create noise. Weekly execution loop handles trade logging and one active fix.

WeekFocusOutput
Week 1Log every trade with full fieldsClean data
Week 2Compliance trend + tag pivotBaseline numbers
Week 3Top leaks; choose Rank 1One fix with evidence
Week 4Execute Rank 1 only; track behaviorMeasurable change
Month endRe-run analysis vs baselineClose item or keep working
45 minutes at month end. Same questions every month so improvement is measurable.

Weekly execution loop: 45-minute journal review . R discipline: R-multiple guide .

When Multi-Agent AI (AI Council) Makes the Biggest Difference

Past ~150–200 structured trades, manual pivots consume weekends and still miss cross-pattern links, streak behavior + exit quality + regime on the same subset. Multi-agent AI runs the same forensic questions in parallel: eight specialist reviews synthesized into one ranked Kill List.

AI Council is not a replacement for structured logging. It accelerates detection when human review time becomes the bottleneck.

Agent lensWhat it surfacesExample output
Performance AnalystExpectancy decay, R distribution by setup"ORB setup: +0.42R at 94% compliance, −0.18R below 80%"
Behavioral PsychologistSizing and comment patterns after streaks"Position size 1.3× planned after 3 wins — 8 of 12 post-streak losers"
Execution TacticianMFE/MAE abuse, premature exits"Avg winner captures 41% of MFE; fear exits leave 2.1R on table"
Risk AssassinDrawdown DNA, ruin-adjacent sizing"4 trades at 1.4R planned risk in 10 days — 68% of monthly drawdown"
Setup SurgeonPer-setup regime dependency"Pullback long: +0.6R in trend, −0.4R in chop — 71% of chop trades non-compliant"
Regime CartographerSession and volatility clusters"Tuesday NY open: 62% win rate but −0.1R expectancy — size drift"
Entry & Exit JudgeStop placement, chasing, target discipline"11/18 losers tagged fear exit within 5 min of entry on breakout setup"
Chief Coaching OfficerSynthesized Kill List ranked by $ impactRank 1: "No breakout without regime check — est. −$2,400/mo at current frequency"

The difference from a single chat prompt: each agent runs domain-specific analysis on your full trade tape, then the Chief Coaching Officer debates disagreements and outputs one prioritized Kill List, not a generic pep talk.

Ready to run eight specialist reviews on your trade history? Explore the AI Council workflow or start free with The Final Tape . Academy walkthrough: Kill List episode .

7-Day Action Plan to Create Your First Kill List

Day 1: Audit your columns

Compare your current log to the eight required fields. Add missing columns before exporting. Success: every field has a defined input rule.

Day 2: Export and fix R

Export last 50 trades. Recalculate R on 5 rows using planned risk at entry, not a global cell. Success: R math verified on sample.

Day 3: Run one leak detection

Pick compliance drift or tag patterns. Run the detection method from the leak table. Success: one candidate leak with sample size and avg R.

Day 4: Draft Rank 1 with scores

Score evidence and impact using the rubric. Plot on the Impact × Frequency matrix. Success: Rank 1 candidate with High/Medium on at least one axis.

Day 5: Write the behavioral rule

One testable rule for 20 trades. Include the metric you will re-measure. Success: rule is specific enough to pass/fail on the next trade.

Day 6: Block monthly review

Schedule 45 minutes on the first weekend of next month. Same weekday for weekly loop. Success: calendar blocked before you forget.

Day 7: Set re-measurement reminder

Reminder after 20 new trades on Rank 1. Do not open Rank 2 until Rank 1 metric moves or you have 20 fresh data points. Success: reminder set with the metric name.

Common Mistakes That Waste the Kill List Process

Fixing multiple items at once

Cannot tell what worked

Ignoring low-compliance trades when analyzing edge

Editing the Kill List daily

Noise instead of signal

Rejecting Rank 1 because it stings emotionally

Adding new setups before Rank 1 closes

Declaring victory without re-measuring the metric

Ranking from P&L color instead of tag frequency × R impact

Skipping the 20-trade re-measurement window

How The Final Tape's AI Council Automates the Kill List

Structured fields at submit

compliance %, planned risk, tags, regime, no post-hoc storytelling

Eight-agent parallel review

Performance, Behavioral, Execution, Risk, Setup, Regime, Entry/Exit, Chief Coach

Kill List ranked by dollar impact

Rank 1 fix with evidence, sample trades, and re-measurement metric

Setup DNA pages

per-setup expectancy and regime breakdown without manual pivots

Monthly Deep Audit + daily light refresh

forensic pass without rebuilding spreadsheets

Trade Lab

click any trade for instant multi-agent micro-analysis

Ready to automate your Kill List with multi-agent AI? Start free with The Final Tape or explore the AI Council and AI trading journal workflows.

Frequently asked questions

What is a Kill List in trading?

A ranked queue of performance leaks. Each item has evidence (example trades), impact in R, and one fix you execute before moving on.

How many trades do I need?

Prototype with 40–50 trades. Stable Rank 1 usually needs 100+ with consistent tags and compliance logging.

Can I do this in Excel?

Yes. Three tabs: trades, pivots, kill_list. Pivot tags on losers, chart compliance buckets, score evidence and impact by hand.

Do I need AI?

No. AI helps when cross-pattern questions exceed what you can run in 45 minutes manually. The methodology comes first.

What counts as closing Rank 1?

The targeted metric improved on a fresh 20-trade sample: higher compliance %, lower tag frequency, better exit R. Not a feeling.

How do I build a trading kill list from scratch?

Log eight fields per trade, run detection on the eight leak categories, score evidence and impact, write one testable rule for Rank 1, and re-measure after 20 new trades. The 7-day action plan and starter kit checklist walk through each step.

What is AI Council for trading journals?

AI Council runs eight specialist agents on your trade tape, performance, behavioral, execution, risk, setup, regime, entry/exit, and synthesizes a ranked Kill List by dollar impact. It automates the monthly forensic review when manual pivots no longer scale.

Traders who improve consistently are not the ones who journal the most. They identify the highest-impact leak, fix it, measure whether the metric moved, then move to the next item.

Start with structured logging. Build your first Kill List this week. Close one item with data before adding complexity. That loop compounds faster than almost anything else in trading.

The Final Tape scores compliance at submit, runs multi-agent AI Council reviews, and ranks your Kill List by dollar impact, built for traders who outgrow manual pivots. Try it free . See pricing or explore the AI Council workflow.

Related: prop journal guide , weekly journal loop , Kill List starter kit , Academy M01–M03.

Stop reviewing from memory

Run compliance scoring, tag ranking, and Kill List rules on every trade — not once a month when the account feels off.