What's the difference between filtering and curve-fitting?

Filtering applies pre-declared categories (setup, session, day-of-week, instrument) defined in your trading plan to decompose aggregate performance. Curve-fitting applies post-hoc categories discovered by inspecting the data after the fact ('trades on Tuesday afternoons after I had coffee'). The first generalizes forward because the categories exist independent of outcomes; the second produces inflated hindsight numbers that don't reproduce because the criteria were defined to fit existing results. The defense is pre-declaration — write filter categories before running analysis, not after. Discovered filters frequently show flattering numbers in backtest and regress to baseline in forward testing.

How many layers of filters can I apply?

3 layers maximum to preserve statistical reliability. Layer 1: setup type. Layer 2: setup × session. Layer 3: setup × session × day-of-week. Beyond 3 layers, per-cell sample size drops below 10 trades and variance dominates signal — apparent PF improvements at layers 4-5 are typically statistical noise that won't reproduce forward. The discipline isn't about finding maximum specificity; it's about finding actionable patterns with adequate sample size. Less drilling preserves more reliability. Most retail traders find their actual edge at 2-3 layers; deeper analysis produces over-fit conclusions that destroy framework value.

What if no filter produces a good edge?

Three possibilities. (1) Sample size insufficient — if you have fewer than 100 total trades, filter analysis can't reliably detect edge. Wait for 200+ trades. (2) Strategy lacks underlying expectancy — if all setups are PF below 1.0, the filtering framework can't create edge that doesn't exist; you need to fix strategy first via edge measurement. (3) Filtering applied wrong — try the 8 preset filters before custom multi-layer combinations. The presets often surface the dominant pattern faster than custom drilling. If presets and 3-layer cascade both fail to find positive-PF subsets, the problem is strategy, not filtering.

Should I only trade my filtered edge and nothing else?

Mostly yes, but with calibration. If your filtered edge represents 30-50% of original trade volume but produces 80-120% of original P/L, restricting to just the filter is structurally optimal. If the filter restricts volume below viable trading frequency (under 5 trades per month), you may need to relax to a slightly looser filter that preserves enough volume. The goal isn't maximum PF — it's maximum total P/L, which balances edge quality against trade frequency. Most retail traders' filtered edges produce both higher PF and acceptable volume; the constraint is rarely binding.

How often does my filtered edge change?

Monthly stability is typical for traders on stable strategies; quarterly review catches regime shifts. Re-run filter analysis monthly to verify the edge persists. Most filters that hold for 3 consecutive months are reliable enough to commit to schedule changes; filters that flip between months are variance, not signal. Markets shift regimes (volatility cycles, central bank cycles, correlation regimes), and filter validity can change with regimes. The pre-declared filter discipline plus monthly validation produces stable enough edges to act on; without monthly re-validation, traders commit to filter findings that have already become stale.

What's the most common filtering mistake?

Drilling past 3 layers. Multi-dimensional filtering produces apparently dramatic results — PF 5.0 on the 4-layer filter looks much better than PF 2.0 on the 2-layer filter. But the 4-layer cell typically has 6-12 trades, where variance dominates signal. The PF 5.0 is statistical noise that won't reproduce. Counter-discipline: stop at 3 layers; the gain in apparent edge from layer 4 is overwhelmed by the loss in statistical reliability. Less is more in filtering depth. Most over-fit findings come from traders who couldn't resist drilling deeper for more flattering numbers.

How is this different from impact analysis?

Filter analysis identifies the subset where edge lives (positive direction); impact analysis quantifies what happens when you remove the destructive subset (negative direction). Both are subset analyses on the same data, but ask different questions. Filter analysis: 'where does my edge concentrate?' Impact analysis: 'how much would removing my worst trades improve my P/L?' Run filter analysis first to identify the productive subset; then run impact analysis to quantify the cost of the destructive subset. Together they produce the complete picture: keep these subsets, eliminate those subsets, expected P/L improvement is X.

How to Filter Down to Your Actual Trading Edge

Your aggregate trading stats lie about your real edge. A profit factor of 1.2 across 150 trades sounds mediocre — and the conclusion most traders draw is "my strategy is marginal, I need to find a better one." But the aggregate is an average of very different subsets. Hidden inside those 150 trades is typically one PF 2.0+ subset (your actual edge) and one PF 0.5 subset (trades that destroy the edge). The average hides both. Filtering reveals them — so you can grow one and kill the other. Most retail traders don't have a strategy problem; they have a filtering problem. The edge is already in their journal data; they just haven't isolated it.

This guide covers the 3-layer filtering cascade (Setup → Session → Day-of-Week) for systematic edge discovery, the sample-size discipline that prevents overfitting, the eight quick filter presets that produce immediate before/after comparisons, the five common filtering traps that destroy analytical value, and the practical decision framework for what to do once your filtered edge is identified — without restricting yourself into a strategy that produces 5 trades per month.

Filtering methodology adapts stratified sampling principles from statistical analysis to discretionary trading data. The cascade structure (broad-to-specific layers) reflects standard hierarchical analysis from data science. Sample-size requirements per cell (20+ trades for moderate confidence) reference standard statistical sample-size determination. Specific dollar figures and PF thresholds illustrate typical observations from aggregated journal data; individual trader patterns vary substantially based on strategy, instrument, and trading style.

The filtering principle in one sentence: Your overall PF is an average that hides specific subsets — some that produce real edge and others that destroy it. Filtering peels back the average to reveal both, allowing you to grow the productive subset and eliminate the destructive one without changing your underlying strategy.

Why Your Aggregate Stats Are Lying to You

Imagine a restaurant that averages 3.5-star rating. Seems mediocre. But drill down: the dinner menu is 4.8 stars; the lunch menu is 2.1 stars. The restaurant isn't mediocre — it's excellent at dinner and terrible at lunch. The 3.5 average hides a specific, fixable problem.

The Restaurant Pattern in Trading Data

Your trading stats work the same way. A PF of 1.2 across 150 trades might actually be:

BOS+FVG setup: PF 2.4 (42 trades)
Range breakout: PF 1.1 (38 trades)
Untagged trades: PF 0.5 (70 trades)

Your edge is the BOS+FVG. Everything else is dragging it down. But without filtering, all you see is the 1.2 average — which makes you think the whole strategy is marginal. The conclusion you'd draw from the aggregate (find better strategy) is opposite to what your data actually says (cut the destructive subset, keep the working one).

Why Most Traders Misread Their Own Data

Aggregate-only thinking is the default for two reasons. First, journals report aggregate metrics by default — single P/L line, single win rate, single profit factor. The decomposed view requires explicit filtering setup, which most traders never configure. Second, the aggregate is psychologically simpler — one number is easier to relate to than five subset numbers. The simplicity has a cost: the aggregate systematically misrepresents what the strategy actually does, and the misrepresentation drives wrong-direction interventions (changing strategy when subset filtering is what's needed).

The Filtering Cascade: From Broad to Specific

Don't jump to triple-filtered analysis. Work through the cascade one layer at a time:

Layer 1: Setup Type

Always the first filter. Break trades by setup tag:

Setup	Trades	Win Rate	PF	Avg P/L	Verdict
BOS + FVG	42	60%	2.4	+$38	Core edge
Liquidity sweep	25	52%	1.5	+$18	Secondary edge
Range breakout	38	45%	1.1	+$4	Marginal — review
Untagged / no setup	45	33%	0.5	−$32	Cut entirely

Decision: Keep BOS+FVG and liquidity sweep. Put range breakout on probation (tighten criteria). Eliminate untagged trades. Already your filtered universe is PF ~1.9 instead of 1.2 — you haven't changed strategy, just removed what doesn't work. See setup performance breakdown for the full per-setup decomposition framework.

Layer 2: Session

Take surviving setups and break by session:

Setup + Session	Trades	PF	Verdict
BOS+FVG, London AM	28	2.8	Peak edge
BOS+FVG, Overlap	10	1.6	Good (small sample)
BOS+FVG, NY PM	4	0.7	Cut (no edge here)
Liq sweep, London AM	15	1.8	Solid
Liq sweep, Overlap	8	1.0	Marginal (small sample)

Decision: BOS+FVG is a London AM setup. Don't take it in NY PM. Liquidity sweep works in London AM too. Your edge lives in one session.

Layer 3: Day of Week (Optional)

Only add this layer if you have 150+ total trades, ensuring 10+ trades per cell:

BOS+FVG, London AM	Monday	Tuesday	Wednesday	Thursday	Friday
Trades	5	7	8	5	3
Win Rate	60%	71%	63%	60%	33%
PF	1.8	3.2	2.5	2.0	0.5

Decision: Tuesday and Wednesday are the peak. Friday kills the setup (only 3 trades but PF 0.5 — consistent with the broader Friday pattern). Monday and Thursday are fine.

Your Filtered Edge

BOS+FVG setup, London AM session, Tuesday-Thursday. If this combination has 15-20+ trades with PF 2.5+, you've found your specific actionable edge. Trade more of this. Trade less of everything else. The full filtered subset typically represents 30-50% of original trade volume but produces 80-120% of original net P/L.

Building Your Filter Stack: Systematic Process

Use this framework to systematically find your edge:

The 5-Step Filter Process

Start broad: PF by setup type. Find the top 1-2 setups.
Add session: PF by setup × session. Find which sessions amplify the edge.
Check day: PF by setup × session × day. Find if any day kills the edge.
At each layer: Check sample size. Need 20+ trades per cell for reliability.
Stop when sample gets small. PF 4.0 on 6 trades is noise, not edge.

The Sample-Size Floor

Per-cell sample size determines whether the analysis is signal or noise. 20+ trades per cell produces moderate-confidence conclusions; 50+ produces high-confidence. Below 10 trades per cell, you're fitting noise — the apparent pattern won't reproduce when you commit to it forward. Stop adding filter layers when any cell drops below 10 trades; the gain in specificity is offset by the loss in reliability.

8 Quick Filter Presets

If you don't know where to start, try these preset filters on your data:

Preset Filter Library

#	Filter	What It Reveals
1	Long trades only	Do you have a directional bias? Many traders are better long than short.
2	First 3 trades of day only	Is overtrading killing your edge? Early trades are usually best.
3	Trades with tagged setup only	How much do unplanned trades cost you?
4	London session only	Is your edge session-specific?
5	Tuesday-Thursday only	Are Monday/Friday dragging you down?
6	Exclude trades within 10 min of a loss	Revenge trade removal — the instant P/L boost.
7	Risk ≤ 1% only	Does oversizing hurt your performance?
8	Top instrument only	Is one pair carrying everything while others drain?

How to Use the Preset Library

Run all 8 presets. Compare each to your baseline aggregate. The preset that shows the biggest improvement is your priority fix. Most retail traders find that 1-2 presets show dramatic improvements (Preset 3 "tagged setups only" or Preset 5 "Tuesday-Thursday only" are the most common winners) while others show small improvements. Implement the dramatic-improvement preset first; layer in additional filters only after the first fix is established.

The Hidden Deal-Breaker: The Overfitting Trap

The filtering framework's biggest failure mode is overfitting — finding flattering patterns by data-mining without statistical validity. Run enough filter combinations on the same dataset, and one or two will show beautiful PF 4.0 results purely by chance. Acting on those flattering results produces forward performance that reverts to baseline because the apparent edge was statistical noise, not structural signal.

Three Specific Overfitting Patterns

Multi-layer drilling without sample protection. Setup × session × day × instrument × direction = 5 layers. With 150 total trades, the average cell has 1-2 trades. Apparent PF in those cells is meaningless — variance dominates entirely. Stop drilling when any cell drops below 10 trades; better yet, stop at 3 layers regardless of cell size.
Multiple comparison cherry-picking. Run 8 preset filters, find that "Preset 6 (no revenge trades)" shows PF 2.8 vs aggregate 1.2. But you ran 8 filters; one or two were guaranteed to show flattering numbers by chance. The improvement may be real or may be statistical artifact. Counter-discipline: validate with 30-60 days of forward trading on the filter; persistent improvement is real, regression to baseline is the artifact signal.
Post-hoc filter discovery. "After looking at the data, I notice that trades I took on days when ES was up 0.5% had PF 3.2." This is data-fitting, not strategy discovery. The filter was discovered by inspecting outcomes; it doesn't generalize forward because the criterion was defined to fit existing data, not based on a hypothesis about market mechanics.

The Pre-Declared Filter Discipline

Three preconditions for legitimate filtering: (1) Pre-declare filter categories before running analysis — setups, sessions, days, instruments are pre-declared; "trades on Tuesday afternoons after I had coffee" is post-hoc. (2) Limit to 3 filter layers maximum to preserve cell sample size. (3) Validate with forward trading — if the filter holds for 30-60 days of subsequent trading, the edge is real; if results regress to baseline, the original analysis was data-mining. See impact analysis for the survivorship-bias trap that's the same pattern in different framing.

Practical read: Filtering is genuinely powerful when applied to pre-declared categories with adequate sample sizes; it's a noise-generation machine when applied to post-hoc data inspection across many combinations. The discipline isn't in the filter math — it's in protecting filter discovery from the cognitive bias toward flattering results. Most "filtered edge" findings that don't reproduce forward came from overfitting, not from market regime change.

Filtering infrastructure determines whether the framework produces actionable insight or self-flattering noise. Manual filter analysis in spreadsheets is slow and produces drift over time as filter definitions slip. Automated journals with built-in filter builder enforce locked categories and produce per-filter equity curves automatically. The trading journal comparison covers which journals support multi-layer filter analytics. The paired equity curve comparison covers the visualization layer of filter results. The impact analysis guide covers the quantitative tool that builds on filter findings. The setup performance breakdown covers Layer 1 in detail.

Common Filtering Traps

5 Traps That Destroy Filter Analysis Value

Overfitting: 4+ layer filter with 8 trades per cell is meaningless. Keep sample at 20+ per cell, 3 layers maximum.
Survivorship bias: Your current trades survived because you took them. What about setups you saw and skipped? Filtering only works on what's in your journal. Document skipped setups in a "watched but didn't take" log to reduce this bias.
Static thinking: The filter that works this quarter might not work next quarter. Re-validate regularly — quarterly minimum, monthly preferred.
Action paralysis: You find 5 insights but implement none because you can't decide which matters most. Pick the highest-PF improvement and start there. Single-fix implementation beats multi-fix planning that never gets executed.
Cutting too much: If your filter leaves 5 trades per month, you can't make a living. Ensure remaining volume is viable. The goal isn't maximum PF — it's maximum total P/L, which requires balancing edge quality against trade frequency.

3 Mistakes Traders Make With Filtering

Mistake 1: Drilling Past 3 Layers

Multi-dimensional filtering produces apparently dramatic results — PF 5.0 on the 4-layer filter looks much better than PF 2.0 on the 2-layer filter. But the 4-layer cell typically has 6-12 trades, where variance dominates signal. The PF 5.0 is statistical noise that won't reproduce. Stop at 3 layers; the gain in apparent edge from layer 4 is overwhelmed by the loss in statistical reliability. Less is more in filtering depth.

Mistake 2: Treating Aggregate as the Real Performance

Some traders see their aggregate PF 1.2 and conclude the strategy doesn't work, then abandon it for a new approach. The aggregate often hides a profitable subset that would compound dramatically if isolated. Always run Layer 1 (setup decomposition) before concluding aggregate performance reflects strategy quality. The aggregate is the average; the components reveal the structure.

Mistake 3: Implementing Multiple Filter Changes Simultaneously

Found 4 promising filter improvements? Don't implement all 4 next week. Implementing simultaneously means you can't attribute forward improvement (or degradation) to specific filters. Phase the changes: implement filter 1 for 30 days, evaluate, then add filter 2. Slower but produces actionable feedback. Multi-change implementation produces unattributable results that prevent learning.

Who Should Skip Filter Analysis (For Now)

Traders with fewer than 100 total trades. Per-cell sample size will be too small for meaningful filter results. Wait until 200+ total trades before running multi-layer filter analysis. Layer 1 (setup decomposition) is usable at 100+ total; layers 2-3 require 200+.
Traders without consistent setup tagging. Filter Layer 1 requires categorical tags. Untagged data produces an "unknown" bucket that dominates and prevents filtering. Tag retroactively from journal notes or commit to forward tagging for 60-90 days before running filters.
Single-setup traders. Layer 1 reduces to "all trades" for single-setup approaches. Layers 2-3 still apply (session, day filtering on the single setup). The framework works but with fewer dimensions; expect fewer dramatic findings than multi-setup traders.
Algorithmic traders. Systematic strategies should be analyzed via backtesting and walk-forward analysis rather than discretionary filter analysis. Filter findings on algorithmic data often suggest parameter optimization opportunities that produce overfit strategies if implemented naively.
Traders mid-strategy-transition. Filter analysis presupposes a stable strategy across the analysis window. Strategy changes within the last 30 days mean filter findings blend two systems. Stabilize first; analyze second.

Methodology Note

Filtering methodology: Adapts stratified sampling and hierarchical analysis principles from statistical analysis to discretionary trading data. The 3-layer cascade reflects standard hierarchical decomposition with the broad-to-specific drilling pattern.
Sample size requirements: Minimum 20 trades per cell for moderate-confidence filter conclusions; 50+ for high confidence. Below 10 trades per cell, variance dominates and conclusions are unreliable. Total dataset 100+ trades for Layer 1; 200+ for Layers 2-3 combined.
3-layer maximum: Hard limit to prevent overfitting. Apparent edge from layer 4-5 typically reflects statistical noise rather than structural pattern. Drill depth follows sample-size capacity, not analytical curiosity.
Pre-declared filter discipline: Filter categories must be definable in advance — pre-declared via your trading plan, not discovered post-hoc through data inspection. Discovered filters produce inflated hindsight numbers that fail to reproduce forward.
Forward validation requirement: Filter findings should be validated with 30-60 days of forward trading before committing to schedule changes. Persistent improvement is real edge; regression to baseline indicates original analysis was overfitting.

For our full editorial process, see our editorial methodology.

Final Verdict: The Edge Is Already in Your Journal

Your aggregate stats are an average that hides your real edge. Filtering peels back the layers to reveal where you actually make money — which setup, which session, which day, which conditions. Once you see it, the path forward is obvious: do more of what the data says works, do less of everything else. The strategy doesn't need to change; the filtering does.

The framework's biggest failure mode is overfitting — finding flattering patterns by data-mining without statistical validity. The defense: pre-declared filter categories, 3-layer maximum, 20+ trades per cell minimum, forward validation over 30-60 days before committing. Without these disciplines, filter analysis produces self-flattering noise that doesn't reproduce when committed to forward.

Three principles from the framework:

Aggregate hides; components reveal. Always decompose by setup before drawing conclusions about strategy quality.
3 layers maximum, 20+ trades per cell minimum. Drill deeper than this and you're fitting noise instead of finding signal.
Pre-declared categories generalize forward; discovered filters don't. Filter discovery through data inspection produces inflated hindsight that doesn't reproduce.

For related analysis: equity curve comparison for the visual filtering technique that complements per-cell metrics, impact analysis for the counterfactual cut simulation that builds on filter findings, setup performance breakdown for Layer 1 deep dive, trade quality score for the per-trade discipline grading that combines with filter analysis, session performance comparison for Layer 2 deep dive, and Friday P/L analysis for Layer 3 day-of-week pattern analysis.

How to Filter Down to Your Actual Trading Edge

Would this setup survive?

Why Your Aggregate Stats Are Lying to You

The Restaurant Pattern in Trading Data

Why Most Traders Misread Their Own Data

The Filtering Cascade: From Broad to Specific

Layer 1: Setup Type

Layer 2: Session

Layer 3: Day of Week (Optional)

Your Filtered Edge

Building Your Filter Stack: Systematic Process

The 5-Step Filter Process

The Sample-Size Floor

8 Quick Filter Presets

Preset Filter Library

How to Use the Preset Library

The Hidden Deal-Breaker: The Overfitting Trap

Three Specific Overfitting Patterns

The Pre-Declared Filter Discipline

Common Filtering Traps

5 Traps That Destroy Filter Analysis Value

3 Mistakes Traders Make With Filtering

Mistake 1: Drilling Past 3 Layers

Mistake 2: Treating Aggregate as the Real Performance

Mistake 3: Implementing Multiple Filter Changes Simultaneously

Who Should Skip Filter Analysis (For Now)

Methodology Note

Final Verdict: The Edge Is Already in Your Journal

Turn theory into proof.

Frequently Asked Questions

How to Filter Down to Your Actual Trading Edge

Would this setup survive?

Why Your Aggregate Stats Are Lying to You

The Restaurant Pattern in Trading Data

Why Most Traders Misread Their Own Data

The Filtering Cascade: From Broad to Specific

Layer 1: Setup Type

Layer 2: Session

Layer 3: Day of Week (Optional)

Your Filtered Edge

Building Your Filter Stack: Systematic Process

The 5-Step Filter Process

The Sample-Size Floor

8 Quick Filter Presets

Preset Filter Library

How to Use the Preset Library

The Hidden Deal-Breaker: The Overfitting Trap

Three Specific Overfitting Patterns

The Pre-Declared Filter Discipline

Common Filtering Traps

5 Traps That Destroy Filter Analysis Value

3 Mistakes Traders Make With Filtering

Mistake 1: Drilling Past 3 Layers

Mistake 2: Treating Aggregate as the Real Performance

Mistake 3: Implementing Multiple Filter Changes Simultaneously

Who Should Skip Filter Analysis (For Now)

Methodology Note

Final Verdict: The Edge Is Already in Your Journal

Turn theory into proof.

Frequently Asked Questions

Related Guides & Tools

Equity Curve Comparison Hack

Impact Analysis: Cut Worst Setups

Setup Performance Breakdown

Do You Have a Trading Edge?

Trade Quality Score

London vs NY vs Asia