Claude US30 Long Stopped in 35 Minutes: When 58% Conviction Meets a False Reclaim
May 27 long at 50,766. Trend Agent at 58% with REDUCE_SIZE, Macro Agent lean_bull at 50% — both sub-60% — but a clean reclaim of broken resistance pulled Claude in anyway. Stop hit at 50,706 for -$1,000 (SL).
Season 2 of the AI Trading Benchmark runs Claude Opus 4.7 and GPT-5.5 head-to-head across six instruments — EUR/USD, NAS100, US500, US30, USDJPY, and GBP/USD — on identical live market data, $1,000 fixed risk per trade, and a $50,000 starting balance per side.
This is Claude's tenth Season 2 trade and the second US30 long of the season. It is also the trade immediately after the May 26 NAS100 win, and it gives back almost exactly what that win produced. The setup is the framework's preferred US30 template — a long retest of broken intraday resistance with breadth supportive and yields softening — but with one specific weak spot: both the Trend Agent and the Macro Agent printed sub-60% confidence, and the Trend Agent explicitly flagged REDUCE_SIZE. The setup graded out at 6.5 out of 10. Claude entered anyway. The market closed the question 35 minutes later.
About reported results. Each setup defines three take-profit targets (TP1, TP2, TP3), but the broker closes the full position at TP1 — so the realized R-multiple is always TP1's distance from entry when any TP is hit, and -1R on a stop. The dollar P&L shown in this article is the actual broker close at TP1 (or stop). TP2 and TP3 are reported as informational levels: how far price ran after the broker had already exited.
Result
R-Multiple
AI Confidence
Win Rate
Season Record
Market Environment — May 27, 2026
Wednesday opened in the kind of mixed-signal environment that is the framework's hardest setup to read honestly. The NYSE advance/decline line printed 711, above its 5-day EMA of 698.6 — supportive of longs but recovering from a sharp two-day deterioration (931 → 377 → 711). VIX was 16.99, below its 5-day EMA of 17.08, sitting in the lower half of the normal-volatility regime that favors breakouts. The US 10-year Treasury yield was 4.487%, below its 5-day EMA of 4.524% and softening — equity-supportive. The dollar index at 99.05 was below its 5-day EMA of 99.15 — non-restrictive for multinationals.
Three of four breadth-and-macro reads were supportive. The fourth was the issue: the Macro Analysis Agent printed lean_bull at only 50% confidence on US30 specifically (61% on the group), with the explicit flag that US30 was below its own 5-day EMA and consolidating range-bound inside yesterday's range of 50,379–51,091. The Trend Authority Agent agreed directionally — BULLISH, TRANSITIONING — but at 58% confidence with the regime tag REDUCE_SIZE. The Trend Agent's invalidation sat at 50,628.7.
The framework's confluence gate requires at least four of seven inputs. The pre-trade analysis scored this setup at four out of seven confirmed plus one partial — Medium-High, but driven entirely by structure and breadth. Both agents printed below the 60% confidence threshold. The pre-trade analysis was explicit about this: "Agent confidences are sub-threshold but directionally aligned; structure and breadth carry the setup." It recommended position-size reduction to 0.5–0.75% effective risk, well below the standard 1%.
The intraday tape itself was the strongest argument for the long. Price had opened in the lower part of yesterday's range, formed a clean NY low at 50,572 — exactly at the Trend Agent's support level — reclaimed VWAP at 50,586, and broken decisively through the 50,755 resistance on the 14:00 UTC candle with volume. By 14:30 UTC the 15-minute structure had flipped fully bullish: EMA fast above slow, MACD positive, RSI 64.8. The trade thesis was simple: wait for the pullback into the broken-resistance retest zone at 50,755–50,775 and buy the reclaim.
That setup is what fired at 14:42 UTC.
US30 LONG
Setup: US30 Long Pullback to Broken Resistance
Analysis by SkyAnalyst AI
Strategy Analysis
The pattern Claude was trading
The template was a broken-resistance retest long — structurally identical in spirit to the NAS100 setup the day before, just on a different instrument and with different confluence math. Yesterday's intraday resistance at 50,755 had been broken on a strong 14:00 UTC candle. The retest of that level — flipped from resistance into support, with a clean 5-minute reaction candle and the 15-minute trend stack intact — is the structurally cleanest US30 entry on a transitional-bullish day.
The framework's pre-trade plan was specific. Entry zone 50,755–50,775, with the trigger being a 5-minute bullish rejection candle in the zone or a reclaim of 50,770 after a dip. Stop at 50,715, sitting below the 14:15 candle low of 50,760 and outside the 60-minute ATR of 59.5 points. TP1 at 50,827 (intraday swing high), TP2 at 50,855 (NY session high), TP3 at 50,898 (60-minute extension). R:R of roughly 1:1.7 to TP2.
The 6.5-out-of-10 dilemma
Every Claude setup gets a confluence-gate score. The May 27 US30 long graded 6.5 on the analysis card, which sits right at the floor of what the framework will trade. Four confirmations cleared cleanly: NYAD above its 5-day EMA, VIX compressed below EMA, no scheduled USD event within the entry window, and price at the structural retest level. Two confirmations explicitly failed: Macro Agent confidence at 50% (below the 60% threshold) and Trend Agent confidence at 58% (also below 60). One was partial: the 60-minute EMA stack had not yet fully bullish-stacked — fast was still below slow.
The framework's own logic says the agent-confidence misses are not absolute vetoes. They are conviction limiters. A 6.5 setup with structure and breadth carrying the weight is tradeable — at reduced size. The Trend Agent's REDUCE_SIZE flag was the explicit instruction: 0.5–0.75% effective risk, not the standard 1%.
This is the part where the methodology disagrees with the broker book. Season 2 risk is flat $1,000 per trade by design — the framework's REDUCE_SIZE recommendation has no mechanism to scale that down. The agent flagged "trade smaller." The book trades the full $1,000 regardless. Either the framework's risk-sizing instruction is wrong, or the Season 2 flat-risk rule needs an override path for REDUCE_SIZE setups. We log it here and leave the design question for the ledger.
The single evaluation
Claude's evaluation chain on this trade was one step long. At 14:42 UTC, with price at 50,771 — 80% of the way through the entry zone — Claude assessed: the 15-minute forming candle had wicked to 50,762 and reclaimed 50,770. That matched the "reclaim of 50,770 after a dip" trigger from the pre-trade plan. Trend Agent had ticked up to BULLISH/TRENDING at 66%. MACD was bullish across timeframes, price was above VWAP and EMAs, and the NY session high at 50,855 had already been tagged earlier in the session, demonstrating buyer strength at higher prices.
The acknowledged risk in Claude's own reasoning: price was already at the upper 2-SD VWAP band and the 5-minute RSI was at 65+. The pullback to the zone provided what Claude called "a clean R:R toward TP1 at 50,827." Decision: ENTER at 62% confidence. Position opened at 50,766.
What happened next
Thirty-five minutes. Price made one push toward 50,790, failed to take 50,800, and rolled. The 15-minute candle that triggered the entry — the wick-and-reclaim — turned out to be the high of the session's second leg. By 15:17 UTC, price had traded back through the entry-zone low at 50,755, through the 60-minute EMA fast, and into the stop at 50,715. Exit printed at 50,706.
The trade booked -1.0R, exactly -$1,000 (SL). Loss duration: 35 minutes. Among the shortest stop-out windows in Season 2 so far.
Reading the loss
The honest reading is that the setup was a 6.5-out-of-10 by the framework's own scoring, and a 6.5 trades closer to coin-flip than to edge. The structure and breadth carried it past the confluence gate; the agent-confidence numbers were the warning the framework gave itself, and the warning was correct. The Trend Agent at 58% TRANSITIONING is, by the system's own definitions, a regime where the underlying trend is unclear and momentum is shifting. That is the worst regime in which to take a continuation trade off a single 15-minute wick.
The post-mortem is straightforward. Either the framework needs to enforce its own REDUCE_SIZE recommendation on the position-sizing side — converting "0.5–0.75% risk" into something the simulated $1,000 risk can actually express — or it needs to raise the agent-confidence floor for Medium-High setups from 5/7 to 5/7-with-both-agents-at-or-above-60. The May 27 trade is the case study for either fix.
For the public ledger, the result is -1R / -$1,000 (SL). Claude's running balance moves from $48,114 to $47,114. The week's modest win on NAS100 gets given back, in full, in less than an hour.
BULLISH 58%Regime:
TRANSITIONINGMacro:
SUPPORTIVEneutral 58%USD remains firm but off recent highs as safe-haven demand eases with improving Middle East ceasefire hopes and lower oil prices, while Fed expectations remain "higher for longer" on resilient US data. Tomorrow's Core PCE and Preliminary GDP will be the key macro pivot for all USD pairs, with rate differentials and commodity dynamics driving divergent outcomes across the forex complex.
16.49 (normal)99.2 neutral4.48 stable92.5 bearishWhat the AI Actually Saw
Full 6.3k-character analysis produced by the AI before entering this trade. Unedited.
- Multi-timeframe technical analysis
- Key support and resistance levels
- Pattern recognition and trade setup identification
- Risk-reward assessment with defined invalidation
- Macro context and cross-asset correlation check
Powered by SkyAnalyst AI
SkyAnalyst AIUS30 NY AM Session Analysis — 10:30 ET
1. Breadth Assessment (Primary Driver)
| Metric | Current | 5-Day EMA | Signal |
|---|---|---|---|
| NYAD | 711 | 698.6 | Above EMA ✅ |
| NYAD vs 5d range | Mid (5d low ~95, high ~1326) | — | Neutral, not extreme |
| VIX | 16.99 | 17.08 | Below EMA ✅ (breakout-favorable) |
Read: NYAD is positive and above its 5-day EMA — supports a long bias. Breadth is recovering from a sharp 2-day drop (931 → 377 → 711) but is not at a 5-day high, so no breadth-extreme veto on shorts either. VIX is sub-EMA with no spike → standard stops acceptable, breakouts favored over mean-reversion.
Default directional bias: LONG, moderate conviction.
2. Macro Regime & Cross-Asset
- Macro Agent: lean_bull (50% confidence on US30, 61% on group), tradeability moderate (60). Flags: price below 5d EMA and range-bound consolidation — soft headwind.
- DXY: 99.05, below 5d EMA (99.15), within yesterday's range → non-restrictive for multinationals ✅
- US10Y: 4.487%, below 5d EMA (4.524%), softening → supportive for equities/financials ✅
- Gold: Below yesterday's low → risk-on rotation signal
- Catalyst: Thursday 8:30am ET data (Core PCE/GDP/Claims) — not today, no high-impact USD event scheduled.
Regime classification: Risk-on / transitional-bullish. NYAD positive + VIX below EMA + yields softening + DXY softening = constructive backdrop. Macro confidence is the weak link (only 50% on US30).
3. Trend Structure & Key Levels
Trend Agent: BULLISH, 58% confidence, TRANSITIONING, REDUCE_SIZE.
- Key Levels: R = 50,755 / S = 50,572 / VWAP = 50,586
- Invalidation: 50,628.7
60min: Price 50,804 sits above fast EMA (50,638) and slow EMA (50,655), MACD just flipped to histogram positive with line still slightly below zero (-4.5). EMA stack is not yet fully bullish (fast still below slow) — partial confirmation. RSI 60.8.
15min: Strong bullish reversal. EMA fast (50,667) > slow (50,632), MACD line +26 with strong histogram, RSI 64.8. Price has broken decisively above 50,755 Trend Agent resistance — resistance has flipped to support.
Today's structure: Opened in lower part of yesterday's range, formed NY low at 50,572 (= Trend Agent S), reclaimed VWAP, then broke through 50,755 resistance on the 14:00 candle with volume.
4. Lower-Timeframe (5m) Entry Read
- Opening range break: Confirmed bullish — broke 50,731 OR high on 14:10 candle, sustained above 50,800.
- Current price extension: 50,804 sits at upper_2sd VWAP band, RSI ~65, MACD strong but price is 218 pts above VWAP — chasing here is poor R:R.
- Best entry profile = pullback to broken resistance / EMA9 5m. The 50,755 prior resistance + 50,765–50,775 EMA9/upper_1sd zone is the high-probability second-chance entry.
- Today's high 50,855 = today's high resistance / approaching upper_band_1x on 60m (50,863).
5. Confluence Gate — Long Pullback Setup
| # | Confluence | Status |
|---|---|---|
| i | NYAD positive (711 > EMA 698.6) | ✅ |
| ii | VIX below EMA (16.99 < 17.08) | ✅ |
| iii | Macro Agent lean_bull (50% — below 60 threshold) | ❌ |
| iv | Trend Agent bullish (58% — below 60 threshold) | ❌ |
| v | 60m EMA stack supports long (fast still < slow) | ⚠️ Partial |
| vi | Price at structural level (50,755 broken-resistance + EMA9 5m) on pullback | ✅ (pending) |
| vii | No high-impact USD event within 30 min | ✅ |
Score: 4/7 confirmed + 1 partial = Medium-High (6.5/7.5 band). Agent confidences are sub-threshold but directionally aligned; structure and breadth carry the setup.
🟢 Setup 1 — LONG Pullback to Broken Resistance
| Parameter | Level |
|---|---|
| Directional Bias | Long (with-trend continuation) |
| Entry Zone | 50,755 – 50,775 (broken R + EMA9 5m + upper_1sd VWAP) |
| Entry Trigger | 5m bullish rejection candle in the zone (wick + close back above 50,770), OR reclaim of 50,770 after a dip |
| Stop Loss | Below 50,715 (under 14:15 candle low 50,760 → buffer to 50,715, below 1x 60m ATR support and above Trend Agent invalidation 50,628.7) |
| TP1 | 50,827 (today's intraday swing high) — ~1.2R |
| TP2 | 50,855 (today's high / 60m upper_band_1x 50,863) — ~2R |
| TP3 | 50,898 (60m S2 / Fib 23.6) — ~3R (only if NYAD pushes new 5d highs) |
| R:R | ~1.7–2:1 to TP2 |
| Confidence | Medium-High (6.5/10) |
| Invalidation | 5m close below 50,715 OR break of Trend Agent invalidation 50,628.7 |
Confluences supporting: NYAD positive, VIX compressed, broken resistance retest at 50,755, 15m bullish stack, OR break sustained past 9:45, no scheduled USD event.
Risks:
- Macro Agent flagged range-bound consolidation within yesterday's range (50,379–51,091) — upside may stall before TP3.
- Trend Agent at 58% with REDUCE_SIZE recommendation — size conservatively.
- 60m EMA fast still below slow → transitional, not confirmed trend.
- Price extended on 5m (RSI 65, upper_2sd VWAP) — do not chase; require pullback.
❌ No Short Setup
A short here fails the gate:
- NYAD positive ❌
- VIX below EMA ❌
- Trend Agent bullish ❌
- Macro lean_bull ❌
- Price broke above 50,755 resistance ❌
Only 1/7 confluence — No high-probability short setup.
Risk Management Notes
- Position sizing: Given Trend Agent's REDUCE_SIZE flag and sub-60 confidence on both agents, use 0.5–0.75% equity risk (vs. standard 1%) — express as R, not lots.
- Stop is structural (~60 pts): Within US30 normal range (50–100 pts), above 1x 60m ATR (59.5), and above Trend Agent invalidation. Acceptable.
- If price runs to 50,855+ without a pullback to entry zone, stand aside — chasing the 200+ pt move above VWAP gives back 50–80% per the post-extension rule.
- Time stop: If not filled by 11:30 ET, reassess — London close volatility wanes and lunchtime chop reduces edge.
Summary
| Item | Read |
|---|---|
| Bias | Long on pullback |
| Regime | Risk-on / transitional-bullish |
| Best Entry | 50,755–50,775 retest |
| Veto check | None triggered |
| Setup quality | Medium-High (5/7) |
| Conviction limiter | Both agents <60% confidence; macro flags range-bound |
Wait for the pullback. Do not chase 50,800+.
1 Evaluation — Quick Entry
The AI evaluated US30-Pepperstone 1 time before reaching a final decision. Each step shows confidence, reasoning, and whether the model was ready to enter.
Price is at 50,771 (80% of zone) right at the broken resistance retest level. The 15m forming candle shows a wick to 50,762 with reclaim back above 50,770, satisfying the "reclaim of 50,770 after a dip" trigger. Trend Agent confirms BULLISH/TRENDING at 66%, MACD bullish across timeframes, price above VWAP and EMAs, and NY high at 50,855 already tagged showing buyer strength. Risk is the upper-2SD VWAP extension and RSI 65+, but the pullback to zone provides a clean R:R toward TP1 at 50,827.
Account Performance
Profit taken at TP1 — the full position is closed at the first target to keep results measurable and comparable across models.
Key Takeaways
A 6.5/10 confluence score is at the floor of what the framework will trade, and 6.5 trades are not edge trades. Four confirmations cleared, two agent-confidence checks explicitly failed, and one structural check was partial. Structure and breadth carried the setup past the gate. The agent-confidence misses were the warning the system gave itself, and the warning was correct.
Season 2 flat-$1,000 risk and the framework's REDUCE_SIZE recommendation are not currently reconciled. The Trend Agent's regime tag explicitly recommended 0.5–0.75% effective risk on this setup. The Season 2 methodology trades $1,000 flat regardless. Either the risk-sizing instruction needs an override path, or the confidence-floor rule needs to be enforced harder on the entry-gate side. The May 27 US30 trade is the case study for the design fix.
A 15-minute wick-and-reclaim is the lowest-quality trigger inside the framework's published playbook. The candle that triggered the entry — wick to 50,762, reclaim of 50,770 — closed within 15 minutes as the high of the session's second leg. The reclaim was not a reclaim. It was the top of the bounce. Higher-conviction triggers (5-minute close with momentum confirmation, two consecutive reclaim candles, breakout-of-the-reclaim-bar high) would have either filtered the trade out or signaled the failure earlier.
TRANSITIONING is the worst regime to take a continuation trade. Trend Agent at BULLISH 58% TRANSITIONING means the underlying trend is unclear and momentum is shifting. The framework's own regime definition says continuation setups are lowest-edge in this state. The May 27 US30 long was, by the system's own taxonomy, the wrong type of trade for the regime that was actually printing.
The week now stands at +$781, -$1,000 — net -$219 across two trades. The May 26 NAS100 win has been functionally erased. Claude exits May 27 at $47,114 on the season, down $2,886 from the $50,000 starting balance and roughly $2,000 behind GPT-5.5 on the head-to-head.
The 6.5-confluence setup is the framework's gray zone. The May 27 US30 trade entered that zone with both agents below the 60% conviction floor and the Trend Agent explicitly flagging REDUCE_SIZE. The Season 2 methodology has no mechanism to honor that recommendation, so the trade went on at full flat risk and resolved as a -1R stop inside 35 minutes. The design question the ledger raises is whether the entry gate needs a hard confidence floor, or whether the risk system needs a soft-size override. Either fix would have left the May 26 NAS100 win on the book. — Isaac, Senior Research Editor
Compare with Eduardo’s analysis →Methodology
Both AI models receive identical market data, identical infrastructure, and identical risk parameters. No prompt engineering. No human intervention. Standard API temperature (0.0). Trades executed on demo accounts with institutional spread conditions via Pepperstone Markets. Each model operates with a $50,000 starting balance and 2% risk per trade. All positions are closed at TP1 — the first take-profit target — to keep results measurable and directly comparable across models.
Forex pairs and gold (XAUUSD) have standardized pricing across brokers — the prices in this article will closely match what you see on your own platform. US index CFDs (NAS100, US30, US500) are different: each broker constructs its own index price feed, so entry prices, stop distances, and P&L figures for index trades are specific to Pepperstone Markets. All trades in this experiment were analyzed, executed, and settled on Pepperstone demo accounts using Pepperstone's price feed.
Why This Cannot Be Replicated in ChatGPT or Claude Alone
Copying the analysis prompt into ChatGPT or Claude will not reproduce these results. Neither model has access to live market data — and the data is the foundation of everything.
Every analysis session, SkyAnalyst AI assembles a structured data packet of 50,000–100,000 tokens per instrument from live broker APIs. This is not a price quote. It contains 5 hours of multi-timeframe candle data across 60-minute, 15-minute, and 5-minute charts — each candle carrying full indicator overlays: EMA fast/slow, ATR, MACD with histogram, RSI, volume with SMA, VWAP with standard deviation bands, and others. On top of that: session structure levels (Tokyo, London, New York highs and lows), Fibonacci retracement and extension levels, a rolling 5-day macro window covering the 10Y yield, DXY, VIX, NYAD breadth, oil, and gold — along with additional proprietary data layers, all formatted as structured JSON specifically designed for LLM consumption.
The model never starts from raw data. Before Claude or GPT sees anything, two proprietary SkyAnalyst AI agents — among other internal systems — have already processed the environment: the Macro Analysis Agent produces directional bias with confidence scores and tradeability ratings across intraday and multi-day horizons, while the Trend Authority Agent evaluates technical structure — EMA alignment, momentum, regime classification — and outputs direction, confidence, key levels, and invalidation prices. The trading model synthesizes what these agents and preprocessing layers have already evaluated. This multi-agent pipeline is what produces the quality of analysis shown in this article — a single prompt to a single model, no matter how detailed, cannot replicate what multiple specialized systems produce in sequence.
The goal is to emulate what a professional trader actually does: read the macro environment, analyze multi-timeframe technicals, identify a setup with defined risk, wait for precise entry conditions, and execute with discipline. SkyAnalyst AI provides the infrastructure that gives the trading model everything it needs to do this — live data, preprocessed context, real-time monitoring, and broker execution. This is not a chatbot experiment. It is an institutional-grade trading pipeline where the AI model is the decision-maker, operating under the same conditions and constraints a professional desk would demand.
Trading involves substantial risk of loss. Past performance is not indicative of future results. These are AI model results shared for educational and research purposes only. Not financial advice.
Stay in the loop
Want the framework behind these trades?
Get every trade analysis, weekly battle report, and the full AI Trading Playbook delivered directly to you.
- Daily Trade Analysis
- Weekly Battle Reports
- AI Trading Insights
- The AI Trading Playbook (free)