EDUARDO’S DAILY ANALYSIS — MAY 22, 2026

GPT-5.5 US500 Long Wicks TP1, Closes on the Stop — -$1,000 (SL), 3W-7L

Price tagged the planned TP1 at 7500.5 after entry — the trade-tracker stamped highest_tp_hit = 1 — but the broker's limit order never filled. The position reversed back through stop 7475.6 and exited at 7475.1 for -1.0R. The day's third stop in four trades closes GPT at $46,173.

Eduardo — Senior Research Editor

Season 2 of the AI Trading Benchmark runs Claude Opus 4.7 and GPT-5.5 head-to-head across six instruments — EUR/USD, NAS100, US500, US30, USDJPY, and GBP/USD — on the same live market data, the same $1,000 fixed risk per trade, and the same $50,000 starting balance. Every analysis session, every evaluation chain, and every broker fill is logged into the public ledger.

This is GPT-5.5's tenth Season 2 trade and the third stop-out of a four-trade May 22 session. Chronologically: USDJPY long stopped first at -$1,000 (SL), US30 long hit TP1 for +$585 (TP1), this US500 long stopped at -$1,000 (SL), and EURUSD short stopped at -$1,000 (SL). Day net: -$2,415. End-of-day broker balance: $46,173.00 — a -7.7% return from the $50,000 starting base.

The US500 long is the trade in that sequence that needs the most careful explanation. Price did touch the planned TP1 at 7500.5 after the entry — the trade-tracker recorded highest_tp_hit = 1 because that print landed inside the TP1 trigger window — but the broker's limit order did not fill. Price reversed, ran back through the entry at 7487.2, and exited at 7475.1 — the actual broker fill, ten cents below the planned stop at 7475.6. The realized outcome under the benchmark's flat-risk methodology is -1.0R / -$1,000 (SL), regardless of the wick-through reading. This article walks through why the broker outcome — not the price-touched reading — is the answer the methodology returns when the two disagree.

About reported results. Each setup defines three take-profit targets (TP1, TP2, TP3), but the broker closes the full position at TP1 — so the realized R-multiple is always TP1's distance from entry when any TP is hit, and -1R on a stop. The dollar P&L shown in this article is the actual broker close at TP1 (or stop). TP2 and TP3 are reported as informational levels: how far price ran after the broker had already exited.

GPT-5.5

Result

SL Hit

R-Multiple

-1.0R

AI Confidence

62%

Win Rate

30.0%

Season Record

3W–7L

Market Environment — May 22, 2026

Friday's New York morning opened with a textbook risk-on read for US500 longs. NYAD printed +244 against a 5-day EMA of +194.6 — positive and above trend, though already fading from a +1051 intraday peak earlier in the session. VIX at 16.74 sat in the 15–20 normal regime, slightly below its 5-day EMA of 17.20, with no SPX-rising/VIX-rising divergence. DXY at 99.34 held a bullish stance above its 5-day EMA. The cross-asset reads were three-for-four supportive for an index long: VIX cooperative, dollar bid into growth divergence, yields rising at 4.556%, oil bearish at 104.24.

Both proprietary agents agreed on direction at moderate-high confidence. The Macro Analysis Agent published a bullish US500 bias at 78% confidence with a tradeability score of 82/100 — citing the "higher-for-longer" Fed narrative reinforced by Waller's argument to remove the easing bias and Warsh's hawkish appointment, with yield differentials supporting the dollar against EUR and JPY while equities held risk-on. The Trend Authority Agent classified US500 as TRENDING at 76% confidence, bullish direction, with key invalidation at 7477 and VWAP at 7473.6–7476.9.

The intraday structural read carried two complications that mattered more than the macro tape. The session high at 7501.7 had been tagged and rejected once before the analysis window opened. The 5-minute tape had then slipped below VWAP and EMA9 — momentum had cooled into the pullback rather than holding the breakout. The pre-trade analysis was specific about what that meant for execution: do not buy immediately into 7476–7477; the cleanest bullish continuation trigger is a reclaim of the 7486.5–7488.5 opening-range breakdown zone, with a 5-minute close above 7486.5 followed by a successful hold or retest above 7484.5–7486.5. The analysis graded that conditional setup at MEDIUM-HIGH (68–72%) only after trigger, with explicit warnings: first test of 7500–7502 may reject again, breadth has faded from the morning high, and the 5-minute MACD is currently weak.

The setup's structural logic was sound. The momentum signature it required to fire was not yet present when the entry window opened.

Trade 1 of 1 — US500 LONG

Trade Details

US500 LONG

Setup: Conditional Long Reclaim / Continuation

Entry7,487.20

Stop Loss7,475.60

Exit7,475.10

R-Multiple-1R

AI Confidence62%

Actual Profit (TP1)-$1,000.00

Analysis by SkyAnalyst AI

Platform view at time of entry · Click to enlarge

Strategy Analysis

Why this article spends time on the wick-through-TP1 reading

This is the methodologically interesting trade of GPT's May 22 session. On every other stop-out in Season 2, the answer is unambiguous: price went the wrong way, hit the stop, the broker closed the position, the result is -1R. Here, price went the right way first. The trade-tracker recorded a touch of 7500.5 — the planned TP1 — and stamped highest_tp_hit = 1 on the row. By the structural reading of the chart, the trade reached its first target. By the broker reading, the position never closed at TP1; the limit order did not fill, price reversed all the way back, and the broker exited at 7475.1 — ten cents below the planned stop at 7475.6, a 1.0-point slippage on a stop-out fill. The realized R under the benchmark's flat-risk policy is -1.0R / -$1,000 (SL).

Two readings of the same trade. One says "the model was right about direction and got to TP1." The other says "the broker realized -1R because the position never closed in profit." The methodology used in this benchmark is unambiguous about which reading governs. Realized R is broker-realized R — the dollar outcome of the position, computed from where the broker actually opened and where the broker actually closed. The trade-tracker's price-touched signal is informational, not authoritative. Three causes can produce this gap and all of them apply here.

First, the TP1 print at 7500.5 could have been a single-tick wick that did not satisfy the broker's limit-order trigger condition. Broker limit orders typically require the bid-or-ask side that fills the order to actually touch the limit price, not just the mid or the printed last. A 1-tick spike on the last-print feed that doesn't translate into a printed bid above 7500.5 will register as price-touched on the tracker but will not fill the limit at the broker. Second, latency between the tracker's price feed and the broker's order matching engine can mean the broker saw a different print sequence than the tracker did — the limit order's queue position can leave it unfilled even when the tracker says the price went past it. Third, in fast-moving sessions, the broker's limit fills require the queue to clear before the order at the limit price is matched; if the move through the limit is faster than the queue clears, the position remains open and price moves on.

The benchmark's methodology does not adjudicate which of these three actually happened on this specific tick. It does not need to. The realized R is determined by where the broker actually exited. That number is what shows up in the broker statement, that number is what compounds the season balance, and that number is what the experiment is measuring. The 7500.5 print is a chart fact. The stop fill at 7475.1 is the trade fact.

How GPT structured the entry

The pre-trade analysis ran 9,380 characters and was unusually direct about what should and should not happen. The bottom line, stated in the analysis's first section: "Bias remains bullish, but no immediate market-order long. The higher-timeframe trend, Macro Agent, Trend Agent, breadth, and VIX all support the long side, but the 5-minute tape has weakened below VWAP/EMA9 after a failed 7500 test. For NY AM, the best setup is a conditional long continuation only if price reclaims the opening-range breakdown area near 7486.5–7488.5." That is the setup the model would commit to. Anything else was disqualified before the evaluation chain started.

The setup specifications were specific. Entry zone 7486.5–7488.5. Trigger: a 5-minute close back above 7486.5, followed by hold or retest above 7484.5–7486.5, with VWAP remaining below price and the 5-minute MACD histogram showing momentum is no longer deteriorating. Stop at 7475.6 — beyond the Trend Agent invalidation at 7477 with a small execution-slippage buffer. TP1 at 7500.5–7501.7 — the session high zone and the 7500 round number congestion. TP2 at 7510–7512.5. The R:R math: ~11.9-point risk, ~13.5 points to TP1 (~1.1R), ~24 points to TP2 (~2.0R). Confluence count: 5 of 6 once triggered (multi-TF EMA alignment, correct side of VWAP, prior-day level interaction, both agents bullish, NYAD confirming; the VIX "mostly aligned" was the partial).

The trade-tracker classified the pattern as Conditional Long Reclaim / Continuation. The "conditional" word is the operative one. The setup is not active until the reclaim trigger fires. The pre-trade work cleared the structural framework. The evaluation chain would have to clear the momentum trigger.

The 5-evaluation chain and what changed at evaluation five

GPT ran five evaluations across roughly five minutes before committing — the longest evaluation window the model has produced on US500 in Season 2 to date.

Evaluation 1 (14:58:34 UTC, 66% confidence): WAIT. "The last completed 5m candle closed at 7482.5 and the current forming candle is only probing toward the lower edge of the entry zone... 7486–7487 is still immediate resistance/fib area rather than confirmed reclaimed support. A close above 7486.5 followed by a hold/retest of 7484.5–7486.5 would make this tradeable." Trigger not confirmed; do not enter.

Evaluation 2 (14:58:43 UTC, 72% confidence): WAIT. "The last completed 5-minute candle closed at 7482.5 and the forming candle has only reached about 7485, while the setup requires a 5-minute close back above 7486.5 followed by a hold/retest." Confidence rose 6 points because the bullish bias was holding and the MACD histogram was becoming "less negative" — but the trigger was still not confirmed.

Evaluation 3 (14:58:44 UTC, 72% confidence): WAIT. Restates the same condition. "Still a reclaim attempt, not a confirmed reclaim/hold."

Evaluation 4 (15:01:23 UTC, 62% confidence): WAIT — but the language softens. "The setup is close but the stated trigger is not fully confirmed: the last completed 5-minute candle closed at 7486.2, just below the required 7486.5 reclaim, and there has not yet been a confirmed hold/retest above 7484.5–7486.5. Price is now in the entry zone at 7486.7, above VWAP, and 5m MACD histogram is improving, so the long idea remains valid, but the reclaim is still marginal and low-volume." Confidence dropped 10 points from evaluation 3 — the trigger was closer to confirmed but the volume signature was weak and the close was 0.3 points short of the reclaim level.

Evaluation 5 (15:03:39 UTC, 62% confidence): ENTER. "Price is currently inside the entry zone at 7486.7 and is holding above the 7484.5–7486.5 reclaim area, with VWAP clearly below price and the broader 15m/trend-agent backdrop still bullish. The exact trigger is imperfect because the last completed 5m candle closed at 7485.7, just below 7486.5, but the forming candle has already tested 7487.7 and MACD histogram deterioration has nearly stopped (-0.38 to -0.08). With SL/TP levels untouched and price near the lower edge of the zone, this is tradeable but not a clean confirmation entry, so size should be conservative."

The pattern is the same the benchmark has surfaced across three GPT loss days. Confidence drops to the moment-of-commitment range — 62% on this trade, the same number printed on the May 22 EURUSD short and the May 21 USDJPY long — and the reasoning explicitly flags the trigger as imperfect. The 5-minute close GPT was waiting for did not actually arrive. The candle closed at 7485.7, below the 7486.5 reclaim level. The forming candle had tested 7487.7 but the close that defined the trigger was still ahead. The position was opened on a not-yet-confirmed reclaim with the model logging the trigger as "imperfect" in real time.

What happened after the entry — the wick through TP1

The trade lasted 4 hours and 25 minutes — far longer than the structural premise required.

Sometime between the entry at 15:03:39 UTC and the eventual stop at 19:29:11 UTC, price extended high enough to touch the planned TP1 at 7500.5 — the trade-tracker stamped highest_tp_hit = 1 on that contact. The broker did not close. Whatever the cause — single-tick wick, latency, queue position on a fast move — the limit order at 7500.5 did not match. The single-tick read was the only contact with TP1 the trade produced before price reversed.

Price then gave back all of the upside the post-entry rally had produced. The reversal ran through the rest of the New York morning and into the European close. By the 19:00 UTC hour, US500 had traded back through the 7480s and the Trend Agent invalidation at 7477 broke. The 5-minute candle that triggered the stop closed below 7475.6 at 19:29:11 UTC. The broker exited at 7475.1, ten cents below the planned stop, a 1.0-point slippage that sits at the tight end of normal for a fast-moving US500 stop fill.

Realized result: -1.0R / -$1,000 (SL) on a 4h 25m hold. Broker P&L: -$1,000 on the $1,000 risk unit. Running balance moved from $48,173 (after the morning US30 winner) to $47,173 immediately after this stop, then to $46,173 after the EURUSD short stopped later in the evening — the day's closing balance.

What this means for the methodology and for the GPT season

The wick-through-TP1 outcome is the cleanest test the benchmark has produced of the broker-realized-R policy. The policy is straightforward: realized R is what the broker realized, not what the price feed touched. The tracker's htp = 1 stamp is a chart artifact — it tells you the trade's intent was directionally correct. It does not tell you the position closed in profit. The broker statement is what closes the position. For this trade, the broker closed the position at the stop.

This is the methodology getting tested by an edge case the policy was built to handle, not a methodology that needs to be revised. The benchmark assumes broker-side fills are the realized outcome. The tracker's TP-touched signal is a downstream observation. When they disagree, the broker fill is the trade. Every other Season 2 article makes this same assumption silently because the wick-through case has not come up. This article makes it explicitly because the case did come up, and surfaces honestly is better than buries quietly.

GPT's season ledger after May 22 sits at 3W-7L, $46,173, -7.7% from the $50,000 starting base. The 3-of-10 win rate has held roughly flat across the four-trade day — GPT entered May 22 at 3W-4L and exited at 3W-7L after three stop-outs and one TP1 winner. The seven losses break out across five instruments: USDJPY long three times (May 20, May 21, May 22), the EURUSD short (May 22), the GBPUSD short (May 19), the NAS100 short (May 19), and this US500 long. The three winners: US30 short (May 19, +1.65R), US500 long (May 20, +0.94R), and US30 long (May 22, +0.59R). The pattern: high-confidence pre-trade analysis cleared at moderate-to-high confluence counts; entries committed at the 62-65% confidence range when the immediate trigger condition was imperfect.

Cross-link: the same-day same-instrument head-to-head

Claude entered US500 long on the same day, from the same bullish macro thesis, and also stopped at -1.0R / -$1,000 (SL). Two independent AIs, two different evaluation frameworks, both committing to the long side on a textbook risk-on tape — and both stopped within the same session. This is the strongest head-to-head in Season 2 so far. The macro thesis (US500 trending higher into a Fed hawkish, dollar-bid, breadth-confirming environment) was directionally correct over the multi-day window. The specific 14:00–15:30 UTC entry window both models chose did not validate the thesis. For Claude, the entry committed inside a 71-second WAIT-to-ENTER pivot on partial momentum. For GPT, the entry committed on a not-yet-confirmed reclaim trigger with a confidence drop into the commitment range.

The diagnostic question the benchmark is now positioned to answer: when both models stop on the same instrument on the same day, is the failure a macro-direction failure (the thesis was wrong) or an entry-timing failure (the trigger conditions used by both models were too loose for the chop the chart actually produced)? The May 22 US500 head-to-head argues for the second answer. Both models were directionally correct. The chart did, at least intra-session, produce the moves their setups were built to capture. Neither model's trigger was strict enough to wait for the move that would have validated the entry.

For US500 specifically, the variable to track through the next several sessions is whether GPT continues to commit on the not-yet-confirmed reclaim — the trigger that fired at 62% confidence on this trade — or whether the framework tightens the close-above-threshold requirement to the textbook condition. The first variant has produced a stop. Whether the second variant produces a winner is what the next US500 setup will show.

Agent Signals

Trend Agent

US500 BULLISH 78%
Regime: TRENDING
Macro: SUPPORTIVE

Macro Agent

Bias: neutral 72%
US dollar strength remains underpinned by a renewed "higher-for-longer" Fed narrative, with Fed Governor Waller arguing for removal of the "easing bias" and Kevin Warsh's hawkish appointment reinforcing tight policy expectations. Yield differentials dominate FX markets, supporting the dollar against low- and negative-yielders (EUR, JPY) while capping high-beta FX despite benign risk sentiment and strong equities.

Cross-Asset

VIX 16.71 (normal)

DXY 99.34 bullish

Yields 4.556 rising

Oil 104.24 bearish

AI Analysis Output

What the AI Actually Saw

Full 9.4k-character analysis produced by the AI before entering this trade. Unedited.

Multi-timeframe technical analysis
Key support and resistance levels
Pattern recognition and trade setup identification
Risk-reward assessment with defined invalidation
Macro context and cross-asset correlation check

SkyAnalyst AI

US500 NY AM Setup Analysis — S&P 500 / US500

Current context: ~10:30am ET data snapshot, US500 around 7476–7477, after an early push into 7501.7 and pullback to VWAP / prior breakout area.

Bottom Line

Bias remains bullish, but no immediate market-order long.
The higher-timeframe trend, Macro Agent, Trend Agent, breadth, and VIX all support the long side, but the 5-minute tape has weakened below VWAP/EMA9 after a failed 7500 test. For NY AM, the best setup is a conditional long continuation only if price reclaims the opening-range breakdown area near 7486.5–7488.5.

Shorts are lower quality because they fight both agents, positive breadth, and the higher-timeframe trend.

1. Breadth & Volatility Regime

Input	Reading	Trading Impact
NYAD / ADD current	+244
NYAD / ADD 5-day EMA	+194.6
Today’s ADD high / low	+1051 / +229
VIX current	16.74
VIX 5-day EMA	17.20
VIX today range	16.64–17.15

Breadth

Breadth is positive and above its 5-day EMA, so it still confirms risk-on conditions for SPX. However, ADD has faded from the intraday high of +1051 to +244, so breadth is supportive but no longer aggressively expanding.

Interpretation:

Positive NYAD/ADD supports long continuation.
Breadth is not strong enough to justify chasing into resistance without a clean reclaim trigger.

VIX

VIX at 16.74 sits in the 15–20 normal volatility regime. It is slightly below its 5-day EMA and roughly flat versus yesterday’s close.

Important warning check:
There is no clear VIX-rising-while-SPX-rising warning at the moment. VIX is not materially rising alongside price. That keeps long confidence intact, though not maximum due to the 5-minute momentum loss.

Stop regime:
Use 10–15 point structural stops where possible. Avoid overly tight stops because 5-minute ATR is elevated and the AM session has already rejected 7500 once.

2. Agent Synthesis

Macro Agent

Factor	Reading
US500 bias	Bullish
Confidence	78%
Tradeability	High, 82/100
Key macro support	Orderly grind higher, broad participation
Risk event	Revised UoM Consumer Sentiment at 10:00am ET, actual 44.8 vs 48.2 forecast

Macro is directly relevant because SPX is the primary risk barometer. The Macro Agent is bullish with good confidence.

Trend Agent

Factor	Reading
Direction	Bullish
Confidence	76%
Regime	Trending
Key resistance	7501.7
Key support / invalidation area	7477
VWAP	Around 7473.6–7476.9

Trend Agent and Macro Agent agree bullishly. That is a meaningful positive input.

Risk Event Timing

The 10:00am ET UoM release is now outside the ±15-minute avoidance window. New entries are acceptable only with confirmation, not immediately into chop.

3. Gap & Daily Structure

Daily reference levels

Level	Price
Prior close	7452.4
Prior high	7471.8
Prior low	7392.9
Today high	7501.7
Today low	7452.9
Current price	~7476.7

US500 is trading above prior day high 7471.8, so the market is still holding a bullish breakout structure, but only marginally.

Gap structure

Current price is about +0.33% above prior close, borderline between a small gap and continuation gap. Since today’s low printed near 7452.9, the gap has effectively already tested/filled the prior close area.

Key intraday levels

Zone	Role
7500–7502	Major round-number congestion + session high
7486.5–7488.5	Opening range low / reclaim trigger area
7480.2	Prior intraday high / local resistance
7477	Trend Agent invalidation / key pivot
7474–7476.9	VWAP / daily pivot / prior close cluster
7471.8	Prior day high breakout level
7464.5	NY session low
7452–7453	Prior close / today low gap-fill zone
7450	Round-number congestion

Main structural message:
The market is trying to decide whether 7474–7477 is a valid VWAP support/bullish retest, or whether the 7500 rejection becomes a failed breakout.

4. Multi-Timeframe Technicals

60-minute bias

60-minute structure remains bullish:

Price is above fast/slow EMAs.
RSI around 57, neutral-positive.
MACD remains above zero but below signal, showing cooling upside momentum.
Price is still above VWAP on the latest 60-minute read.

60m conclusion: Bullish trend, but momentum is cooling.

15-minute confirmation

15-minute is mixed but still constructive:

Price is above the slow EMA but slightly below/near fast EMA.
RSI around 50, neutral.
MACD above zero and signal, but histogram has weakened.
VWAP remains nearby and important.

15m conclusion: Bullish structure is intact, but the pullback has neutralized momentum.

5-minute entry tape

The 5-minute chart is the caution flag:

Price slipped below 5m VWAP around 7476.9.
Price is below 5m EMA9.
MACD crossed below zero.
The 7501.2 / 7501.7 high failed.
Price is sitting near VWAP / prior breakout support.

5m conclusion: Do not chase. Need a reclaim trigger.

Opening range

Approximate first 30-minute NY opening range:

Opening Range Level	Price
OR high	7501.2–7501.7
OR low	7486.5

Price broke below the opening range low after the 7500 rejection. That means the cleanest bullish continuation trigger is a reclaim of 7486.5–7488.5, not simply buying the current VWAP chop.

Valid Setup

Setup 1 — Conditional Long Reclaim / Continuation

Field	Plan
Direction	Long / Buy
Entry zone	7486.5–7488.5
Entry trigger	5-minute close back above 7486.5, followed by hold/retest above 7484.5–7486.5; ideally VWAP remains below price and 5m MACD histogram stops deteriorating
Stop loss	7475.6 including small execution buffer
Risk	Approx. 10.9–12.9 pts, depending on fill
TP1	7500.5–7501.7
TP2	7510–7512.5
Invalidation before entry	No long if price loses 7474 and cannot reclaim VWAP quickly
Trend alignment	Aligned with Macro Agent, Trend Agent, 60m trend, 15m structure

Why this setup qualifies

This setup has 5 of 6 required confluences once triggered:

Confluence	Present?	Notes
Multi-timeframe EMA alignment	Yes	60m and 15m remain broadly bullish; 5m must reclaim
Correct side of VWAP	Conditional yes	Entry only after price is back above VWAP/OR reclaim
Prior day level / daily S/R interaction	Yes	Holding above prior high 7471.8 and VWAP/pivot cluster
Both agents agree	Yes	Macro bullish 78%, Trend bullish 76%
NYAD confirming	Yes	ADD positive and above 5-day EMA
VIX aligned	Mostly yes	VIX normal, below 5-day EMA, no SPX/VIX divergence warning

R:R check

Assuming entry around 7487.5 and stop 7475.6:

Risk: ~11.9 pts
TP1 at 7501.0: ~13.5 pts, about 1.1R
TP2 at 7511.5: ~24 pts, about 2.0R

This is acceptable because TP1 is near the major structural high/round number, while TP2 offers a valid 2R extension if 7500 breaks cleanly.

Confidence

Moderate-high only after trigger: 68–72%.

The setup is not active until price reclaims the opening-range breakdown zone. Current price near 7476–7477 is too close to VWAP/invalidation and the 5-minute momentum is still soft.

Key risks

7500–7502 is major congestion. First test may reject again.
Breadth is positive but has faded from the morning high.
5-minute MACD is currently weak; buying before reclaim would be premature.
A sustained move below 7474–7471.8 would turn the breakout into a failed breakout.

No High-Probability Short Setup Right Now

A short below VWAP may look tempting on the 5-minute chart, but it lacks enough high-quality confluence.

Short-side problems

Macro Agent is bullish.
Trend Agent is bullish.
NYAD/ADD is positive and above its 5-day EMA.
VIX is not rising enough to confirm a short/reversal regime.
Price is still above prior day high 7471.8 and near VWAP support.

A short would only become more credible if:

Price loses 7471.8,
Holds below 7474/VWAP,
VIX starts rising,
ADD deteriorates materially,
And price retests from below and fails.

Until then, shorts are tactical scalps only, not high-probability NY AM setups.

Execution Guidance

Do not buy immediately at 7476–7477.
Preferred long trigger is a reclaim of 7486.5–7488.5.
Avoid new long entries if price is still below VWAP and below 5m EMA9.
If price reclaims 7486.5 but stalls under 7492, reduce confidence.
If price breaks 7501.7 with breadth firm and VIX flat/down, trail for 7510–7512.5.

Risk Management

Because VIX is in the normal regime, use volatility-adjusted sizing around a 10–13 point stop for the valid setup. Risk per trade should generally remain around 0.5%–1% of equity under normal conditions, reduced during drawdowns, and capped near 2% only when already trading from profits.

AI Entry Decision

5 Evaluations — What Changed in 5 Minutes

The AI evaluated US500-Pepperstone 5 times before reaching a final decision. Each step shows confidence, reasoning, and whether the model was ready to enter.

Wait66%2026-05-22T14:58:34.736Z

I would not enter yet because the required 5-minute close back above 7486.5 has not occurred; the last completed 5m candle closed at 7482.5 and the current forming candle is only probing toward the lower edge of the entry zone. The backdrop is constructive—price is above VWAP/EMAs, the 5m MACD histogram is improving, and the Trend Agent remains bullish—but 7486–7487 is still immediate resistance/fib area rather than confirmed reclaimed support. A close above 7486.5 followed by a hold/retest of 7484.5–7486.5 would make this tradeable.

↓

→

Wait72%2026-05-22T14:58:43.769Z

Do not enter yet: the key trigger has not been confirmed, as the last completed 5-minute candle closed at 7482.5 and the forming candle has only reached about 7485, while the setup requires a 5-minute close back above 7486.5 followed by a hold/retest. The bullish case is improving—price remains above VWAP/EMAs and the 5-minute MACD histogram is becoming less negative—but price is only at the lower edge of the entry zone and still pressing into nearby 7486–7487 resistance. I would wait for a confirmed 5-minute close above 7486.5 and a successful hold above 7484.5–7486.5 before entering.

↓

Wait72%2026-05-22T14:58:44.184Z

Do not enter yet: the conditional trigger has not been confirmed because the last completed 5-minute candles closed below 7486.5, and the latest 5-minute candle shown is still forming with price only testing the lower edge of the entry zone. The setup is improving—price is above VWAP and EMAs, the 5m MACD histogram is becoming less negative, and the Trend Agent remains bullish—but this is still a reclaim attempt, not a confirmed reclaim/hold. I would wait for a clean 5-minute close above 7486.5 followed by acceptance above 7484.5–7486.5 before entering.

↓

→

Wait62%2026-05-22T15:01:23.913Z

The setup is close but the stated trigger is not fully confirmed: the last completed 5-minute candle closed at 7486.2, just below the required 7486.5 reclaim, and there has not yet been a confirmed hold/retest above 7484.5–7486.5. Price is now in the entry zone at 7486.7, above VWAP, and 5m MACD histogram is improving, so the long idea remains valid, but the reclaim is still marginal and low-volume. I would wait for a clean 5-minute close above 7486.5 or a successful retest/hold before entering.

↓

Enter62%2026-05-22T15:03:39.268Z

Price is currently inside the entry zone at 7486.7 and is holding above the 7484.5–7486.5 reclaim area, with VWAP clearly below price and the broader 15m/trend-agent backdrop still bullish. The exact trigger is imperfect because the last completed 5m candle closed at 7485.7, just below 7486.5, but the forming candle has already tested 7487.7 and MACD histogram deterioration has nearly stopped (-0.38 to -0.08). With SL/TP levels untouched and price near the lower edge of the zone, this is tradeable but not a clean confirmation entry, so size should be conservative.

-1.0R

US500 long — SL HIT — 4h 25m

Account Performance

Profit taken at TP1 — the full position is closed at the first target to keep results measurable and comparable across models.

InstrumentLotsEntry / ExitNet P&LRisk (2%)Balance

US500-Pepperstonebuy

0.007487.2 → 7475.1-$1,000.00$1,000.00

$46,173.00Simulated

US500-PepperstonebuySimulated

-$1,000.00

0.00 lots7487.2 → 7475.1

Risk: $1,000.00Bal: $46,173.00

Season$50,000.00 → $46,173.00-$3,827.00 (-7.7%)· 10 trades

Key Takeaways

What this trade taught — three lessons from the wick-through-TP1 stop-out.

The broker statement is the realized outcome, not the price feed. Price did touch the planned TP1 at 7500.5 (htp = 1), but the broker's limit order did not fill. The position then reversed all the way back through the entry and exited at 7475.1 — ten cents below the planned stop. The realized R under the benchmark's flat-risk policy is -1.0R / -$1,000 (SL), regardless of the tracker's TP-touched reading. When the two diagnostics disagree, the broker fill is the trade — that is the methodology by design, not a workaround for an edge case.
The "imperfect trigger" pattern is now documented on three GPT loss days. GPT entered at 62% confidence on the May 21 USDJPY long, the May 22 EURUSD short, and this May 22 US500 long. In all three cases, the pre-trade analysis flagged the structural setup at MEDIUM-HIGH (68-72%) and the real-time evaluation downgraded the trigger to "imperfect" at the moment of commitment. The pattern is becoming structural: the framework's structural grading does not converge with the framework's trigger grading on the trades that produce stops. The variable worth watching is whether the framework tightens the close-above-threshold requirement before committing.
Two models, one instrument, one day, two stops — the macro thesis was directionally correct. Both GPT and Claude entered US500 long on May 22 from the same Fed-hawkish, breadth-confirming, dollar-bid bullish thesis. Both stopped at -1.0R. The thesis was right over the multi-day window; the entry window each model chose was wrong for the chop the chart produced inside that window. The May 22 US500 head-to-head is the cleanest Season 2 example of the macro-correct, entry-wrong failure mode. The diagnostic question for both models: tighten the entry trigger, or widen the holding window, or both.

The five-instrument loss spread across GPT's seven Season 2 losses (USDJPY x3, EURUSD, GBPUSD, NAS100, US500) argues this is a trigger-quality pattern, not an instrument-specific one. The next several sessions will show whether either model adjusts.

Eduardo

Senior Research Editor

GPT-5.5's May 22 ended 3W-7L at $46,173, the model's deepest Season 2 drawdown to date. The US500 long is the trade in the day's four-trade sequence that needed the most honest accounting — the trade-tracker recorded a TP1 touch that the broker never closed on, and the position reversed all the way back through stop for a textbook -1R. The benchmark's flat-risk methodology answers the htp/exit_reason disagreement the same way every time: the broker statement is the trade. Whether GPT tightens its reclaim-trigger requirement before the next US500 setup is what the second week of Season 2 will measure. — Eduardo, Senior Research Editor

Compare with Isaac’s analysis →

Methodology

Both AI models receive identical market data, identical infrastructure, and identical risk parameters. No prompt engineering. No human intervention. Standard API temperature (0.0). Trades executed on demo accounts with institutional spread conditions via Pepperstone Markets. Each model operates with a $50,000 starting balance and 2% risk per trade. All positions are closed at TP1 — the first take-profit target — to keep results measurable and directly comparable across models.

Forex pairs and gold (XAUUSD) have standardized pricing across brokers — the prices in this article will closely match what you see on your own platform. US index CFDs (NAS100, US30, US500) are different: each broker constructs its own index price feed, so entry prices, stop distances, and P&L figures for index trades are specific to Pepperstone Markets. All trades in this experiment were analyzed, executed, and settled on Pepperstone demo accounts using Pepperstone's price feed.

Why This Cannot Be Replicated in ChatGPT or Claude Alone

Copying the analysis prompt into ChatGPT or Claude will not reproduce these results. Neither model has access to live market data — and the data is the foundation of everything.

Every analysis session, SkyAnalyst AI assembles a structured data packet of 50,000–100,000 tokens per instrument from live broker APIs. This is not a price quote. It contains 5 hours of multi-timeframe candle data across 60-minute, 15-minute, and 5-minute charts — each candle carrying full indicator overlays: EMA fast/slow, ATR, MACD with histogram, RSI, volume with SMA, VWAP with standard deviation bands, and others. On top of that: session structure levels (Tokyo, London, New York highs and lows), Fibonacci retracement and extension levels, a rolling 5-day macro window covering the 10Y yield, DXY, VIX, NYAD breadth, oil, and gold — along with additional proprietary data layers, all formatted as structured JSON specifically designed for LLM consumption.

The model never starts from raw data. Before Claude or GPT sees anything, two proprietary SkyAnalyst AI agents — among other internal systems — have already processed the environment: the Macro Analysis Agent produces directional bias with confidence scores and tradeability ratings across intraday and multi-day horizons, while the Trend Authority Agent evaluates technical structure — EMA alignment, momentum, regime classification — and outputs direction, confidence, key levels, and invalidation prices. The trading model synthesizes what these agents and preprocessing layers have already evaluated. This multi-agent pipeline is what produces the quality of analysis shown in this article — a single prompt to a single model, no matter how detailed, cannot replicate what multiple specialized systems produce in sequence.

The goal is to emulate what a professional trader actually does: read the macro environment, analyze multi-timeframe technicals, identify a setup with defined risk, wait for precise entry conditions, and execute with discipline. SkyAnalyst AI provides the infrastructure that gives the trading model everything it needs to do this — live data, preprocessed context, real-time monitoring, and broker execution. This is not a chatbot experiment. It is an institutional-grade trading pipeline where the AI model is the decision-maker, operating under the same conditions and constraints a professional desk would demand.

Trading involves substantial risk of loss. Past performance is not indicative of future results. These are AI model results shared for educational and research purposes only. Not financial advice.

View Full Methodology

Stay in the loop

Want the framework behind these trades?

Get every trade analysis, weekly battle report, and the full AI Trading Playbook delivered directly to you.

Daily Trade Analysis
Weekly Battle Reports
AI Trading Insights
The AI Trading Playbook (free)

Get it on WhatsApp