Pre-Market Alpha: Computing Overnight Signals and Liquidity Forecasts Before the Open | US Stocks

The closing bell does not end the market. It merely pauses it.

Between 4:00 PM ET and 9:30 AM ET, roughly 17 hours of accumulated overnight information—earnings whispers, ADR movements, futures pricing, macro releases scheduled for the following morning—settles into the pre-market order book. The traders who arrive at 9:30 AM with positions already constructed have an edge that cannot be manufactured in the first 30 seconds of live trading. That edge is built the night before.

This article walks through the signals you can precompute after hours, the data sources that power pre-market liquidity estimation, and the production-grade WebSocket infrastructure needed to monitor overnight order flow and arrive at the open prepared.

The Case for Overnight Signal Precomputation

The standard retail workflow treats the market as a daytime-only phenomenon. Traders monitor price action during regular hours, close positions before 4:00 PM, and treat the overnight window as dead time. This assumption is structurally expensive.

Consider what happens during a typical earnings-adjacent overnight period. An equity like Rivian Automotive (RIVN) reports earnings after market close on Thursday. By Friday morning, the pre-market order book has already priced in the implied move. A trader who waits until 9:30 AM to assess the landscape is arriving to a battlefield where the first shots were fired at 4:01 PM the previous day.

Precomputing signals overnight accomplishes three things:

1. Latency arbitrage. By the time regular trading begins, your order book baseline, implied volatility surface, and directional bias are already computed. You are not consuming data—you are acting on preprocessed intelligence.

2. Noise reduction. High-frequency signals during the open are contaminated by quote fade, spoofing residue, and HFT acceleration. Overnight signals, though thinner in volume, carry higher information density per trade.

3. Strategy preconditioning. Precomputed signals feed directly into your opening strategy's entry conditions. Rather than computing a momentum score from scratch at 9:29:45 AM, you arrive with a binary or continuous signal already scored.

The Pre-Market Data Landscape

Before diving into signal construction, you need to understand what data is actually available during the overnight window and which sources carry reliable liquidity signals.

Data Source	Availability (ET)	Information Content	Reliability
US equity futures (ES, NQ)	23 hours	Broad market direction, sector rotation	High
ADR prices (24-hour OTC)	Continuous	International stock movement pricing	Medium-High
Extended-hours trading (IEX, FINRA)	4:00 AM – 9:30 AM	Actual trade flow in US equities	Medium
Options implied volatility	Pre-market 7:00 AM+	Market-maker fear/greed pricing	High
News sentiment feeds	Continuous	Overnights earnings pre-pricing	Low-Medium
Pre-market order book depth	7:00 AM – 9:30 AM	Limit order congestion at key levels	High (when available)

The limitation that most traders underestimate is the availability of actual order book depth data during the pre-market window. US equity exchanges provide consolidated pre-market depth, but the granularity and refresh rates are inferior to regular-hours data. For the purposes of this article, we will focus on signals that can be computed from available data: futures-driven market direction, trade-flow derived pressure ratios, and overnight news sentiment scoring.

Signal Taxonomy: What to Precompute After Hours

Signal Category 1: Directional Bias from Futures and ADRs

The most accessible overnight directional signal comes from the ES (E-mini S&P 500) and NQ (E-mini Nasdaq-100) futures, which trade nearly 23 hours per day. The overnight change in these contracts represents the market's collective assessment of overnight news, macro developments, and sentiment.

Computation: Track the change from the previous session's 4:15 PM ET close to the current pre-market reading at 7:00 AM ET. Normalize by historical volatility to produce a z-score directional signal.

import os
import time
import json
import random
import logging
from datetime import datetime, timezone
from typing import Optional
import requests

logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s [%(levelname)s] %(message)s",
    datefmt="%H:%M:%S"
)
logger = logging.getLogger(__name__)

# ⚠️ For production HFT workloads, use aiohttp/asyncio
# This synchronous implementation is suitable for pre-market signal computation

TICKDB_API_KEY = os.environ.get("TICKDB_API_KEY")
BASE_URL = "https://api.tickdb.ai/v1"

class OvernightSignalEngine:
    """
    Computes pre-market directional and volatility signals from
    available data sources for opening strategy preparation.
    """

    def __init__(self, api_key: str):
        self.api_key = api_key
        self.headers = {"X-API-Key": api_key}
        self.base_url = BASE_URL
        self._session = None
        self._retry_count = 3
        self._base_delay = 1.0

    def _request_with_backoff(self, method: str, url: str, **kwargs) -> dict:
        """HTTP request with exponential backoff and jitter."""
        for attempt in range(self._retry_count):
            try:
                response = requests.request(
                    method=method,
                    url=url,
                    headers=self.headers,
                    timeout=(3.05, 10),
                    **kwargs
                )
                data = response.json()

                # Rate-limit handling
                if data.get("code") == 3001:
                    retry_after = int(response.headers.get("Retry-After", 5))
                    logger.warning(f"Rate limited. Waiting {retry_after}s before retry.")
                    time.sleep(retry_after)
                    continue

                if data.get("code", 0) == 0:
                    return data.get("data", {})
                else:
                    logger.error(f"API error {data.get('code')}: {data.get('message')}")
                    return {}

            except requests.exceptions.Timeout:
                logger.warning(f"Request timeout (attempt {attempt + 1}/{self._retry_count})")
            except requests.exceptions.RequestException as e:
                logger.warning(f"Request error: {e} (attempt {attempt + 1}/{self._retry_count})")

            # Exponential backoff with jitter
            if attempt < self._retry_count - 1:
                delay = min(self._base_delay * (2 ** attempt), 30.0)
                jitter = random.uniform(0, delay * 0.1)
                time.sleep(delay + jitter)

        return {}

    def get_futures_close_baseline(self, symbol: str, date: str) -> Optional[float]:
        """
        Retrieve previous session's regular-hours close for normalization.
        Uses the kline endpoint for daily OHLCV.
        """
        url = f"{self.base_url}/market/kline"
        params = {
            "symbol": symbol,
            "interval": "1d",
            "limit": 2,  # Previous day + current (partial)
            "end_time": f"{date}T16:15:00Z"  # 4:15 PM ET close
        }
        klines = self._request_with_backoff("GET", url, params=params)
        if klines and len(klines) >= 1:
            # Previous day's close is the first entry
            return klines[0].get("close")
        return None

    def compute_futures_directional_signal(
        self,
        symbol: str = "ES.US",
        lookback_days: int = 20
    ) -> dict:
        """
        Compute z-score of overnight futures change.
        Returns a directional bias score normalized by historical volatility.
        """
        today = datetime.now(timezone.utc).strftime("%Y-%m-%d")
        yesterday = (
            datetime.now(timezone.utc)
            .replace(hour=0, minute=0, second=0, microsecond=0)
        )

        # Fetch historical closes for volatility normalization
        url = f"{self.base_url}/market/kline"
        params = {
            "symbol": symbol,
            "interval": "1d",
            "limit": lookback_days
        }
        klines = self._request_with_backoff("GET", url, params=params)

        if not klines or len(klines) < lookback_days:
            logger.warning(f"Insufficient kline data for {symbol}")
            return {"signal": 0.0, "confidence": "low", "raw_change": 0.0}

        # Compute daily returns
        closes = [float(k.get("close", 0)) for k in klines]
        returns = []
        for i in range(1, len(closes)):
            if closes[i - 1] > 0:
                ret = (closes[i] - closes[i - 1]) / closes[i - 1]
                returns.append(ret)

        if not returns:
            return {"signal": 0.0, "confidence": "low", "raw_change": 0.0}

        # Historical statistics
        mean_ret = sum(returns) / len(returns)
        variance = sum((r - mean_ret) ** 2 for r in returns) / len(returns)
        std_dev = variance ** 0.5

        # Today's overnight change (current vs previous close)
        if len(closes) >= 1:
            current_price = closes[-1]
            previous_close = closes[-2] if len(closes) >= 2 else closes[-1]
            overnight_change = (current_price - previous_close) / previous_close

            # Z-score normalization
            if std_dev > 0:
                z_score = (overnight_change - mean_ret) / std_dev
            else:
                z_score = 0.0

            signal_strength = max(min(z_score / 3.0, 1.0), -1.0)  # Clamp to [-1, 1]

            confidence = "high" if len(returns) >= 20 else "medium"

            logger.info(
                f"{symbol}: overnight change={overnight_change:.4%}, "
                f"z_score={z_score:.2f}, signal={signal_strength:.3f}"
            )

            return {
                "symbol": symbol,
                "signal": signal_strength,
                "z_score": z_score,
                "raw_change": overnight_change,
                "confidence": confidence,
                "hist_vol": std_dev
            }

        return {"signal": 0.0, "confidence": "low", "raw_change": 0.0}

Signal Category 2: Implied Volatility Term Structure

Options implied volatility (IV) contains forward-looking fear and greed that futures prices do not. A steepening IV term structure (near-term IV > medium-term IV) indicates elevated near-term uncertainty—often associated with earnings, FDA decisions, or macroeconomic announcements.

For pre-market preparation, the key signal is the IV skew reversal: when the IV skew (difference between OTM put and OTM call IV) inverts from its typical configuration, it signals that market makers are pricing in a directional move.

Implementation approach: While full IV surface computation requires a dedicated options data feed, you can approximate this signal by monitoring the ratio of near-term put volume to call volume from pre-market trade reports. A put/call ratio above 1.5 during the pre-market window is a historically reliable contrarian signal for directional positioning.

Signal Category 3: Sector and Thematic Rotation via ETF Flow

ETF flows provide a macro-level signal on overnight positioning. The top 10 US equity ETFs by AUM collectively reveal how institutional capital is repositioning after hours.

Key instruments to monitor:

SPY (S&P 500): Broad market directional bias
QQQ (Nasdaq 100): Tech sector rotation signal
IWM (Russell 2000): Small-cap sentiment
XLE (Energy Select Sector): Commodity cycle rotation
XLK (Technology Select Sector): AI/infrastructure positioning

By computing the net-of NAV premium/discount for each ETF at 7:00 AM ET and comparing it to the 10-day average, you can detect whether overnight flows are concentrated in defensive (XLU, consumer staples) or aggressive (QQQ, ARKK) themes.

Pre-Market Liquidity Estimation: The Depth Problem

Estimating liquidity before the open is harder than it sounds. The fundamental issue is that pre-market order books are:

Thin: Volume during the 7:00–9:15 AM ET window is typically 5–15% of average daily volume (ADV).
Informed: A disproportionate share of pre-market volume comes from institutional desks with informational advantages.
Subject to quote congestion: Wide spreads during the pre-market create "dead zones" where limit orders cluster at round-number levels.

Estimating Pre-Market Spread Width

The bid-ask spread is the most direct liquidity proxy. During regular hours, US equities trade with spreads that are often sub-penny for large-caps. During the pre-market session, effective spreads widen by a factor of 3–10x depending on the stock's average daily volume.

A practical estimation formula:

Pre-market effective spread ≈ Base spread × (1 + (0.5 × ln(ADV_ratio)))

Where ADV_ratio is the ratio of current pre-market volume to the expected pre-market ADV (approximately 8% of 20-day ADV).

For stocks with ADV above $500 million, the pre-market spread remains manageable. For micro-caps and nano-caps, pre-market trading is often functionally illiquid until 9:20 AM when retail participation increases.

class PreMarketLiquidityEstimator:
    """
    Estimates pre-market bid-ask spread width and execution quality
    based on historical ADV and current session volume.
    """

    def __init__(self, symbol: str, adv_20d: float, avg_premarket_volume_ratio: float = 0.08):
        self.symbol = symbol
        self.adv_20d = adv_20d  # Average daily volume (shares)
        self.avg_premarket_ratio = avg_premarket_volume_ratio  # Pre-market is ~8% of ADV
        self.base_spread_bps = self._estimate_base_spread()

    def _estimate_base_spread(self) -> float:
        """
        Estimate base spread in basis points based on ADV tier.
        This is a simplified model; production use should incorporate
        real-time NBBO data.
        """
        if self.adv_20d >= 10_000_000:  # Large-cap (>10M shares/day)
            return 1.5  # ~1.5 bps base spread
        elif self.adv_20d >= 1_000_000:  # Mid-cap
            return 5.0
        elif self.adv_20d >= 100_000:  # Small-cap
            return 15.0
        else:  # Micro/nano-cap
            return 50.0

    def estimate_premarket_spread(self, current_premarket_volume: float) -> dict:
        """
        Estimate effective spread for the current pre-market session.
        Returns spread in basis points and estimated fill quality score.
        """
        expected_premarket_vol = self.adv_20d * self.avg_premarket_ratio
        adv_ratio = current_premarket_volume / expected_premarket_vol if expected_premarket_vol > 0 else 0

        # Spread widening factor (logarithmic, asymptotic)
        spread_multiplier = 1.0 + (0.5 * max(0, 1 - adv_ratio))

        estimated_spread_bps = self.base_spread_bps * spread_multiplier

        # Execution quality: 1.0 = excellent, 0.0 = non-tradeable
        if estimated_spread_bps <= 5:
            fill_quality = 1.0
        elif estimated_spread_bps <= 20:
            fill_quality = 0.8 - (estimated_spread_bps - 5) / 75
        elif estimated_spread_bps <= 100:
            fill_quality = 0.5 - (estimated_spread_bps - 20) / 160
        else:
            fill_quality = 0.2  # High risk of adverse selection

        logger.info(
            f"{self.symbol}: premarket_vol={current_premarket_volume:,.0f}, "
            f"adv_ratio={adv_ratio:.2f}, spread={estimated_spread_bps:.1f}bps, "
            f"fill_quality={fill_quality:.2f}"
        )

        return {
            "symbol": self.symbol,
            "estimated_spread_bps": estimated_spread_bps,
            "fill_quality_score": max(0.0, min(1.0, fill_quality)),
            "current_premarket_volume": current_premarket_volume,
            "is_tradeable": estimated_spread_bps < 50
        }

Opening Strategy Preparation: From Signals to Positions

Precomputed signals are only valuable if they integrate into a coherent opening strategy. The framework we recommend separates signal generation from signal consumption:

Phase 1: Overnight Signal Generation (4:00 PM – 7:00 AM ET)

Run your overnight signal engine to produce:

Directional bias scores (futures z-score, ADR movement composite)
IV skew reversal flags (from options volume ratios)
Sector rotation fingerprints (ETF flow analysis)
Earnings-adjacent event risk scores

Store these signals in a Redis or in-memory cache keyed by ticker with a TTL that expires at 9:30 AM.

Phase 2: Pre-Market Execution Conditioning (7:00 AM – 9:28 AM ET)

At 7:00 AM, begin monitoring the live pre-market feed. Compare current order flow against your overnight baseline:

If current pre-market price is 2% above your overnight close baseline AND your directional signal was bearish → reassess short entry timing (momentum may overpower mean reversion).
If current spread width exceeds your liquidity estimator's "tradeable" threshold → reduce position size or defer entry to regular hours.
If pre-market volume exceeds 120% of expected pre-market ADV within the first 30 minutes → anticipate elevated volatility at open (consider wider stops).

Phase 3: Open Execution with Signal Overlay (9:30 AM – 10:00 AM ET)

The first 30 minutes of regular trading are the highest-volatility window of the day. Your precomputed signals should govern two decisions:

Entry threshold adjustment: If your overnight directional signal was strongly bullish (z-score > 2.0) AND the pre-market has continued in the same direction, tighten your entry threshold by 0.5 standard deviations to avoid chasing an extended open.
Position sizing: Scale your base position size by the confidence score from your signal engine. A high-confidence signal (historical hit rate > 60%) justifies 1.25x base sizing. A low-confidence signal warrants 0.75x.

Production Monitoring Infrastructure

The following WebSocket subscription manages the overnight-to-open transition, automatically switching from pre-market monitoring to regular-hours depth tracking.

import threading
import queue
import websocket

class OvernightToOpenMonitor:
    """
    WebSocket monitor that transitions from pre-market order flow
    monitoring to regular-hours depth tracking at market open.
    Includes heartbeat, reconnection, and signal queuing.
    """

    def __init__(self, symbols: list, api_key: str):
        self.symbols = symbols
        self.api_key = api_key
        self.ws = None
        self.ws_url = f"wss://api.tickdb.ai/v1/market/stream?api_key={api_key}"
        self._running = False
        self._reconnect_delay = 1.0
        self._max_reconnect_delay = 60.0
        self._retry_count = 0
        self._max_retries = 10
        self.signal_queue = queue.Queue()

    def _build_subscribe_message(self) -> dict:
        """Subscribe to depth and trades channels for all target symbols."""
        return {
            "cmd": "subscribe",
            "params": {
                "channels": ["depth", "trades"],
                "symbols": self.symbols
            }
        }

    def _on_message(self, ws, message):
        """Handle incoming WebSocket messages."""
        try:
            data = json.loads(message)

            # Heartbeat response
            if data.get("cmd") == "pong":
                logger.debug("Heartbeat acknowledged")
                return

            # Data message
            if "type" in data:
                self._process_tick(data)

        except json.JSONDecodeError:
            logger.warning(f"Invalid JSON message: {message[:100]}")

    def _process_tick(self, tick: dict):
        """Process incoming tick data into signal events."""
        tick_type = tick.get("type")
        symbol = tick.get("symbol")
        timestamp = tick.get("t")

        if tick_type == "depth":
            # Extract bid/ask L1 for pressure ratio
            bid = float(tick.get("bid", 0))
            ask = float(tick.get("ask", 0))
            bid_size = tick.get("bidSize", 0)
            ask_size = tick.get("askSize", 0)

            if bid > 0 and ask > 0:
                spread = ask - bid
                pressure_ratio = bid_size / ask_size if ask_size > 0 else 1.0

                signal_event = {
                    "symbol": symbol,
                    "type": "depth_snapshot",
                    "timestamp": timestamp,
                    "spread": spread,
                    "pressure_ratio": pressure_ratio
                }
                self.signal_queue.put(signal_event)

                logger.debug(
                    f"{symbol}: spread={spread:.4f}, "
                    f"pressure_ratio={pressure_ratio:.2f}"
                )

        elif tick_type == "trade":
            price = float(tick.get("price", 0))
            volume = float(tick.get("volume", 0))
            side = tick.get("side", "unknown")

            signal_event = {
                "symbol": symbol,
                "type": "trade",
                "timestamp": timestamp,
                "price": price,
                "volume": volume,
                "side": side
            }
            self.signal_queue.put(signal_event)

    def _on_error(self, ws, error):
        logger.error(f"WebSocket error: {error}")

    def _on_close(self, ws, close_status_code, close_msg):
        logger.warning(
            f"WebSocket closed: {close_status_code} — {close_msg}"
        )
        self._running = False

    def _on_open(self, ws):
        logger.info("WebSocket connected. Subscribing to channels.")
        subscribe_msg = self._build_subscribe_message()
        ws.send(json.dumps(subscribe_msg))
        self._retry_count = 0
        self._reconnect_delay = 1.0
        self._running = True

    def _heartbeat_loop(self):
        """Send periodic heartbeat to keep connection alive."""
        while self._running:
            try:
                if self.ws and self.ws.sock and self.ws.sock.connected:
                    ping_msg = json.dumps({"cmd": "ping"})
                    self.ws.send(ping_msg)
                    logger.debug("Heartbeat sent")
                time.sleep(30)  # Heartbeat every 30 seconds
            except Exception as e:
                logger.warning(f"Heartbeat error: {e}")
                break

    def _heartbeat_with_backoff(self):
        """Reconnect with exponential backoff and jitter."""
        while self._retry_count < self._max_retries:
            try:
                self.ws = websocket.WebSocketApp(
                    self.ws_url,
                    on_message=self._on_message,
                    on_error=self._on_error,
                    on_close=self._on_close,
                    on_open=self._on_open
                )

                # Run WebSocket in a thread with timeout
                thread = threading.Thread(target=self.ws.run_forever)
                thread.daemon = True
                thread.start()

                # Start heartbeat thread
                hb_thread = threading.Thread(target=self._heartbeat_loop)
                hb_thread.daemon = True
                hb_thread.start()

                # Wait for connection to drop
                while self._running:
                    time.sleep(1)

                thread.join(timeout=5)

            except Exception as e:
                logger.error(f"Connection error: {e}")

            if self._retry_count < self._max_retries:
                # Exponential backoff with jitter
                delay = min(
                    self._reconnect_delay * (2 ** self._retry_count),
                    self._max_reconnect_delay
                )
                jitter = random.uniform(0, delay * 0.1)
                logger.info(
                    f"Reconnecting in {delay + jitter:.1f}s "
                    f"(attempt {self._retry_count + 1}/{self._max_retries})"
                )
                time.sleep(delay + jitter)
                self._retry_count += 1
                self._reconnect_delay = delay

        logger.error("Max retries exceeded. Monitor stopped.")

    def start(self):
        """Start the overnight-to-open monitor."""
        logger.info(
            f"Starting OvernightToOpenMonitor for: {', '.join(self.symbols)}"
        )
        self._heartbeat_with_backoff()

    def drain_signals(self, timeout: float = 0.1) -> list:
        """Retrieve accumulated signals from the queue."""
        signals = []
        while True:
            try:
                signal = self.signal_queue.get(timeout=timeout)
                signals.append(signal)
            except queue.Empty:
                break
        return signals

Event Calendar Integration: Overnights with Scheduled Catalysts

Pre-market signal computation becomes significantly more powerful when integrated with a scheduled event calendar. Earnings announcements, Federal Reserve meetings, and macroeconomic releases (CPI, NFP, FOMC) create predictable liquidity discontinuities that your overnight engine must anticipate.

Event Type	Typical Market Reaction	Pre-Market Signal Adjustment
Earnings release (after close)	Post-market gap risk	Increase IV skew monitoring; widen liquidity thresholds
Fed meeting / FOMC	Volatility spike regardless of direction	Pre-position for elevated spread; reduce size
CPI / NFP release (8:30 AM)	Intraday directional impulse	Neutralize positions 15 min before release; reassess post-release
Index rebalancing	Sector rotation flows	Shift directional bias toward rebalancing buy/sell pressure

For earnings-adjacent overnight periods, the pre-market signal engine should:

Flag the stock as a high-volatility event target 24 hours before the release.
Increase the spread estimation multiplier from 1.5x to 3.0x.
Set an alert for the post-market close-to-next-morning window when the gap risk is highest.

Strategic Deployment Configurations

The appropriate deployment configuration depends on your trading scale and infrastructure maturity.

User Segment	Overnight Signal Approach	Pre-Market Monitor	Position at Open
Individual quant (single ticker)	Manual script run at 7:00 AM; signals stored in-memory	Single-symbol WebSocket to `depth` channel	Signal-driven market orders; 0.5x size
Active retail trader (3–5 tickers)	Automated cron job; signals in Redis	Multi-symbol WebSocket subscription	Signals + live spread check; 0.75x size
Small fund (10–20 tickers)	Signal server with PostgreSQL persistence	WebSocket farm with per-symbol threads	Signal overlay with real-time drift check; full size with circuit breaker
Institutional desk	Full signal engine with event calendar API integration	Redundant WebSocket connections + exchange direct feed fallback	Multi-signal composite with volatility-targeting

Closing

The edge in opening strategy is not manufactured in the first second of trading. It is constructed the night before, from futures flows, from pre-market trade reports, from the microstructure of the overnight order book. The traders who arrive at 9:30 AM with signals already computed and positions already conditioned are not faster—they are earlier.

Pre-market signal precomputation is a data engineering problem as much as it is a trading problem. The infrastructure matters: reliable WebSocket connections with heartbeat and automatic reconnection, resilient HTTP clients with exponential backoff and rate-limit handling, and signal caches that survive the overnight window and persist into the open.

The code provided in this article implements the foundation. The rest—the event calendar integration, the multi-signal composite scoring, the position-sizing engine calibrated to your historical win rate—that is where your competitive differentiation lives.

Next Steps

If you are building a single-ticker pre-market monitor, the OvernightSignalEngine class provides the directional z-score signal you need. Start with ES futures data and expand to equity-specific signals once the base is stable.

If you need 10+ years of historical OHLCV data to calibrate your signal statistics, sign up at tickdb.ai for access to cleaned, time-aligned US equity data covering multiple market cycles.

If you are scaling to a multi-ticker signal engine, consider deploying the OvernightToOpenMonitor as a daemonized service with Redis-backed signal persistence. Ensure your reconnection logic includes circuit-breaker logic to prevent runaway reconnection loops during exchange outages.

If you use AI coding assistants, search for and install the tickdb-market-data SKILL in your tool's marketplace for integrated API access and code-completion support.

This article does not constitute investment advice. Markets involve risk; past performance does not guarantee future results. Pre-market trading involves additional risks including lower liquidity and wider spreads.