Graph Neural Networks Meet Quantitative Trading

TL;DR: Multi-relational GNNs combining daily stock features with supply chain graphs achieve 36% performance improvement over correlation-only models, reaching Sharpe ratio of ~1.2 on NIFTY 50 stocks.

The Alpha Signal Problem

In quantitative trading, the hunt for uncorrelated alpha signals is relentless. Traditional quant strategies rely heavily on price correlations, technical indicators, and fundamental metrics. But what if we could leverage the structural relationships between companies—their supply chain dependencies—as a predictive signal?

This work, conducted in collaboration with Precog and Alphagrep, explores exactly that: using Graph Neural Networks to model both price correlations and vendor-customer relationships simultaneously, creating a multi-relational graph that captures the interconnected nature of modern markets.

The Setup: Two Graphs, One Model

Most financial ML models treat stocks as independent entities. But markets don't work that way. When Tata Steel's stock moves, it affects not just correlated industrials, but also its direct customers in construction and automotive sectors. We can model this explicitly.

Stock OHLCV Data
(Yahoo Finance)
+
Supply Chain Graph
(Annual Reports)
Multi-Relational GNN
Return Predictions

Data Sources

Daily Stock Data: We pulled OHLCV (Open, High, Low, Close, Volume) data from Yahoo Finance for NIFTY 50 constituents. Volatility is computed over a 20-day rolling window—the same horizon used for predictions.

Supply Chain Graph: Here's where it gets interesting. We manually extracted vendor-customer relationships from annual reports of NIFTY 50 companies. This gives us directed edges: if Company A lists Company B as a supplier, we create an edge A → B. To handle sparsity, we also added intra-sector edges for companies in the same industry.

Graph Construction: Correlation vs. Structure

We construct two distinct edge types:

  • Correlation Graph: Using lagged (t-1) Pearson correlation matrices, we retain edges where correlation > 0.15. This captures co-movement patterns.
  • Vendor Graph: Directed edges from the supply chain, augmented with undirected intra-industry edges to compensate for data sparsity.
Key Insight: The correlation graph captures emergent market dynamics, while the vendor graph captures fundamental business relationships. A multi-relational GNN can learn to weight both signals dynamically.

Architecture: Learning to Trust Signals

Our model uses separate first-layer graph convolutions for each relation type:

  1. Layer 1: Separate GCN convolutions for correlation edges and vendor edges
  2. Layer 2+: Shared GCN layers with learnable attention weights that decide how much to trust each signal type
  3. Temporal Component: We concatenate current features with lagged features from the past 10 days, allowing the model to capture lead-lag dynamics

The architecture essentially asks: "Given this stock's neighbors in the correlation graph and its suppliers/customers, what's the best prediction for tomorrow's return?"

Training: IC Loss & Portfolio Constraints

Unlike typical regression tasks, quant models need to optimize for Information Coefficient (IC)—the rank correlation between predicted and actual returns. We use a custom loss function:

ICLoss = 0.5 × (1 - Pearson IC) + 0.5 × MSE + λ × Turnover_Penalty

The turnover penalty is critical for real-world trading—excessive position changes kill returns through transaction costs. We tune λ alongside other hyperparameters (lag steps, dropout, hidden dimensions) using Optuna with time-series split validation.

Portfolio Construction

We simulate a market-neutral long/short strategy with realistic constraints:

  • Position Weighting: 60% signal strength, 40% inverse volatility (risk parity)
  • Sector Caps: No more than 15% exposure per position
  • Retention Targets: Minimize turnover while maximizing signal capture
Sharpe Ratio
~1.2
Performance Gain
+36%
Market
NIFTY 50

Results: Supply Chain Signal Works

The key experiment: ablation study with and without vendor graph edges.

Model Performance Comparison

Baseline
GNN
(Correlation Only)
+36%
GNN
(+ Supply Chain)

Relative IC performance (higher is better)

The verdict: Incorporating supply chain structure improves Information Coefficient by 36% over a correlation-only GNN baseline. The final model achieves a Sharpe ratio of approximately 1.2—respectable for a market-neutral equity strategy without leverage.

Why Does This Work?

Three mechanisms seem to be at play:

  1. Information Propagation: Supply chain edges allow the model to propagate information from upstream suppliers to downstream customers before it's fully priced into correlations.
  2. Structural Priors: The vendor graph provides a strong inductive bias—relationships that exist in the real economy, not just emergent price patterns.
  3. Robustness: During regime changes (when correlations break down), supply chain relationships remain more stable, providing signal continuity.
Practical Note: Extracting supply chain data from annual reports is labor-intensive. For production systems, this would need to be automated (NLP on 10-Ks/annual reports) or sourced from commercial vendors like FactSet or Bloomberg Supply Chain Analytics.

Limitations & Future Work

This is early-stage research with clear limitations:

  • Data Sparsity: Only ~50 stocks, manually extracted supply chain edges
  • Transaction Costs: Backtest doesn't include realistic slippage/spread models
  • Market Impact: No modeling of how our own trades would move prices
  • Single Market: Results are specific to Indian equities (NIFTY 50)

Future Directions

Several promising research directions emerged from this work:

Causal Discovery: Apply the PC algorithm to discover causal relationships in supply chain networks rather than just correlational patterns. This could help distinguish genuine supply chain effects from spurious correlations and identify which vendor relationships actually drive stock movements.

Interpretability Tooling: Develop comprehensive interpretation methods for the GNN:

  • Attention Visualization Maps: Visualize which edges (correlation vs. vendor) the model relies on for each prediction
  • Backpropagation Through Layers: Track gradient flow to understand how information propagates from suppliers to customers
  • Association vs. Causation: Separate genuine causal signals from associational patterns

Graph Topology Analysis: Incorporate classical network metrics as features or constraints:

  • Centrality Measures: Degree, betweenness, and eigenvector centrality to identify systemically important stocks
  • Graph Entropy: Measure structural complexity and information content of the supply chain network
  • Random Ordering Tests: Validate that performance gains come from actual structure, not random edge patterns

Practical Extensions: Expand to broader universes (S&P 500, global equities), incorporate alternative data (shipping records, satellite imagery), and extend the model to handle corporate events (M&A, supplier changes) dynamically.

Conclusion

The intersection of graph learning and quantitative finance is rich territory. By explicitly modeling supply chain relationships alongside traditional price signals, we can build more robust predictors that leverage the actual structure of the economy. The 36% improvement isn't just a number—it's evidence that relational structure matters in financial prediction.

As markets become more interconnected and data becomes richer, approaches that model these dependencies explicitly will likely become table stakes for serious quant shops. The graph is not just a representation—it's the signal.