Graph Neural Networks Meet Quantitative Trading
TL;DR: Multi-relational GNNs combining daily stock features with supply chain graphs achieve 36% performance improvement over correlation-only models, reaching Sharpe ratio of ~1.2 on NIFTY 50 stocks.
The Alpha Signal Problem
In quantitative trading, the hunt for uncorrelated alpha signals is relentless. Traditional quant strategies rely heavily on price correlations, technical indicators, and fundamental metrics. But what if we could leverage the structural relationships between companies—their supply chain dependencies—as a predictive signal?
This work, conducted in collaboration with Precog and Alphagrep, explores exactly that: using Graph Neural Networks to model both price correlations and vendor-customer relationships simultaneously, creating a multi-relational graph that captures the interconnected nature of modern markets.
The Setup: Two Graphs, One Model
Most financial ML models treat stocks as independent entities. But markets don't work that way. When Tata Steel's stock moves, it affects not just correlated industrials, but also its direct customers in construction and automotive sectors. We can model this explicitly.
(Yahoo Finance)
(Annual Reports)
Data Sources
Daily Stock Data: We pulled OHLCV (Open, High, Low, Close, Volume) data from Yahoo Finance for NIFTY 50 constituents. Volatility is computed over a 20-day rolling window—the same horizon used for predictions.
Supply Chain Graph: Here's where it gets interesting. We manually extracted vendor-customer relationships from annual reports of NIFTY 50 companies. This gives us directed edges: if Company A lists Company B as a supplier, we create an edge A → B. To handle sparsity, we also added intra-sector edges for companies in the same industry.
Graph Construction: Correlation vs. Structure
We construct two distinct edge types:
- Correlation Graph: Using lagged (t-1) Pearson correlation matrices, we retain edges where correlation > 0.15. This captures co-movement patterns.
- Vendor Graph: Directed edges from the supply chain, augmented with undirected intra-industry edges to compensate for data sparsity.
Architecture: Learning to Trust Signals
Our model uses separate first-layer graph convolutions for each relation type:
- Layer 1: Separate GCN convolutions for correlation edges and vendor edges
- Layer 2+: Shared GCN layers with learnable attention weights that decide how much to trust each signal type
- Temporal Component: We concatenate current features with lagged features from the past 10 days, allowing the model to capture lead-lag dynamics
The architecture essentially asks: "Given this stock's neighbors in the correlation graph and its suppliers/customers, what's the best prediction for tomorrow's return?"
Training: IC Loss & Portfolio Constraints
Unlike typical regression tasks, quant models need to optimize for Information Coefficient (IC)—the rank correlation between predicted and actual returns. We use a custom loss function:
The turnover penalty is critical for real-world trading—excessive position changes kill returns through transaction costs. We tune λ alongside other hyperparameters (lag steps, dropout, hidden dimensions) using Optuna with time-series split validation.
Portfolio Construction
We simulate a market-neutral long/short strategy with realistic constraints:
- Position Weighting: 60% signal strength, 40% inverse volatility (risk parity)
- Sector Caps: No more than 15% exposure per position
- Retention Targets: Minimize turnover while maximizing signal capture
Results: Supply Chain Signal Works
The key experiment: ablation study with and without vendor graph edges.
Model Performance Comparison
Relative IC performance (higher is better)
The verdict: Incorporating supply chain structure improves Information Coefficient by 36% over a correlation-only GNN baseline. The final model achieves a Sharpe ratio of approximately 1.2—respectable for a market-neutral equity strategy without leverage.
Why Does This Work?
Three mechanisms seem to be at play:
- Information Propagation: Supply chain edges allow the model to propagate information from upstream suppliers to downstream customers before it's fully priced into correlations.
- Structural Priors: The vendor graph provides a strong inductive bias—relationships that exist in the real economy, not just emergent price patterns.
- Robustness: During regime changes (when correlations break down), supply chain relationships remain more stable, providing signal continuity.
Limitations & Future Work
This is early-stage research with clear limitations:
- Data Sparsity: Only ~50 stocks, manually extracted supply chain edges
- Transaction Costs: Backtest doesn't include realistic slippage/spread models
- Market Impact: No modeling of how our own trades would move prices
- Single Market: Results are specific to Indian equities (NIFTY 50)
Future Directions
Several promising research directions emerged from this work:
Causal Discovery: Apply the PC algorithm to discover causal relationships in supply chain networks rather than just correlational patterns. This could help distinguish genuine supply chain effects from spurious correlations and identify which vendor relationships actually drive stock movements.
Interpretability Tooling: Develop comprehensive interpretation methods for the GNN:
- Attention Visualization Maps: Visualize which edges (correlation vs. vendor) the model relies on for each prediction
- Backpropagation Through Layers: Track gradient flow to understand how information propagates from suppliers to customers
- Association vs. Causation: Separate genuine causal signals from associational patterns
Graph Topology Analysis: Incorporate classical network metrics as features or constraints:
- Centrality Measures: Degree, betweenness, and eigenvector centrality to identify systemically important stocks
- Graph Entropy: Measure structural complexity and information content of the supply chain network
- Random Ordering Tests: Validate that performance gains come from actual structure, not random edge patterns
Practical Extensions: Expand to broader universes (S&P 500, global equities), incorporate alternative data (shipping records, satellite imagery), and extend the model to handle corporate events (M&A, supplier changes) dynamically.
Conclusion
The intersection of graph learning and quantitative finance is rich territory. By explicitly modeling supply chain relationships alongside traditional price signals, we can build more robust predictors that leverage the actual structure of the economy. The 36% improvement isn't just a number—it's evidence that relational structure matters in financial prediction.
As markets become more interconnected and data becomes richer, approaches that model these dependencies explicitly will likely become table stakes for serious quant shops. The graph is not just a representation—it's the signal.