← SELECTED WORK

QUANTITATIVE TRADING · AI ENGINEERING

Knowing what to buy — and exactly when to act on it.

AI Trading System — LSTM share selection and reinforcement learning execution

+47%

PROFIT VS BUY-AND-HOLD

2-stage

LSTM SELECTION → RL TIMING

+12%

RETURN LIFT FROM NEWS

01 / THE PROBLEM

Two problems sink most active traders: picking the right shares, and catching the right moments to enter and exit positions. Human intuition is poorly suited to either — let alone both simultaneously — when applied consistently across a market-wide universe of daily prices, fundamentals, and news.

Most approaches solve one or the other. Selection models pick what to hold but ignore when to move. Timing models react to price without knowing what's worth holding. The two problems are coupled — getting one right while ignoring the other still loses money.

The goal: automate both decisions in a single pipeline — a model for what to hold and a model for when to act — refreshed every morning before the open on all data through the prior day.

02 / THE HARD PARTS

SELECTION

Ranking hundreds of shares for a one-month horizon requires fusing price-action, fundamentals, macro, volatility and options signals — and proving which architecture is actually best, not assumed.

TIMING

Deciding when to buy, hold or sell is a sequential decision problem with a delayed, noisy reward signal. Supervised learning doesn't apply; the agent must balance short-term gains against drawdown without overfitting to history.

DATA

Years of global news needed to become a compact, stable daily signal — without the cost or brittleness of a commercial feed. A commercial source was evaluated and replaced by open data when it proved too limited.

03 / WHAT MADE IT WORK

LSTM selection model

A rigorous bake-off against gradient-boosted trees and 1-D convolutional nets established the LSTM as the right architecture for this data. The model ranks the full market by expected one-month return, ingesting price-action, fundamentals, macro, volatility and options signals fused into a structured daily feature set.

Reinforcement learning for timing

The entry/exit agent went through several generations of reinforcement learning, each more stable and risk-aware — evolving from value-based approaches toward policy-based methods with continuous position sizing. Risk-adjusted reward shaping prevents the agent from chasing returns at the expense of consistency.

News as a quantitative signal

Years of global news distilled into daily language-model embeddings, then compressed to a compact signal the models can actually use — adding approximately 12% annual return. Evaluating and replacing a commercial news feed with open source proved the signal was robust and not a data artefact.

04 / THE SYSTEM

LSTM Selection Model

Ranks the market by expected one-month return; architecture chosen by rigorous competitive bake-off.

RL Entry/Exit Agent

Continuous position sizing and risk-adjusted reward shaping; trained directly on selection output.

Alternative Data Pipeline

Global news distilled into compact daily embeddings — adds ~12% annual return to the model.

Daily Cloud Automation

Ingests prior-day data, re-runs both models, outputs decisions — all before each morning's open.

Multi-Share Portfolio Engine

Manages a basket of large-cap equities; portfolio aggregation smooths single-share noise into edge.

Rigorous Evaluation Framework

Explicit baselines at every stage; cross-validated and benchmarked on risk-adjusted metrics.

Live Trading Infrastructure

Fully unattended from data ingestion to position decision — runs every morning without intervention.

END-TO-END

AI ENGINEERING

05 / OUR ROLE

Full R&D ownership: model architecture decisions, data engineering, training methodology and end-to-end automation. Every generation was defined against an explicit baseline, cross-validated and benchmarked on risk-adjusted terms — not just raw returns.

06 / OUTCOME

In two-year backtests the multi-share portfolio earned 46–47% more profit than buy-and-hold from the same starting capital. Even the broad-market ETF alone beat buy-and-hold by ~39% with the model active. The system now runs a fully automated daily data ingestion, selection and entry/exit decision pipeline — unattended, every morning before the open.