Multi-timeframe algorithmic trading bots using thick data heuristics with deep reinforcement learning
Abstract
This thesis presents an augmented Artificial Intelligence (AI) algorithmic trading
approach that combines Thick Data Heuristics (TDH), with Deep Reinforcement Learning
(DRL), to successfully learn trading execution timing policies. In this thesis, combining the
augmented AI human trader’s intuition and heuristics with DRL techniques to provide more
focused drivers for trading order execution timing is explored. In this financial technology
(Fintech) research, the goal is to solve the sequential decision-making problem of AI for
profitable day and swing trading order timing executions. Enabling trading bots with cognitive
intelligence and common-sense heuristics will offer traders including automatic traders an
insight to understand the day-to-day swing trading timeframes indicators and arrive at mature
trading decision-making. This thesis examines the performance of bots with Nasdaq and NYSE
stocks that have a strong catalyst (info. which increases directional momentum) to find that they
outperform benchmark algorithmic trading approaches. The thesis illustrates to the reader how to
combine TDH and Deep Q-networks (DQN) into a TDH-DQN augmented AI trading bot. The
bot learns through test data to predict the optimal timing of order executions autonomously on
idealized trading time series data. The results show the TDH-DQN bot outperformed the buy and
hold strategy plus two out of the three benchmark algorithmic trading strategies.