How Autoencoders Beat PCA for Non-Linear Signals
The financial markets produce vast amounts of data, and our ability to make sense of it often determines our success. Two powerful tools for this task are Principal Component Analysis ( PCA ) and Autoencoders . While both are used to reduce the dimensions of our data, they do it in fundamentally different ways, which can have a big impact on a trading model's performance. Our Simulated Market Data To rigorously compare PCA and autoencoders, we generated synthetic data with specific underlying structures. Our dataset comprises several features, some exhibiting linear correlations with the target variable, while others are designed to have non-linear relationships. For instance, we introduced: Linear Dependencies: Certain features were directly and linearly related to the target, representing straightforward correlations one might find in simpler market dynamics. Distance-Based Relationships: The target variable was also influenced by the proximity of data points in a two-dimension...