Decoding Financial Data: Machine Learning Approach to Predict Trading Actions

Yat Chun Fung; Bekzod Amonov

DOI: http://dx.doi.org/10.15439/2024F4556

Citation: Proceedings of the 19th Conference on Computer Science and Intelligence Systems (FedCSIS), M. Bolanowski, M. Ganzha, L. Maciaszek, M. Paprzycki, D. Ślęzak (eds). ACSIS, Vol. 39, pages 739–744 (2024)

Abstract. This paper presents a study on predicting stock trends using a dataset of key financial indicators from 300 S&P 500 companies over a decade. Each company is characterized by 58 financial indicators along with their 1-year changes, which offer insight into potential trends. The objective is to develop predictive models that forecast trading actions (buy, sell, hold) from fundamental financial data. Three machine learning models (Random Forest, CatBoost, and XGBoost classifiers) were trained and combined using two distinct voting mechanisms. The first voting mechanism was used in the competition; the second was developed after the competition, once the test labels had been released. Notably, the second model was still trained solely on the training data. The results show that both voting mechanisms capture trends effectively, as measured by the average error cost computed with the provided error cost matrix.
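
To make the evaluation idea in the abstract concrete, the following is a minimal sketch of combining three classifiers by voting and scoring the result with an average error cost. It is not the authors' method: the abstract does not specify either voting mechanism or the values of the error cost matrix, so the plurality rule, the tie-breaking fallback to "hold", the label encoding, and every number in `COST` are assumptions chosen for illustration only.

```python
import numpy as np

# Hypothetical label encoding (an assumption, not from the paper).
LABELS = ("sell", "hold", "buy")  # encoded as 0, 1, 2

# Hypothetical error cost matrix: rows are true labels, columns are
# predicted labels. The values are illustrative placeholders, not the
# matrix provided in the competition.
COST = np.array([
    [0.0, 1.0, 2.0],
    [1.0, 0.0, 1.0],
    [2.0, 1.0, 0.0],
])


def majority_vote(predictions: np.ndarray, fallback: int = 1) -> np.ndarray:
    """Combine predictions of shape (n_models, n_samples) by plurality.

    When the top vote count is shared by several classes, the sample is
    assigned `fallback` (here "hold"), an assumed tie-breaking rule.
    """
    n_classes = len(LABELS)
    # counts[c, j] = number of models that predicted class c for sample j
    counts = np.stack([(predictions == c).sum(axis=0) for c in range(n_classes)])
    winners = counts.argmax(axis=0)
    tied = (counts == counts.max(axis=0)).sum(axis=0) > 1
    return np.where(tied, fallback, winners)


def average_error_cost(y_true, y_pred, cost: np.ndarray = COST) -> float:
    """Mean of cost[true, predicted] over all samples."""
    return float(cost[np.asarray(y_true), np.asarray(y_pred)].mean())


# Toy predictions standing in for the three trained classifiers.
model_preds = np.array([
    [2, 0, 1, 2],  # stand-in for Random Forest
    [2, 0, 0, 1],  # stand-in for CatBoost
    [1, 0, 2, 2],  # stand-in for XGBoost
])
y_true = np.array([2, 0, 1, 0])

voted = majority_vote(model_preds)
score = average_error_cost(y_true, voted)
```

Under these assumptions a lower score is better, and a cost matrix of this shape penalizes confusing "buy" with "sell" more heavily than confusing either with "hold".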
