Predicting Secondary Equity Offerings (SEOs) Using Machine Learning

Document Type

Conference Proceeding

Publication Date

7-2-2018

Abstract

This paper explores the application of machine learning techniques in finance to predict if a publicly-traded firm will issue a Seasoned Equity Offering (SEO) by analyzing the firm's 10-Q filing documents with Security and Exchange Commissions (SEC). Specifically, using the information content in the Management Discussion and Analysis section (MD&A) of 10-Q filings, we train five different algorithms, including Logistic Regression (LR), Support Vector Classification (SVC), Multinomial Naïve Bayes (NB), Artificial Neural Network (ANN) and Random Forest (RF). Two types of features, unigrams and phrases are considered. Term frequency-inverse document (TF-IDF) scores are used as independent variables in these models. Experimental results show that the accuracy of phrases-only models has a range of 0-2% improvement for LR, NB, and RF compared with unigrams-only models. The accuracy of phrase-only model for SVC is close to that of unigrams-only model. The 74.53% accuracy of unigrams-only model for SVC classifier performs the best among all tested classifiers. The precision of all models varies between 60% and 75%, while the recall varies between 55% and 85%. Further, we tune model parameters of one linear model (LR) and one non-linear model (RF) to see how these parameters will impact the models' performance. Finally, we apply RF to find the most important features on prediction and find that 'merger' is the most important feature in both unigrams-only model and phrases-only model. We conclude that text mining with SEC financial document filings could be an effective tool to predict important corporate events such as SEO.

Publication Source (Journal or Book title)

Proceedings - 17th IEEE International Conference on Machine Learning and Applications, ICMLA 2018

First Page

1219

Last Page

1224

This document is currently not available here.

Share

COinS