Keywords
textual analysis, machine learning, disclosure, conference calls
Abstract
We compare the ability of dictionary-based and machine-learning methods to capture disclosure sentiment at 10-K filing and conference call dates. Like Loughran and McDonald (2011), we use returns to assess sentiment. We find that measures based on machine learning offer a significant improvement in explanatory power over dictionary-based measures. Specifically, machine-learning measures explain returns at 10-K filing dates, while measures based on the Loughran and McDonald dictionary only explain returns at 10-K filing dates during the time period of their study. Moreover, at conference-call dates, machine-learning methods offer an improvement over the Loughran and McDonald dictionary method of a greater magnitude than the improvement of the Loughran and McDonald dictionary over the Harvard Psychosociological dictionary. We further find that the random forest regression tree method better captures disclosure sentiment than alternative algorithms, simplifying the application of the machine-learning approach. Overall, our results suggest that machine-learning methods offer an easily implementable, more powerful and reliable measure of disclosure sentiment than dictionary-based methods.
Original Publication Citation
Frankel, R., J. Jennings, and J. Lee. 2022. "Disclosure Sentiment: Machine Leaming vs Dictionary Methods," Management Science, 68 (7), 4755-5555.
BYU ScholarsArchive Citation
Frankel, Richard; Jennings, Jared; and Lee, Joshua A., "Disclosure Sentiment: Machine Learning vs Dictionary Methods" (2021). Faculty Publications. 8488.
https://scholarsarchive.byu.edu/facpub/8488
Document Type
Peer-Reviewed Article
Publication Date
2021
Publisher
Management Science
Language
English
College
Marriott School of Business
Department
Accountancy
Copyright Use Information
https://lib.byu.edu/about/copyright/