Keywords

textual analysis, machine learning, disclosure, conference calls

Abstract

We compare the ability of dictionary-based and machine-learning methods to capture disclosure sentiment at 10-K filing and conference call dates. Like Loughran and McDonald (2011), we use returns to assess sentiment. We find that measures based on machine learning offer a significant improvement in explanatory power over dictionary-based measures. Specifically, machine-learning measures explain returns at 10-K filing dates, while measures based on the Loughran and McDonald dictionary only explain returns at 10-K filing dates during the time period of their study. Moreover, at conference-call dates, machine-learning methods offer an improvement over the Loughran and McDonald dictionary method of a greater magnitude than the improvement of the Loughran and McDonald dictionary over the Harvard Psychosociological dictionary. We further find that the random forest regression tree method better captures disclosure sentiment than alternative algorithms, simplifying the application of the machine-learning approach. Overall, our results suggest that machine-learning methods offer an easily implementable, more powerful and reliable measure of disclosure sentiment than dictionary-based methods.

Original Publication Citation

Frankel, R., J. Jennings, and J. Lee. 2022. "Disclosure Sentiment: Machine Leaming vs Dictionary Methods," Management Science, 68 (7), 4755-5555.

Document Type

Peer-Reviewed Article

Publication Date

2021

Publisher

Management Science

Language

English

College

Marriott School of Business

Department

Accountancy

University Standing at Time of Publication

Full Professor

Included in

Accounting Commons

Share

COinS