Data Cleaning
We primarily collect data from the Fair Disclosure (FD) Wire corpus on Nexis. A regulatory change in August 2000 (“Regulation Fair Disclosure”) led to mandatory public reporting requirements around how companies disclose information. Fair Disclosure (FD) Wire began in April 2001 as a press release service to disseminate information for companies in response to this regulatory change. One service FD Wire provides is the dissemination of earnings call transcripts, which companies opt into. We are searching for FD Wire transcripts on LexisNexis, using the Nexis API. It appears that earnings calls are not consistently available before 2005, though for some companies, call transcripts are available back to 2001. As a result, we are not only looking at earnings calls from 2005 to 2024.
We are looking at three sectors of the Global Industry Classification Standard (GICS®). GICS® is an industry analysis framework that helps investors understand the key business activities for companies around the world. MSCI and S&P Dow Jones Indices developed this classification standard to provide investors with consistent and exhaustive industry definitions. The three sectors include financial, energy, and materials. From here, we filter out irrelevant companies from the dataframe based on PERMNO–a unique stock (share class) level identifier, sort them by date, and verify if we have the earnings calls from the desired timeframe, flag duplicates, and retrieve missing earnings calls in either Nexis Lexis or Factiva.
Natural Language Processing
forthcoming
In-Depth Interviewing
forthcoming