New Dataset Takes Aim at Subjective Misinformation in Earnings Calls and Other Public Hearings

Tuesday, December 3, 2024

Georgia Tech researchers have created a dataset that trains computer models to understand nuances in human speech during financial earnings calls. The dataset provides a new resource to study how public correspondence affects businesses and markets.

SubjECTive-QA is the first human-curated dataset on question-answer pairs from earnings call transcripts (ECTs). The dataset teaches models to identify subjective features in ECTs, like clarity and cautiousness.

The dataset lays the foundation for a new approach to identifying disinformation and misinformation caused by nuances in speech. While ECT responses can be technically true, unclear or irrelevant information can misinform stakeholders and affect their decision-making.

Tests on White House press briefings showed that the dataset applies to other sectors with frequent question-and-answer encounters, notably politics, journalism, and sports. This increases the odds of effectively informing audiences and improving transparency across public spheres.

School of Computational Science and Engineering

College of Computing

Search

New Dataset Takes Aim at Subjective Misinformation in Earnings Calls and Other Public Hearings

Recent Stories

Vision AI Models Improve Decision…

Transformer Explainer Shows How AI is More Math than Human…

Tech Swarms into Athens for Clean, Old-Fashioned Computing

News Feed

College of Computing

Georgia Institute of Technology