LLMs outperform outsourced human coders on complex textual analysis

dc.contributor.authorBermejo, Vicente J.
dc.contributor.authorGago, Andrés
dc.contributor.authorGálvez, Ramiro H.
dc.contributor.authorHarari, Nicolás
dc.date.accessioned2025-11-28T16:39:37Z
dc.date.issued2025-11-17
dc.description.abstractThis paper evaluates the effectiveness of large language models (LLMs) in extracting complex information from text data. Using a corpus of Spanish news articles, we compare how accurately various LLMs and outsourced human coders reproduce expert annotations on five natural language processing tasks, ranging from named entity recognition to identifying nuanced political criticism in news articles. We find that LLMs consistently outperform outsourced human coders, particularly in tasks requiring deep contextual understanding. These findings suggest that current LLM technology offers researchers without programming expertise a cost-effective alternative for sophisticated text analysis.
dc.description.bibliographicCitationBermejo, V.J., Gago, A., Gálvez, R.H. et al. LLMs outperform outsourced human coders on complex textual analysis. Sci Rep 15, 40122 (2025). https://doi.org/10.1038/s41598-025-23798-y
dc.format.extent19 p.
dc.format.mediumapplication/pdf
dc.identifier.urihttps://repositorio.utdt.edu/handle/20.500.13098/13856
dc.languageeng
dc.publisherScientific Reports (e-ISSN 2045- 2322)
dc.relation.ispartofScientific Reports (e-ISSN 2045- 2322)
dc.rightsinfo:eu-repo/semantics/openAccess
dc.rights.licensehttps://creativecommons.org/licenses/by-sa/2.5/ar/
dc.subjectInteligencia Artificial
dc.subjectCiencias Sociales computacionales
dc.subjectLingüística Informática
dc.subjectAnálisis de datos
dc.subjectArtificial intelligence
dc.subjectComputational Social Sciences
dc.subjectComputational Linguistics
dc.subjectData analysis
dc.subject.keywordLarge Language Models (LLM)
dc.subject.keywordProcesamiento del Lenguaje Natural (PLN)
dc.subject.keywordAnálisis Textual Automatizado
dc.subject.keywordComparación Humano-IA
dc.subject.keywordMetodología
dc.subject.keywordCiencias de la Computación aplicadas a Ciencias Sociales.
dc.titleLLMs outperform outsourced human coders on complex textual analysis
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion
organization.identifier.rorhttps://ror.org/04sxme922

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Scientific_reports_Gago_2025.pdf
Size:
2.12 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: