LLMs outperform outsourced human coders on complex textual analysis

Bermejo, Vicente J.; Gago, Andrés; Gálvez, Ramiro H.; Harari, Nicolás

dc.contributor.author	Bermejo, Vicente J.
dc.contributor.author	Gago, Andrés
dc.contributor.author	Gálvez, Ramiro H.
dc.contributor.author	Harari, Nicolás
dc.date.accessioned	2025-11-28T16:39:37Z
dc.date.issued	2025-11-17
dc.description.abstract	This paper evaluates the effectiveness of large language models (LLMs) in extracting complex information from text data. Using a corpus of Spanish news articles, we compare how accurately various LLMs and outsourced human coders reproduce expert annotations on five natural language processing tasks, ranging from named entity recognition to identifying nuanced political criticism in news articles. We find that LLMs consistently outperform outsourced human coders, particularly in tasks requiring deep contextual understanding. These findings suggest that current LLM technology offers researchers without programming expertise a cost-effective alternative for sophisticated text analysis.
dc.description.bibliographicCitation	Bermejo, V.J., Gago, A., Gálvez, R.H. et al. LLMs outperform outsourced human coders on complex textual analysis. Sci Rep 15, 40122 (2025). https://doi.org/10.1038/s41598-025-23798-y
dc.format.extent	19 p.
dc.identifier.uri	https://repositorio.utdt.edu/handle/20.500.13098/13856
dc.language	eng
dc.publisher	Scientific Reports (e-ISSN 2045-2322)
dc.relation.ispartof	Scientific Reports (e-ISSN 2045-2322)
dc.rights	info:eu-repo/semantics/openAccess
dc.rights.license	https://creativecommons.org/licenses/by-sa/2.5/ar/
dc.subject	Inteligencia Artificial
dc.subject	Ciencias Sociales computacionales
dc.subject	Lingüística Informática
dc.subject	Análisis de datos
dc.subject	Artificial intelligence
dc.subject	Computational Social Sciences
dc.subject	Computational Linguistics
dc.subject	Data analysis
dc.subject.keyword	Large Language Models (LLM)
dc.subject.keyword	Natural Language Processing (NLP)
dc.subject.keyword	Automated Textual Analysis
dc.subject.keyword	Human-AI Comparison
dc.subject.keyword	Methodology
dc.subject.keyword	Computer Science applied to Social Sciences
dc.title	LLMs outperform outsourced human coders on complex textual analysis
dc.type	info:eu-repo/semantics/article
dc.type.version	info:eu-repo/semantics/publishedVersion
organization.identifier.ror	https://ror.org/04sxme922

Files

Original bundle

Name: Scientific_reports_Gago_2025.pdf
Size: 2.12 MB
Format: Adobe Portable Document Format

Collections

Documentos de Investigación generados por la Escuela de Negocios