LLMs outperform outsourced human coders on complex textual analysis
| dc.contributor.author | Bermejo, Vicente J. | |
| dc.contributor.author | Gago, Andrés | |
| dc.contributor.author | Gálvez, Ramiro H. | |
| dc.contributor.author | Harari, Nicolás | |
| dc.date.accessioned | 2025-11-28T16:39:37Z | |
| dc.date.issued | 2025-11-17 | |
| dc.description.abstract | This paper evaluates the effectiveness of large language models (LLMs) in extracting complex information from text data. Using a corpus of Spanish news articles, we compare how accurately various LLMs and outsourced human coders reproduce expert annotations on five natural language processing tasks, ranging from named entity recognition to identifying nuanced political criticism in news articles. We find that LLMs consistently outperform outsourced human coders, particularly in tasks requiring deep contextual understanding. These findings suggest that current LLM technology offers researchers without programming expertise a cost-effective alternative for sophisticated text analysis. | |
| dc.description.bibliographicCitation | Bermejo, V.J., Gago, A., Gálvez, R.H. et al. LLMs outperform outsourced human coders on complex textual analysis. Sci Rep 15, 40122 (2025). https://doi.org/10.1038/s41598-025-23798-y | |
| dc.format.extent | 19 p. | |
| dc.format.medium | application/pdf | |
| dc.identifier.uri | https://repositorio.utdt.edu/handle/20.500.13098/13856 | |
| dc.language | eng | |
| dc.publisher | Scientific Reports (e-ISSN 2045- 2322) | |
| dc.relation.ispartof | Scientific Reports (e-ISSN 2045- 2322) | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.rights.license | https://creativecommons.org/licenses/by-sa/2.5/ar/ | |
| dc.subject | Inteligencia Artificial | |
| dc.subject | Ciencias Sociales computacionales | |
| dc.subject | Lingüística Informática | |
| dc.subject | Análisis de datos | |
| dc.subject | Artificial intelligence | |
| dc.subject | Computational Social Sciences | |
| dc.subject | Computational Linguistics | |
| dc.subject | Data analysis | |
| dc.subject.keyword | Large Language Models (LLM) | |
| dc.subject.keyword | Procesamiento del Lenguaje Natural (PLN) | |
| dc.subject.keyword | Análisis Textual Automatizado | |
| dc.subject.keyword | Comparación Humano-IA | |
| dc.subject.keyword | Metodología | |
| dc.subject.keyword | Ciencias de la Computación aplicadas a Ciencias Sociales. | |
| dc.title | LLMs outperform outsourced human coders on complex textual analysis | |
| dc.type | info:eu-repo/semantics/article | |
| dc.type.version | info:eu-repo/semantics/publishedVersion | |
| organization.identifier.ror | https://ror.org/04sxme922 |
