计算机科学
透明度(行为)
政府(语言学)
退伍军人事务部
问责
领域(数学分析)
工作(物理)
监管事务
公共关系
政治学
公共行政
计算机安全
语言学
法学
医学
工程类
机械工程
内科学
数学分析
哲学
数学
作者
Alejandro Peña,Aythami Morales,Julián Fiérrez,Ignacio Serna,Javier Ortega-García,Íñigo Puente,Jorge Córdova,Gonzalo Córdova
标识
DOI:10.1007/978-3-031-41498-5_2
摘要
The analysis of public affairs documents is crucial for citizens as it promotes transparency, accountability, and informed decision-making. It allows citizens to understand government policies, participate in public discourse, and hold representatives accountable. This is crucial, and sometimes a matter of life or death, for companies whose operation depend on certain regulations. Large Language Models (LLMs) have the potential to greatly enhance the analysis of public affairs documents by effectively processing and understanding the complex language used in such documents. In this work, we analyze the performance of LLMs in classifying public affairs documents. As a natural multi-label task, the classification of these documents presents important challenges. In this work, we use a regex-powered tool to collect a database of public affairs documents with more than 33K samples and 22.5M tokens. Our experiments assess the performance of 4 different Spanish LLMs to classify up to 30 different topics in the data in different configurations. The results shows that LLMs can be of great use to process domain-specific documents, such as those in the domain of public affairs.
科研通智能强力驱动
Strongly Powered by AbleSci AI