Paper published in a book (Scientific congresses and symposiums)
Machine Extraction of Tax Laws from Legislative Texts
Ash, Elliott; Guillot, Malka; Han, Luyang
2025In Proceedings of the Natural Legal Language Processing Workshop 2021
Peer reviewed
 

Files


Full Text
_2021.nllp-1.7.pdf
(335.45 kB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Abstract :
[en] Using a corpus of compiled codes from U.S. states containing labeled tax law sections, we train text classifiers to automatically tag taxlaw documents and, further, to identify the associated revenue source (e.g. income, property, or sales). After evaluating classifier performance in held-out test data, we apply them to an historical corpus of U.S. state legislation to extract the flow of relevant laws over the years 1910 through 2010. We document that the classifiers are effective in the historical corpus, for example by automatically detecting establishments of state personal income taxes. The trained models with replication code are published at https://github.com/ luyang521/tax-classification.
Disciplines :
Business & economic sciences: Multidisciplinary, general & others
Author, co-author :
Ash, Elliott
Guillot, Malka  ;  Université de Liège - ULiège > HEC Liège : UER > UER Economie : Microéconomie appliquée
Han, Luyang
Language :
English
Title :
Machine Extraction of Tax Laws from Legislative Texts
Publication date :
November 2025
Event name :
Natural Legal Language Processing Workshop 2021
Event date :
2021
Audience :
International
Main work title :
Proceedings of the Natural Legal Language Processing Workshop 2021
Publisher :
Association for Computational Linguistics, Punta Cana, Dominican Republic
Peer reviewed :
Peer reviewed
Available on ORBi :
since 08 January 2025

Statistics


Number of views
45 (1 by ULiège)
Number of downloads
29 (0 by ULiège)

Bibliography


Similar publications



Contact ORBi