No full text
Paper published in a book (Scientific congresses and symposiums)
Deep Learning As an Aid to Text Mining in the Choice of Texts to Lemmatise for a Comparison Corpus: A Stylistic Study of Peter Damian’s Letters
Thon, Valérie; Vanni, Laurent; Longrée, Dominique
2024In Giordano, Giuseppe (Ed.) New Frontiers in Textual Data Analysis
Peer reviewed
 

Files


Full Text
No document available.

Send to



Details



Keywords :
Lemmatisation; Morphosyntax; Prediction; Distance calculation; Labelings; Learning models; Lemmatization; Linguistic patterns; Text-mining; Computer Science Applications; Information Systems; Information Systems and Management; Analysis
Abstract :
[en] Lemmatising and morphosyntactically labelling a Latin text is a time-consuming process. Focusing in this contribution on the epistolary corpus of Peter Damian (eleventh century), an ecclesiastical author of 180 Latin letters, we cross intertextual distance calculation (Brunet and Jaccard) and a deep learning model trained on authorship classification on a selection of unlemmatised texts from 39 of his literary predecessors; the idea is to theoretically identify which text(s) share a similar style to Peter, and would therefore be suitable candidates for a precise lemmatisation. A dialogue between both methods seems promising, and the areas of activation in the deep learning model even suggest a recognition of complex linguistic patterns that Peter possibly shares with some of his predecessors.
Disciplines :
Computer science
Languages & linguistics
Author, co-author :
Thon, Valérie  ;  Université de Liège - ULiège > Mondes anciens
Vanni, Laurent;  CNRS, UCA, UMR7320 BCL, Nice, France
Longrée, Dominique ;  Université de Liège - ULiège > Département des sciences de l'antiquité > Langue et littérature latines
Language :
English
Title :
Deep Learning As an Aid to Text Mining in the Choice of Texts to Lemmatise for a Comparison Corpus: A Stylistic Study of Peter Damian’s Letters
Publication date :
24 September 2024
Event name :
JADT 2022
Event place :
Naples, Ita
Event date :
06-07-2022 => 08-07-2022
Main work title :
New Frontiers in Textual Data Analysis
Editor :
Giordano, Giuseppe
Publisher :
Springer Science and Business Media Deutschland GmbH
ISBN/EAN :
978-3-03-155916-7
Pages :
173-184
Peer review/Selection committee :
Peer reviewed
Available on ORBi :
since 05 November 2024

Statistics


Number of views
52 (1 by ULiège)
Number of downloads
0 (0 by ULiège)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBi