Paper published in a book (Scientific congresses and symposiums)
To what extent are lemmatisation and annotation relevant for deep learning assignments and textual motifs detection? The case-study of Peter Damian’s letters (11th century)
Thon, Valérie; Vanni, Laurent; Longrée, Dominique
2023. In Carbé, Emmanuela; Lo Piccolo, Gabriele; Valenti, Alessia et al. (Eds.) La memoria digitale: forme del testo e organizzazione della conoscenza. Atti del XII Convegno Annuale AIUCD.
Peer reviewed
 

Files


Full Text
Siena_AIUCD_2023.pdf
Author postprint (587.68 kB)
Annexes
AIUCD_ThonVanniLongrée_PDF.pdf
(2.08 MB)
Presentation of the article made at the AIUCD2023.

Details



Keywords :
Deep learning; textual motif; lemmatisation; annotation; Peter Damian
Abstract :
[en] This paper explores to what extent lemmatisation and morphosyntactic annotation matter for deep learning predictions and textual motif detection. A broader study of the style of Peter Damian’s letters (11th century) provided the occasion to examine this question. Using the Hyperdeep platform, we trained two deep learning models on a selection of 12 classical authors, one on lexical forms alone and the other on lemmatised and annotated texts. We then presented both models with the medieval letters of Peter Damian, in order to examine which authors each model deems stylistically close to Peter, and to compare whether the results are similar and whether the same linguistic structures receive a high activation rate. The results suggest that a dialogue between the two methods could be a fruitful path in the search for textual motifs: the first, “lexical” model may indicate rough outlines of these motifs, whereas the second model can offer concrete examples and/or variants of the motifs first identified.
Disciplines :
Languages & linguistics
Computer science
Author, co-author :
Thon, Valérie; Université de Liège - ULiège > Mondes anciens
Vanni, Laurent; Université Côte d'Azur > Bases, Corpus, Langage
Longrée, Dominique; Université de Liège - ULiège > Département des sciences de l'antiquité > Langue et littérature latines
Language :
English
Title :
To what extent are lemmatisation and annotation relevant for deep learning assignments and textual motifs detection? The case-study of Peter Damian’s letters (11th century)
Publication date :
2023
Event name :
La memoria digitale. XII convegno annuale AIUCD
Event place :
Siena, Italy
Event date :
05/06/2023 - 07/06/2023
Audience :
International
Main work title :
La memoria digitale: forme del testo e organizzazione della conoscenza. Atti del XII Convegno Annuale AIUCD.
Editor :
Carbé, Emmanuela
Lo Piccolo, Gabriele
Valenti, Alessia
Stella, Francesco
Publisher :
Università degli Studi di Siena, Siena, Italy
ISBN/EAN :
978-88-942535-7-3
Pages :
254-259
Peer reviewed :
Peer reviewed
Available on ORBi :
since 09 July 2023
