No document available.
Abstract :
[en] Lemmatising and morphosyntactically labelling a Latin text is a time-consuming process. Focusing in this contribution on the epistolary corpus of Peter Damian (eleventh century), an ecclesiastical author of 180 Latin letters, we cross intertextual distance calculation (Brunet and Jaccard) and a deep learning model trained on authorship classification on a selection of unlemmatised texts from 39 of his literary predecessors; the idea is to theoretically identify which text(s) share a similar style to Peter, and would therefore be suitable candidates for a precise lemmatisation. A dialogue between both methods seems promising, and the areas of activation in the deep learning model even suggest a recognition of complex linguistic patterns that Peter possibly shares with some of his predecessors.
Scopus citations®
without self-citations
0