Classification; Corpus driven; Deep learning; Intertextuality; Case-studies; Hybrid architectures; Learning architectures; Multi channel; New approaches; Reuse; Textual data; Computer Science Applications; Information Systems; Information Systems and Management; Analysis
Abstract :
[en] The detection of intertextuality is at the heart of many linguistic studies. A lot of efforts are currently underway to provide tools for analyzing relationships between authors. Most of them use standard statistics to compare textual data and find traces of text reuse from one author to another. The main objective of this work is to provide a new approach based on deep learning architectures and corpus-driven analysis. Building on previous contributions, we propose a hybrid architecture called multichannel convolutional transformer (MCT). Using this method, we develop a new tool for intertextuality detection based on authorship attribution. We have empirically demonstrated its efficiency using a Latin corpus. We conclude that our model can highlight complex linguistic patterns as features responsible for the classification decision. We consider these patterns as new categories of intertextuality traces, complementary to the existing ones.
Disciplines :
Languages & linguistics
Author, co-author :
Vanni, Laurent; CNRS, UCA, Nice, France
Mahmoudi, Hadi; UCA, Nice, France
Longrée, Dominique ; Université de Liège - ULiège > Département des sciences de l'antiquité > Langue et littérature latines
Mayaffre, Damon; CNRS, Nice, France
Language :
English
Title :
Multichannel Convolutional Transformer and Intertextuality: A Latin Case Study
Publication date :
2024
Event name :
JADT
Event place :
Naples, Ita
Event date :
06-07-2022 => 08-07-2022
Audience :
International
Main work title :
New Frontiers in Textual Data Analysis
Editor :
Giordano, Giuseppe
Publisher :
Springer Science and Business Media Deutschland GmbH
Apoorv, N. (2020). Text classification with transformer. In GitHub repository https://github.com/keras-team/keras-io
Barthes, R. (1973). Text (theory of). Encyclopaedia universalis.
Buchler, M. (2013). Informationstechnische Aspekte des Historical Text Re-use. PhD thesis, Universitat Leipzig.
Burns, P. J., Brofos, J. A., Li, K., Chaudhuri, P., & Dexter, J. P. (2021). Profiling of intertextuality in Latin literature using word embeddings. In Proceedings of NAACL-HLT2021 (pp. 4900–4907). ACL.
Capozzi, R. (1990). Palimpsests and laughter: The dialogical pleasure of unlimited intertextuality in thenameofthe rose. Italica, 66, 412–428.
Chattopadhay, A., Sarkar, A., Howlader, P., & Balasubramanian, V. N. (2018). Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks. In 2018 IEEE WACV (pp. 839–847).
Chen, Y., Wang, K., Yang, W., Qing, Y., et al. (2020). A multi-channel deep neural network for relation extraction. IEEE Access, 8, 13195–13203.
Coffee, N., Koenig, J.-P., Shakti, P., Ossewaarde, R., et al. (2013). The tesserae project: Intertextual analysis of Latin poetry. Literary and Linguistic Computing, 28, 221–228.
Feng, Y., & Cheng, Y. (2021). Short text sentiment analysis based on multi-channel CNN with multi-head attention mechanism. IEEE Access, 9, 19854–19863.
Forstall, C. W., & Scheirer, W. J. (2019). Quantitative intertextuality: Analyzing the markers of information reuse/Christopher W. Forstall, Walter J. Scheier. Springer. ISBN 9783030234133.
Gehring, J., Auli, M., Grangier, D., Yarats, D., & Dauphin, Y. (2017). Convolutional sequence to sequence learning. In International Conference on Machine Learning.
Genette, G. (1982). Palimpsestes: la littérature au second degrê. Collection Poétique. Éditions du Seuil. ISBN 2-02-006116-3.
Joulin, A., Grave, E., Bojanowski, P., & Mikolov, T. (2016). Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759.
Kim, Y. (2014). Convolutional neural networks for sentence classification. In Proceedings of EMNLP 2014, Doha (pp. 1746–1751). ACL.
Kristeva, J. (1969). Sèméiotiké. Seuil.
Lafon, P. (1980). Sur la variabilité de la fréquence des formes DANs un corpus. Mots, 1, 127–165.
Lapesa, G., Evert, S., & Schulte im Walde, S. (2014). Contrasting syntagmatic and paradigmatic relations: Insights from distributional semantic models. In Proceedings of SEM 2014, Dublin (pp. 160–170). ACL.
Li, P., Zhong, P., Mao, K., Wang, D., Yang, X., Liu, Y., Yin, J., & See, S. (2021). Act: An attentive convolutional transformer for efficient text classification. Proceedings of the AAAI 2021, 35(15), 13261–13269.
Selvaraju, R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). Grad-cam: Visual explanations from deep networks via gradient-based localization. In IEEE ICCV 2017 (pp. 618–626).
Tognini-Bonelli, E. (2002). Corpus linguistics at work. Computational Linguistics, 28, 583–583.
Vanni, L., Ducoffe, M., Mayaffre, D., Precioso, F., et al. (2018). Text deconvolution saliency (TDS): A deep tool box for linguistic analysis. In Proceedings of ACL 2018, Melbourne (pp. 548–557).
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., et al. (2017). Attention is all you need. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, & R. Garnett (Eds.), Advances in Neural Information Processing Systems (Vol. 30, pp. 5998–6008).
Verkerk, P., Ouvrard, Y., Fantoli, M., & Longrée, D. (2019). L.A.S.L.A. and Collatinus: A convergence in lexica. Studi e saggi linguistici 2020, 58, 1–26.
Wallace, E., Gardner, M., & Singh, S. (2020). Interpreting predictions of NLP models. In Proceedings of the EMNLP 2020 (pp. 20–23). ACL.
Wu, H., Xiao, B., Codella, N., Liu, M., Dai, X., Yuan, L., & Zhang, L. (2021). Cvt: Introducing convolutions to vision transformers. In IEEE/CVF ICCV 2021 (pp. 22–31).