Paper published in a book (Scientific congresses and symposiums)
Exploration of Rationale-Extraction Methods for Closed-Domain Question Answering with a New Sentence-Level Rationale Dataset
Pirenne, Lize; Mokeddem, Samy; Ernst, Damien et al.
2025In Ryutaro Ichise (Ed.) Natural Language Processing and Information Systems
Peer reviewed Dataset
 

Files


Full Text
ExploRE_Pirenne.pdf
Author postprint (722.7 kB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Large Language Models; Rationale Extraction; Classifier; Natural Language Processing; Reinforcement Learning; Fine Tuning; Embedding
Abstract :
[en] In this paper, we address the problem of Rationale Extraction (RE) from Natural Language Processing: given a context (C), a related question (Q) and its answer (A), the task is to find the best sentence-level rationale (R*). This rationale is loosely defined as being the subset of sentences of the context C such that producing A would require at least R*. We have constructed a dataset where each entry is composed of the four terms (C, Q, A, R*) to explore different methods in the particular case where the answer is one or multiple full sentences. The methods studied are based on TF-IDF scores, embedding similarity, classifiers and attention and have been evaluated using a sentence overlap metric akin to the Intersection over Union (IoU). Results show that the best scores were achieved by the classifier-based approach with the nuance of a better scaling with the attention-based method as the size of the context increases, which is a challenge for all other methods. We also show that generating A significantly decreases the performance of the attention-based method, but training the model to generate A can improve the results, linking the ability to generate with the accomplishment of the task.
Research Center/Unit :
Montefiore Institute - Montefiore Institute of Electrical Engineering and Computer Science - ULiège
Disciplines :
Computer science
Author, co-author :
Pirenne, Lize   ;  Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
Mokeddem, Samy   ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Ernst, Damien  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Louppe, Gilles  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Big Data
 These authors have contributed equally to this work.
Language :
English
Title :
Exploration of Rationale-Extraction Methods for Closed-Domain Question Answering with a New Sentence-Level Rationale Dataset
Publication date :
01 July 2025
Event name :
30th Annual International Conference on Natural Language & Information Systems (NLDB 2025)
Event place :
kanazawa, Japan
Event date :
4 July 2025 - 6 July 2025
Audience :
International
Main work title :
Natural Language Processing and Information Systems
Main work alternative title :
[en] NLDB25
Editor :
Ryutaro Ichise;  Institute of Science Tokyo, Tokyo, Japan
Publisher :
Springer, Cham, Switzerland
ISBN/EAN :
978-3-031-97144-0
978-3-031-97143-3
Collection name :
Lecture Notes in Computer Science, vol 15837
Collection ISSN :
0302-9743
Pages :
3-13
Peer review/Selection committee :
Peer reviewed
Name of the research project :
ARIAC by DW4AI
Funders :
Walloon region
Funding number :
2010235
Funding text :
Lize Pirenne gratefully acknowledges the financial support of the Walloon Region for Grant No. 2010235 – ARIAC by DW4AI.
Available on ORBi :
since 28 September 2024

Statistics


Number of views
243 (71 by ULiège)
Number of downloads
148 (22 by ULiège)

Scopus citations®
 
0
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBi