Contribution to collective works (Parts of books)
Declarative Generation of RDF Collections and Containers from Heterogeneous Data
Debruyne, Christophe; Jaadari, Souail; Chaves-Fraga, David
2025. In Studies on the Semantic Web
Peer reviewed
 

Files


Full Text
SSW-62-SSW250007.pdf
Publisher postprint (523.6 kB) Creative Commons License - Attribution

Abstract :
Purpose: This paper addresses the lack of practical support for generating RDF Collections and Containers from heterogeneous sources in existing mapping tools with the RDF Mapping Language (RML). While the RML Collections and Containers (CC) module defines their generation, RML-CC implementations remain limited to BURP, a reference implementation that was not conceived for efficiency or scalability. We aim to close this gap by extending a tool for efficient RML generation with support for RML-CC.
Methodology: We extended Morph-KGC to support RML-CC and developed YARRRML-CC for user-friendly mapping definitions. We also updated Yatter to enable translation from YARRRML-CC to RML-CC. We validated these tools using 35 RML-CC test cases and 22 additional YARRRML-CC cases, and conducted performance evaluations using synthetic datasets.
Findings: While BURP passes all RML-CC test cases and scales up to 1M records, Morph-KGC passes only 51% due to architectural constraints and struggles with larger datasets: its reliance on Pandas limits support for complex constructs such as nested collections. Yatter fully supports YARRRML-CC. Morph-KGC performs well in standard RDF generation but struggles with RML-CC at scale. This highlights the importance of selecting tools that align with the structural complexity and performance demands of specific use cases.
Value: Our work enhances the practical applicability of RML-CC in knowledge graph construction by providing tooling options (BURP for the Java ecosystem and Morph-KGC for complete Python pipelines), interoperability through YARRRML-CC, and validated performance insights.
Disciplines :
Computer science
Author, co-author :
Debruyne, Christophe; Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
Jaadari, Souail; Montefiore Institute, University of Liège, Belgium
Chaves-Fraga, David; Intelligent Systems Group, Universidade de Santiago de Compostela, Spain; CiTIUS, Universidade de Santiago de Compostela, Spain
Language :
English
Title :
Declarative Generation of RDF Collections and Containers from Heterogeneous Data
Publication date :
03 September 2025
Main work title :
Studies on the Semantic Web
Publisher :
IOS Press
ISBN/EAN :
978-1-64368-616-5
Peer reviewed :
Peer reviewed
Available on ORBi :
since 21 September 2025
