TraQuLA: Transparent Question Answering Over RDF Through Linguistic Analysis

Tutkimustuotos: KonferenssiartikkeliTieteellinenvertaisarvioitu

Abstrakti

Answering complex questions over knowledge graphs has gained popularity recently. Systems based on large language models seem to achieve top performance. However, these models may generate content that looks reasonable but is incorrect. They also lack transparency, making it impossible to exactly explain why a particular answer was generated. To tackle these problems we present the TraQuLA (Transparent QUestion-answering through Linguistic Analysis) system – a rule-based system developed through linguistic analysis of datasets of complex questions over DBpedia and Wikidata. TraQuLA defines a question’s type and extracts its semantic component candidates (named entities, properties and class names). For the extraction of properties, whose natural language verbalisations are most diverse, we built an extensive database which matches DBpedia/Wikidata properties to natural language expressions, allowing linguistic variation. TraQuLA generates semantic parses for the components and ranks them by each question’s structure and morphological features. The ranked parses are then analysed top down according to their patterns, also noting linguistic aspects, until a solution is found and a SPARQL query is produced. TraQuLA outperforms the existing baseline systems on the LC-QuAD 1.0 and competes with ChatGPT-based systems on LC-QuAD 2.0. For the LC-QuAD 1.0 test set, we developed an evaluation approach that accepts multiple ways to answer the questions (some ignored by the dataset) and curated some errors. TraQuLa contains no “black boxes” of neural networks or machine learning and makes its answer construction traceable. Users can therefore better rely on them and assess their correctness.

AlkuperäiskieliEnglanti
OtsikkoWeb Engineering - 24th International Conference, ICWE 2024, Proceedings
ToimittajatKostas Stefanidis, Kari Systä, Maristella Matera, Sebastian Heil, Haridimos Kondylakis, Elisa Quintarelli
KustantajaSpringer
Sivut19-33
Sivumäärä15
ISBN (elektroninen)978-3-031-62362-2
ISBN (painettu)978-3-031-62361-5
DOI - pysyväislinkit
TilaJulkaistu - 2024
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaInternational Conference on Web Engineering - Tampere, Suomi
Kesto: 17 kesäk. 202420 kesäk. 2024

Julkaisusarja

NimiLecture Notes in Computer Science
Vuosikerta14629 LNCS
ISSN (painettu)0302-9743
ISSN (elektroninen)1611-3349

Conference

ConferenceInternational Conference on Web Engineering
Maa/AlueSuomi
KaupunkiTampere
Ajanjakso17/06/2420/06/24

Julkaisufoorumi-taso

  • Jufo-taso 1

!!ASJC Scopus subject areas

  • Theoretical Computer Science
  • Yleinen tietojenkäsittelytiede

Sormenjälki

Sukella tutkimusaiheisiin 'TraQuLA: Transparent Question Answering Over RDF Through Linguistic Analysis'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä