Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying

Many efforts have been deployed by the IR community to extend free-text query processing toward semi-structured XML search. Most methods rely on the concept of Lowest Common Ancestor (LCA) between two or multiple structural nodes to identify the most specific XML elements containing query keywords p...

Bibliographic Details
Main Author: Tekli, Joe (author)
Other Authors: Tekli, Gilbert (author), Chbeir, Richard (author)
Format: article
Published: 2023
Online Access: http://hdl.handle.net/10725/15998
https://doi.org/10.2298/CSIS220228063T
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://doiserbia.nb.rs/Article.aspx?id=1820-02142200063T
_version_ 1864513471848120320
author Tekli, Joe
author2 Tekli, Gilbert
Chbeir, Richard
author2_role author
author
author_facet Tekli, Joe
Tekli, Gilbert
Chbeir, Richard
author_role author
dc.creator.none.fl_str_mv Tekli, Joe
Tekli, Gilbert
Chbeir, Richard
dc.date.none.fl_str_mv 2023
2023
2024-08-20T10:47:53Z
2024-08-20T10:47:53Z
dc.identifier.none.fl_str_mv 2406-1018
http://hdl.handle.net/10725/15998
https://doi.org/10.2298/CSIS220228063T
Tekli, J., Tekli, G., & Chbeir, R. (2023). Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying. Computer Science and Information Systems, 20(1), 423-457.
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://doiserbia.nb.rs/Article.aspx?id=1820-02142200063T
dc.language.none.fl_str_mv en
dc.relation.none.fl_str_mv Computer Science and Information Systems
dc.rights.*.fl_str_mv info:eu-repo/semantics/openAccess
dc.title.none.fl_str_mv Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying
dc.type.none.fl_str_mv Article
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
description Many efforts have been deployed by the IR community to extend free-text query processing toward semi-structured XML search. Most methods rely on the concept of Lowest Common Ancestor (LCA) between two or more structural nodes to identify the most specific XML elements containing the query keywords posted by the user. Yet few of the existing approaches consider XML semantics, and the methods that do process semantics generally either rely on computationally expensive word sense disambiguation (WSD) techniques, or apply semantic analysis in one stage only (performing query relaxation/refinement over the bag-of-words retrieval model) to reduce processing time. In this paper, we describe a new approach for XML keyword search that aims to address these limitations. Our solution first transforms the XML document collection (offline) and the keyword query (on-the-fly) into meaningful semantic representations using context-based and global disambiguation methods, specifically designed to allow almost linear computation efficiency. We use a semantic-aware inverted index to support semantic-aware search, result selection, and result ranking. The semantically augmented XML data tree is processed for structural node clustering, based on semantic query concepts (i.e., key-concepts), in order to identify and rank candidate answer sub-trees containing related occurrences of query key-concepts. Dedicated weighting functions and various search algorithms have been developed for that purpose and are presented here. Experimental results highlight the quality and potential of our approach.
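The Lowest Common Ancestor notion mentioned in the description can be illustrated with a minimal sketch. This is not the authors' semantic-aware method: it only shows plain keyword-based LCA lookup over an inverted index, assuming nodes are identified by Dewey-style labels (a tuple of child positions from the root). The index contents, the keywords, and the function names `lca`, `candidate_lcas`, and `most_specific` are hypothetical.

```python
from itertools import product
from typing import Dict, List, Optional, Sequence, Set, Tuple

# Assumption: each XML node is identified by a Dewey-style label,
# i.e. the sequence of child positions on the path from the root.
Dewey = Tuple[int, ...]


def lca(labels: Sequence[Dewey]) -> Dewey:
    """Return the label of the lowest common ancestor of the given nodes,
    which is the longest common prefix of their Dewey labels."""
    if not labels:
        raise ValueError("need at least one node label")
    prefix = labels[0]
    for label in labels[1:]:
        n = 0
        for a, b in zip(prefix, label):
            if a != b:
                break
            n += 1
        prefix = prefix[:n]
    return prefix


# Hypothetical toy inverted index: keyword -> labels of nodes containing it.
INDEX: Dict[str, List[Dewey]] = {
    "kelly": [(0, 0, 1), (0, 2, 1)],
    "director": [(0, 0, 2)],
}


def candidate_lcas(keywords: Sequence[str]) -> Set[Dewey]:
    """Compute the LCA of every combination that picks one node per keyword."""
    postings = [INDEX.get(k, []) for k in keywords]
    if any(not p for p in postings):
        return set()  # some keyword does not occur anywhere
    return {lca(combo) for combo in product(*postings)}


def most_specific(keywords: Sequence[str]) -> Optional[Dewey]:
    """Pick the deepest candidate, i.e. the most specific enclosing element."""
    candidates = candidate_lcas(keywords)
    return max(candidates, key=len) if candidates else None
```

In this toy index, `most_specific(["kelly", "director"])` selects the element labelled `(0, 0)`, since it is the deepest node enclosing one occurrence of each keyword. Enumerating all combinations is exponential in the number of keywords and is used here only to keep the sketch short.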
eu_rights_str_mv openAccess
format article
id LAURepo_cd181e1fdc0fab5a5b64285ef138d1f7
identifier_str_mv 2406-1018
Tekli, J., Tekli, G., & Chbeir, R. (2023). Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying. Computer Science and Information Systems, 20(1), 423-457.
language_invalid_str_mv en
network_acronym_str LAURepo
network_name_str Lebanese American University repository
oai_identifier_str oai:laur.lau.edu.lb:10725/15998
publishDate 2023
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
status_str publishedVersion
title Combining offline and on-the-fly disambiguation to perform semantic-aware XML querying