Building semantic trees from XML documents

The distributed nature of the Web, as a decentralized system exchanging information between heterogeneous sources, has underlined the need to manage interoperability, i.e., the ability to automatically interpret information in Web documents exchanged between different sources, necessary for efficien...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Tekli, Joe (author)
مؤلفون آخرون:	Charbel, Nathalie (author), Chbeir, Richard (author)
التنسيق:	article
منشور في:	2016
الوصول للمادة أونلاين:	http://hdl.handle.net/10725/5081 http://dx.doi.org/10.1016/j.websem.2016.03.002 http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php http://www.sciencedirect.com/science/article/pii/S1570826816000202
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

_version_	1864513465012453376
author	Tekli, Joe
author2	Charbel, Nathalie Chbeir, Richard
author2_role	author author
author_facet	Tekli, Joe Charbel, Nathalie Chbeir, Richard
author_role	author
dc.creator.none.fl_str_mv	Tekli, Joe Charbel, Nathalie Chbeir, Richard
dc.date.none.fl_str_mv	2016 2016-05-13 2017-01-27T08:12:32Z 2017-01-27T08:12:32Z
dc.identifier.none.fl_str_mv	1570-8268 http://hdl.handle.net/10725/5081 http://dx.doi.org/10.1016/j.websem.2016.03.002 Tekli, J., Charbel, N., & Chbeir, R. (2016). Building semantic trees from XML documents. Web Semantics: Science, Services and Agents on the World Wide Web, 37, 1-24. http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php http://www.sciencedirect.com/science/article/pii/S1570826816000202
dc.language.none.fl_str_mv	en
dc.relation.none.fl_str_mv	Journal of Web Semantics
dc.rights.*.fl_str_mv	info:eu-repo/semantics/openAccess
dc.title.none.fl_str_mv	Building semantic trees from XML documents
dc.type.none.fl_str_mv	Article info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article
description	The distributed nature of the Web, as a decentralized system exchanging information between heterogeneous sources, has underlined the need to manage interoperability, i.e., the ability to automatically interpret information in Web documents exchanged between different sources, necessary for efficient information management and search applications. In this context, XML was introduced as a data representation standard that simplifies the tasks of interoperation and integration among heterogeneous data sources, allowing to represent data in (semi-) structured documents consisting of hierarchically nested elements and atomic attributes. However, while XML was shown most effective in exchanging data, i.e., in syntactic interoperability, it has been proven limited when it comes to handling semantics, i.e., semantic interoperability, since it only specifies the syntactic and structural properties of the data without any further semantic meaning. As a result, XML semantic-aware processing has become a motivating challenge in Web data management, requiring dedicated semantic analysis and disambiguation methods to assign well-defined meaning to XML elements and attributes. In this context, most existing approaches: (i) ignore the problem of identifying ambiguous XML elements/nodes, (ii) only partially consider their structural relationships/context, (iii) use syntactic information in processing XML data regardless of the semantics involved, and (iv) are static in adopting fixed disambiguation constraints thus limiting user involvement. In this paper, we provide a new XML Semantic Disambiguation Framework titled XSDFdesigned to address each of the above limitations, taking as input: an XML document, and then producing as output a semantically augmented XML tree made of unambiguous semantic concepts extracted from a reference machine-readable semantic network. XSDF consists of four main modules for: (i) linguistic pre-processing of simple/compound XML node labels and values, (ii) selecting ambiguous XML nodes as targets for disambiguation, (iii) representing target nodes as special sphere neighborhood vectors including all XML structural relationships within a (user-chosen) range, and (iv) running context vectors through a hybrid disambiguation process, combining two approaches: concept-basedand context-based disambiguation, allowing the user to tune disambiguation parameters following her needs. Conducted experiments demonstrate the effectiveness and efficiency of our approach in comparison with alternative methods. We also discuss some practical applications of our method, ranging over semantic-aware query rewriting, semantic document clustering and classification, Mobile and Web services search and discovery, as well as blog analysis and event detection in social networks and tweets. © 2016 Elsevier B.V. All rights reserved.
eu_rights_str_mv	openAccess
format	article
id	LAURepo_0c81f0c2f0bdcc72d3c9bb930d97dcc6
identifier_str_mv	1570-8268 Tekli, J., Charbel, N., & Chbeir, R. (2016). Building semantic trees from XML documents. Web Semantics: Science, Services and Agents on the World Wide Web, 37, 1-24.
language_invalid_str_mv	en
network_acronym_str	LAURepo
network_name_str	Lebanese American University repository
oai_identifier_str	oai:laur.lau.edu.lb:10725/5081
publishDate	2016
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling	Building semantic trees from XML documentsTekli, JoeCharbel, NathalieChbeir, RichardThe distributed nature of the Web, as a decentralized system exchanging information between heterogeneous sources, has underlined the need to manage interoperability, i.e., the ability to automatically interpret information in Web documents exchanged between different sources, necessary for efficient information management and search applications. In this context, XML was introduced as a data representation standard that simplifies the tasks of interoperation and integration among heterogeneous data sources, allowing to represent data in (semi-) structured documents consisting of hierarchically nested elements and atomic attributes. However, while XML was shown most effective in exchanging data, i.e., in syntactic interoperability, it has been proven limited when it comes to handling semantics, i.e., semantic interoperability, since it only specifies the syntactic and structural properties of the data without any further semantic meaning. As a result, XML semantic-aware processing has become a motivating challenge in Web data management, requiring dedicated semantic analysis and disambiguation methods to assign well-defined meaning to XML elements and attributes. In this context, most existing approaches: (i) ignore the problem of identifying ambiguous XML elements/nodes, (ii) only partially consider their structural relationships/context, (iii) use syntactic information in processing XML data regardless of the semantics involved, and (iv) are static in adopting fixed disambiguation constraints thus limiting user involvement. In this paper, we provide a new XML Semantic Disambiguation Framework titled XSDFdesigned to address each of the above limitations, taking as input: an XML document, and then producing as output a semantically augmented XML tree made of unambiguous semantic concepts extracted from a reference machine-readable semantic network. XSDF consists of four main modules for: (i) linguistic pre-processing of simple/compound XML node labels and values, (ii) selecting ambiguous XML nodes as targets for disambiguation, (iii) representing target nodes as special sphere neighborhood vectors including all XML structural relationships within a (user-chosen) range, and (iv) running context vectors through a hybrid disambiguation process, combining two approaches: concept-basedand context-based disambiguation, allowing the user to tune disambiguation parameters following her needs. Conducted experiments demonstrate the effectiveness and efficiency of our approach in comparison with alternative methods. We also discuss some practical applications of our method, ranging over semantic-aware query rewriting, semantic document clustering and classification, Mobile and Web services search and discovery, as well as blog analysis and event detection in social networks and tweets. © 2016 Elsevier B.V. All rights reserved.PublishedN/A2017-01-27T08:12:32Z2017-01-27T08:12:32Z20162016-05-13Articleinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/article1570-8268http://hdl.handle.net/10725/5081http://dx.doi.org/10.1016/j.websem.2016.03.002Tekli, J., Charbel, N., & Chbeir, R. (2016). Building semantic trees from XML documents. Web Semantics: Science, Services and Agents on the World Wide Web, 37, 1-24.http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.phphttp://www.sciencedirect.com/science/article/pii/S1570826816000202enJournal of Web Semanticsinfo:eu-repo/semantics/openAccessoai:laur.lau.edu.lb:10725/50812024-08-09T08:55:44Z
spellingShingle	Building semantic trees from XML documents Tekli, Joe
status_str	publishedVersion
title	Building semantic trees from XML documents
title_full	Building semantic trees from XML documents
title_fullStr	Building semantic trees from XML documents
title_full_unstemmed	Building semantic trees from XML documents
title_short	Building semantic trees from XML documents
title_sort	Building semantic trees from XML documents
url	http://hdl.handle.net/10725/5081 http://dx.doi.org/10.1016/j.websem.2016.03.002 http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php http://www.sciencedirect.com/science/article/pii/S1570826816000202

Building semantic trees from XML documents

مواد مشابهة