An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis

The rapid growth in the number of scholarly documents on the Web and in other digital platforms makes it challenging for researchers to find research publications most relevant to their information needs. This challenge has been mitigated to a greater extent by the major...

Bibliographic Details
Main Author: Shah Khalid (15164202) (author)
Other Authors: Shengli Wu (641329) (author), Abdul Wahid (4395094) (author), Aftab Alam (5158601) (author), Irfan Ullah (847820) (author)
Published: 2021
_version_ 1864513505678327808
author Shah Khalid (15164202)
author2 Shengli Wu (641329)
Abdul Wahid (4395094)
Aftab Alam (5158601)
Irfan Ullah (847820)
author2_role author
author
author
author
author_facet Shah Khalid (15164202)
Shengli Wu (641329)
Abdul Wahid (4395094)
Aftab Alam (5158601)
Irfan Ullah (847820)
author_role author
dc.creator.none.fl_str_mv Shah Khalid (15164202)
Shengli Wu (641329)
Abdul Wahid (4395094)
Aftab Alam (5158601)
Irfan Ullah (847820)
dc.date.none.fl_str_mv 2021-08-25T06:00:00Z
dc.identifier.none.fl_str_mv 10.1109/access.2021.3107939
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/An_Effective_Scholarly_Search_by_Combining_Inverted_Indices_and_Structured_Search_With_Citation_Networks_Analysis/26984161
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Information and computing sciences
Data management and data science
Semantics
Indexes
Internet
Search problems
Search engines
Computer science
Task analysis
Academic search
knowledge graph
inverted index
structure search
citation networks analysis
dc.title.none.fl_str_mv An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">The rapid growth in the number of scholarly documents on the Web and on other digital platforms makes it challenging for researchers to find the research publications most relevant to their information needs. This challenge has been mitigated to a great extent by the major scholarly retrieval systems, such as Google Scholar, Semantic Scholar, PubMed, CiteSeerX, and others. The success of these retrieval solutions lies in advances in ranking approaches. However, existing studies indicate that these methods are still far from their effectiveness ceiling, leaving ample room for further improvement to meet the scholarly needs of users. The existing methods adopt different approaches: some use classical Information Retrieval (IR), while others use semantics-aware methods, including Knowledge Graphs (KGs), to support scholarly search. We hypothesize that combining the best of both worlds can further improve search relevance. In this context, this work uses an inverted index from classical IR with BM25 as the weighting scheme, combined with Citation Networks Analysis (CNA), to produce the baseline search results, which are then re-ranked by passing selected entities from the top-k initial search results as the search query to the KG. This way, not only the textual content but also the structural semantics of the research publications are exploited in the retrieval process. The goal is to exploit IR and KG-based retrieval techniques to gain insights into the behavior of both textual and structured information in the strategic ranking of scholarly articles. The proposed solution has been evaluated using the ACL Anthology Network (AAN) dataset. 
The results show that the proposed technique can improve retrieval performance in terms of Normalized Discounted Cumulative Gain (nDCG) and precision.</p><h2>Other Information</h2><p dir="ltr">Published in: IEEE Access<br>License: <a href="https://creativecommons.org/licenses/by/4.0/" rel="noreferrer" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1109/access.2021.3107939" target="_blank">https://dx.doi.org/10.1109/access.2021.3107939</a></p>
eu_rights_str_mv openAccess
id Manara2_bc1b474607a008058a064d0e620499ff
identifier_str_mv 10.1109/access.2021.3107939
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/26984161
publishDate 2021
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spellingShingle An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
Shah Khalid (15164202)
Information and computing sciences
Data management and data science
Semantics
Indexes
Internet
Search problems
Search engines
Computer science
Task analysis
Academic search
knowledge graph
inverted index
structure search
citation networks analysis
status_str publishedVersion
title An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
title_full An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
title_fullStr An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
title_full_unstemmed An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
title_short An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
title_sort An Effective Scholarly Search by Combining Inverted Indices and Structured Search With Citation Networks Analysis
topic Information and computing sciences
Data management and data science
Semantics
Indexes
Internet
Search problems
Search engines
Computer science
Task analysis
Academic search
knowledge graph
inverted index
structure search
citation networks analysis