Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data

<p dir="ltr">In the current ongoing crisis, people mostly rely on mobile phones for all the activities, but query analysis and mobile data security are major issues. Several research works have been made on efficient detection of antipatterns for minimizing the complexity of query an...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Rajesh Kumar Dhanaraj (19646269) (author)
مؤلفون آخرون: Vinothsaravanan Ramakrishnan (19646272) (author), M. Poongodi (14158869) (author), Lalitha Krishnasamy (19646275) (author), Mounir Hamdi (14150652) (author), Ketan Kotecha (11272198) (author), V. Vijayakumar (9544795) (author)
منشور في: 2021
الموضوعات:
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1864513505674133504
author Rajesh Kumar Dhanaraj (19646269)
author2 Vinothsaravanan Ramakrishnan (19646272)
M. Poongodi (14158869)
Lalitha Krishnasamy (19646275)
Mounir Hamdi (14150652)
Ketan Kotecha (11272198)
V. Vijayakumar (9544795)
author2_role author
author
author
author
author
author
author_facet Rajesh Kumar Dhanaraj (19646269)
Vinothsaravanan Ramakrishnan (19646272)
M. Poongodi (14158869)
Lalitha Krishnasamy (19646275)
Mounir Hamdi (14150652)
Ketan Kotecha (11272198)
V. Vijayakumar (9544795)
author_role author
dc.creator.none.fl_str_mv Rajesh Kumar Dhanaraj (19646269)
Vinothsaravanan Ramakrishnan (19646272)
M. Poongodi (14158869)
Lalitha Krishnasamy (19646275)
Mounir Hamdi (14150652)
Ketan Kotecha (11272198)
V. Vijayakumar (9544795)
dc.date.none.fl_str_mv 2021-11-24T03:00:00Z
dc.identifier.none.fl_str_mv 10.1155/2021/2730246
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Random_Forest_Bagging_and_X_Means_Clustered_Antipattern_Detection_from_SQL_Query_Log_for_Accessing_Secure_Mobile_Data/26984272
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Information and computing sciences
Data management and data science
Machine learning
Mobile Data Security
Query Analysis
Antipattern Detection
Clustering Process
Design Errors
Random Forest
Bagging
dc.title.none.fl_str_mv Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description <p dir="ltr">In the current ongoing crisis, people mostly rely on mobile phones for all the activities, but query analysis and mobile data security are major issues. Several research works have been made on efficient detection of antipatterns for minimizing the complexity of query analysis. However, more focus needs to be given to the accuracy aspect. In addition, for grouping similar antipatterns, a clustering process was performed to eradicate the design errors. To address the above‐said issues and further enhance the antipattern detection accuracy with minimum time and false positive rate, in this work, Random Forest Bagging X‐means SQL Query Clustering (RFBXSQLQC) technique is proposed. Different patterns or queries are initially gathered from the input SQL query log, and bootstrap samples are created. Then, for each pattern, various weak clusters are constructed via X‐means clustering and are utilized as the weak learner (clusters). During this process, the input patterns are categorized into different clusters. Using the Bayesian information criterion, the similarity measure is employed to evaluate the similarity between the patterns and cluster weight. Based on the similarity value, patterns are assigned to either relevant or irrelevant groups. The weak learner results are aggregated to form strong clusters, and, with the aid of voting, a majority vote is considered for designing strong clusters with minimum time. Experiments are conducted to evaluate the performance of the RFBXSQLQC technique using the IIT Bombay dataset using the metrics like antipattern detection accuracy, time complexity, false-positive rate, and computational overhead with respect to the differing number of queries. The results revealed that the RFBXSQLQC technique outperforms the existing algorithms by 19% with pattern detection accuracy, 34% minimized time complexity, 64% false-positive rate, and 31% in terms of computational overhead.</p><h2>Other Information</h2><p dir="ltr">Published in: Wireless Communications and Mobile Computing<br>License: <a href="http://creativecommons.org/licenses/by/4.0/" target="_blank">http://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1155/2021/2730246" target="_blank">https://dx.doi.org/10.1155/2021/2730246</a></p>
eu_rights_str_mv openAccess
id Manara2_c55d3fb95a56e73fac95bd9eefffda41
identifier_str_mv 10.1155/2021/2730246
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/26984272
publishDate 2021
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spelling Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile DataRajesh Kumar Dhanaraj (19646269)Vinothsaravanan Ramakrishnan (19646272)M. Poongodi (14158869)Lalitha Krishnasamy (19646275)Mounir Hamdi (14150652)Ketan Kotecha (11272198)V. Vijayakumar (9544795)Information and computing sciencesData management and data scienceMachine learningMobile Data SecurityQuery AnalysisAntipattern DetectionClustering ProcessDesign ErrorsRandom ForestBagging<p dir="ltr">In the current ongoing crisis, people mostly rely on mobile phones for all the activities, but query analysis and mobile data security are major issues. Several research works have been made on efficient detection of antipatterns for minimizing the complexity of query analysis. However, more focus needs to be given to the accuracy aspect. In addition, for grouping similar antipatterns, a clustering process was performed to eradicate the design errors. To address the above‐said issues and further enhance the antipattern detection accuracy with minimum time and false positive rate, in this work, Random Forest Bagging X‐means SQL Query Clustering (RFBXSQLQC) technique is proposed. Different patterns or queries are initially gathered from the input SQL query log, and bootstrap samples are created. Then, for each pattern, various weak clusters are constructed via X‐means clustering and are utilized as the weak learner (clusters). During this process, the input patterns are categorized into different clusters. Using the Bayesian information criterion, the similarity measure is employed to evaluate the similarity between the patterns and cluster weight. Based on the similarity value, patterns are assigned to either relevant or irrelevant groups. The weak learner results are aggregated to form strong clusters, and, with the aid of voting, a majority vote is considered for designing strong clusters with minimum time. Experiments are conducted to evaluate the performance of the RFBXSQLQC technique using the IIT Bombay dataset using the metrics like antipattern detection accuracy, time complexity, false-positive rate, and computational overhead with respect to the differing number of queries. The results revealed that the RFBXSQLQC technique outperforms the existing algorithms by 19% with pattern detection accuracy, 34% minimized time complexity, 64% false-positive rate, and 31% in terms of computational overhead.</p><h2>Other Information</h2><p dir="ltr">Published in: Wireless Communications and Mobile Computing<br>License: <a href="http://creativecommons.org/licenses/by/4.0/" target="_blank">http://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1155/2021/2730246" target="_blank">https://dx.doi.org/10.1155/2021/2730246</a></p>2021-11-24T03:00:00ZTextJournal contributioninfo:eu-repo/semantics/publishedVersiontextcontribution to journal10.1155/2021/2730246https://figshare.com/articles/journal_contribution/Random_Forest_Bagging_and_X_Means_Clustered_Antipattern_Detection_from_SQL_Query_Log_for_Accessing_Secure_Mobile_Data/26984272CC BY 4.0info:eu-repo/semantics/openAccessoai:figshare.com:article/269842722021-11-24T03:00:00Z
spellingShingle Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
Rajesh Kumar Dhanaraj (19646269)
Information and computing sciences
Data management and data science
Machine learning
Mobile Data Security
Query Analysis
Antipattern Detection
Clustering Process
Design Errors
Random Forest
Bagging
status_str publishedVersion
title Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
title_full Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
title_fullStr Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
title_full_unstemmed Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
title_short Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
title_sort Random Forest Bagging and X‐Means Clustered Antipattern Detection from SQL Query Log for Accessing Secure Mobile Data
topic Information and computing sciences
Data management and data science
Machine learning
Mobile Data Security
Query Analysis
Antipattern Detection
Clustering Process
Design Errors
Random Forest
Bagging