Fuzzy Data Deduplication at Edge Nodes in Connected Environments

The Internet of Things (IoT) is ushering-in the era of connected environments, i.e., networks of physical objects that are embedded with sensors and softwar, connecting and exchanging data with other devices and systems. The huge amount of data produced by such systems calls for solutions to reduce...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Yakhni, Sylvana (author)
مؤلفون آخرون: Tekli, Joe (author), Chbeir, Richard (author)
التنسيق: conferenceObject
منشور في: 2023
الموضوعات:
الوصول للمادة أونلاين:http://hdl.handle.net/10725/16294
https://doi.org/10.1007/978-3-031-39764-6_8
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://link.springer.com/chapter/10.1007/978-3-031-39764-6_8
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
_version_ 1864513472553811968
author Yakhni, Sylvana
author2 Tekli, Joe
Chbeir, Richard
author2_role author
author
author_facet Yakhni, Sylvana
Tekli, Joe
Chbeir, Richard
author_role author
dc.contributor.none.fl_str_mv Younas, Muhammad
Awan, Irfan
Grønli, Tor-Morten
dc.creator.none.fl_str_mv Yakhni, Sylvana
Tekli, Joe
Chbeir, Richard
dc.date.none.fl_str_mv 2023
2023-08-03
2024-11-13T08:27:56Z
2024-11-13T08:27:56Z
dc.identifier.none.fl_str_mv 9783031397646
http://hdl.handle.net/10725/16294
https://doi.org/10.1007/978-3-031-39764-6_8
Yakhni, S., Tekli, J., Mansour, E., & Chbeir, R. (2023, August). Fuzzy Data Deduplication at Edge Nodes in Connected Environments. In International Conference on Mobile Web and Intelligent Information Systems (pp. 112-128). Cham: Springer Nature Switzerland.
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://link.springer.com/chapter/10.1007/978-3-031-39764-6_8
dc.language.none.fl_str_mv en
dc.publisher.none.fl_str_mv Springer
dc.relation.none.fl_str_mv Lecture Notes in Computer Science
LNCS 13977
dc.rights.*.fl_str_mv info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Ambient intelligence -- Congresses
Mobile computing -- Congresses
dc.title.none.fl_str_mv Fuzzy Data Deduplication at Edge Nodes in Connected Environments
dc.type.none.fl_str_mv Conference Paper / Proceeding
info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/conferenceObject
description The Internet of Things (IoT) is ushering-in the era of connected environments, i.e., networks of physical objects that are embedded with sensors and softwar, connecting and exchanging data with other devices and systems. The huge amount of data produced by such systems calls for solutions to reduce the amount of data being handled and transmitted over the network. In this study, we investigate data deduplication as a prominent pre-processing method that can address such a challenge. Data deduplication techniques have been traditionally developed for data storage and data warehousing applications, and aim at identifying and eliminating redundant data items. Few recent approaches have been designed for sensor networks and connected environments, yet existing solutions mostly rely on crisp thresholds and provide minimum-to-no expert control over the deduplication process, disregarding the domain expert’s needs in defining redundancy. In this study, we propose a new approach for Fuzzy Redundancy Elimination for Data Deduplication in a connected environment. We use simple natural language rules to represent domain knowledge and expert preferences regarding data duplication boundaries. We then apply pattern codes and fuzzy reasoning to detect duplicate data items at the outer-most edge (sensor node) level of the network. This reduces the time required to hard-code the deduplication process, while adapting to the domain expert’s needs for different data sources and applications. Experiments on a real-world dataset highlight our solutions’ potential and improvement compared with existing solutions.
eu_rights_str_mv openAccess
format conferenceObject
id LAURepo_81f53f14cf5bd44d4c9dbf7764f1143e
identifier_str_mv 9783031397646
Yakhni, S., Tekli, J., Mansour, E., & Chbeir, R. (2023, August). Fuzzy Data Deduplication at Edge Nodes in Connected Environments. In International Conference on Mobile Web and Intelligent Information Systems (pp. 112-128). Cham: Springer Nature Switzerland.
language_invalid_str_mv en
network_acronym_str LAURepo
network_name_str Lebanese American University repository
oai_identifier_str oai:laur.lau.edu.lb:10725/16294
publishDate 2023
publisher.none.fl_str_mv Springer
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
spelling Fuzzy Data Deduplication at Edge Nodes in Connected EnvironmentsYakhni, SylvanaTekli, JoeChbeir, RichardAmbient intelligence -- CongressesMobile computing -- CongressesThe Internet of Things (IoT) is ushering-in the era of connected environments, i.e., networks of physical objects that are embedded with sensors and softwar, connecting and exchanging data with other devices and systems. The huge amount of data produced by such systems calls for solutions to reduce the amount of data being handled and transmitted over the network. In this study, we investigate data deduplication as a prominent pre-processing method that can address such a challenge. Data deduplication techniques have been traditionally developed for data storage and data warehousing applications, and aim at identifying and eliminating redundant data items. Few recent approaches have been designed for sensor networks and connected environments, yet existing solutions mostly rely on crisp thresholds and provide minimum-to-no expert control over the deduplication process, disregarding the domain expert’s needs in defining redundancy. In this study, we propose a new approach for Fuzzy Redundancy Elimination for Data Deduplication in a connected environment. We use simple natural language rules to represent domain knowledge and expert preferences regarding data duplication boundaries. We then apply pattern codes and fuzzy reasoning to detect duplicate data items at the outer-most edge (sensor node) level of the network. This reduces the time required to hard-code the deduplication process, while adapting to the domain expert’s needs for different data sources and applications. Experiments on a real-world dataset highlight our solutions’ potential and improvement compared with existing solutions.1 online resource (xiii, 280 pages) : illustrations (some color)Includes bibliographical references.SpringerYounas, MuhammadAwan, IrfanGrønli, Tor-Morten2024-11-13T08:27:56Z2024-11-13T08:27:56Z20232023-08-03Conference Paper / Proceedinginfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject9783031397646http://hdl.handle.net/10725/16294https://doi.org/10.1007/978-3-031-39764-6_8Yakhni, S., Tekli, J., Mansour, E., & Chbeir, R. (2023, August). Fuzzy Data Deduplication at Edge Nodes in Connected Environments. In International Conference on Mobile Web and Intelligent Information Systems (pp. 112-128). Cham: Springer Nature Switzerland.http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.phphttps://link.springer.com/chapter/10.1007/978-3-031-39764-6_8enLecture Notes in Computer ScienceLNCS 13977info:eu-repo/semantics/openAccessoai:laur.lau.edu.lb:10725/162942024-11-13T08:58:26Z
spellingShingle Fuzzy Data Deduplication at Edge Nodes in Connected Environments
Yakhni, Sylvana
Ambient intelligence -- Congresses
Mobile computing -- Congresses
status_str publishedVersion
title Fuzzy Data Deduplication at Edge Nodes in Connected Environments
title_full Fuzzy Data Deduplication at Edge Nodes in Connected Environments
title_fullStr Fuzzy Data Deduplication at Edge Nodes in Connected Environments
title_full_unstemmed Fuzzy Data Deduplication at Edge Nodes in Connected Environments
title_short Fuzzy Data Deduplication at Edge Nodes in Connected Environments
title_sort Fuzzy Data Deduplication at Edge Nodes in Connected Environments
topic Ambient intelligence -- Congresses
Mobile computing -- Congresses
url http://hdl.handle.net/10725/16294
https://doi.org/10.1007/978-3-031-39764-6_8
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://link.springer.com/chapter/10.1007/978-3-031-39764-6_8