Fuzzy Data Deduplication at Edge Nodes in Connected Environments
The Internet of Things (IoT) is ushering-in the era of connected environments, i.e., networks of physical objects that are embedded with sensors and softwar, connecting and exchanging data with other devices and systems. The huge amount of data produced by such systems calls for solutions to reduce...
محفوظ في:
| المؤلف الرئيسي: | |
|---|---|
| مؤلفون آخرون: | , |
| التنسيق: | conferenceObject |
| منشور في: |
2023
|
| الموضوعات: | |
| الوصول للمادة أونلاين: | http://hdl.handle.net/10725/16294 https://doi.org/10.1007/978-3-031-39764-6_8 http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php https://link.springer.com/chapter/10.1007/978-3-031-39764-6_8 |
| الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
| _version_ | 1864513472553811968 |
|---|---|
| author | Yakhni, Sylvana |
| author2 | Tekli, Joe Chbeir, Richard |
| author2_role | author author |
| author_facet | Yakhni, Sylvana Tekli, Joe Chbeir, Richard |
| author_role | author |
| dc.contributor.none.fl_str_mv | Younas, Muhammad Awan, Irfan Grønli, Tor-Morten |
| dc.creator.none.fl_str_mv | Yakhni, Sylvana Tekli, Joe Chbeir, Richard |
| dc.date.none.fl_str_mv | 2023 2023-08-03 2024-11-13T08:27:56Z 2024-11-13T08:27:56Z |
| dc.identifier.none.fl_str_mv | 9783031397646 http://hdl.handle.net/10725/16294 https://doi.org/10.1007/978-3-031-39764-6_8 Yakhni, S., Tekli, J., Mansour, E., & Chbeir, R. (2023, August). Fuzzy Data Deduplication at Edge Nodes in Connected Environments. In International Conference on Mobile Web and Intelligent Information Systems (pp. 112-128). Cham: Springer Nature Switzerland. http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php https://link.springer.com/chapter/10.1007/978-3-031-39764-6_8 |
| dc.language.none.fl_str_mv | en |
| dc.publisher.none.fl_str_mv | Springer |
| dc.relation.none.fl_str_mv | Lecture Notes in Computer Science LNCS 13977 |
| dc.rights.*.fl_str_mv | info:eu-repo/semantics/openAccess |
| dc.subject.none.fl_str_mv | Ambient intelligence -- Congresses Mobile computing -- Congresses |
| dc.title.none.fl_str_mv | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| dc.type.none.fl_str_mv | Conference Paper / Proceeding info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/conferenceObject |
| description | The Internet of Things (IoT) is ushering-in the era of connected environments, i.e., networks of physical objects that are embedded with sensors and softwar, connecting and exchanging data with other devices and systems. The huge amount of data produced by such systems calls for solutions to reduce the amount of data being handled and transmitted over the network. In this study, we investigate data deduplication as a prominent pre-processing method that can address such a challenge. Data deduplication techniques have been traditionally developed for data storage and data warehousing applications, and aim at identifying and eliminating redundant data items. Few recent approaches have been designed for sensor networks and connected environments, yet existing solutions mostly rely on crisp thresholds and provide minimum-to-no expert control over the deduplication process, disregarding the domain expert’s needs in defining redundancy. In this study, we propose a new approach for Fuzzy Redundancy Elimination for Data Deduplication in a connected environment. We use simple natural language rules to represent domain knowledge and expert preferences regarding data duplication boundaries. We then apply pattern codes and fuzzy reasoning to detect duplicate data items at the outer-most edge (sensor node) level of the network. This reduces the time required to hard-code the deduplication process, while adapting to the domain expert’s needs for different data sources and applications. Experiments on a real-world dataset highlight our solutions’ potential and improvement compared with existing solutions. |
| eu_rights_str_mv | openAccess |
| format | conferenceObject |
| id | LAURepo_81f53f14cf5bd44d4c9dbf7764f1143e |
| identifier_str_mv | 9783031397646 Yakhni, S., Tekli, J., Mansour, E., & Chbeir, R. (2023, August). Fuzzy Data Deduplication at Edge Nodes in Connected Environments. In International Conference on Mobile Web and Intelligent Information Systems (pp. 112-128). Cham: Springer Nature Switzerland. |
| language_invalid_str_mv | en |
| network_acronym_str | LAURepo |
| network_name_str | Lebanese American University repository |
| oai_identifier_str | oai:laur.lau.edu.lb:10725/16294 |
| publishDate | 2023 |
| publisher.none.fl_str_mv | Springer |
| repository.mail.fl_str_mv | |
| repository.name.fl_str_mv | |
| repository_id_str | |
| spelling | Fuzzy Data Deduplication at Edge Nodes in Connected EnvironmentsYakhni, SylvanaTekli, JoeChbeir, RichardAmbient intelligence -- CongressesMobile computing -- CongressesThe Internet of Things (IoT) is ushering-in the era of connected environments, i.e., networks of physical objects that are embedded with sensors and softwar, connecting and exchanging data with other devices and systems. The huge amount of data produced by such systems calls for solutions to reduce the amount of data being handled and transmitted over the network. In this study, we investigate data deduplication as a prominent pre-processing method that can address such a challenge. Data deduplication techniques have been traditionally developed for data storage and data warehousing applications, and aim at identifying and eliminating redundant data items. Few recent approaches have been designed for sensor networks and connected environments, yet existing solutions mostly rely on crisp thresholds and provide minimum-to-no expert control over the deduplication process, disregarding the domain expert’s needs in defining redundancy. In this study, we propose a new approach for Fuzzy Redundancy Elimination for Data Deduplication in a connected environment. We use simple natural language rules to represent domain knowledge and expert preferences regarding data duplication boundaries. We then apply pattern codes and fuzzy reasoning to detect duplicate data items at the outer-most edge (sensor node) level of the network. This reduces the time required to hard-code the deduplication process, while adapting to the domain expert’s needs for different data sources and applications. Experiments on a real-world dataset highlight our solutions’ potential and improvement compared with existing solutions.1 online resource (xiii, 280 pages) : illustrations (some color)Includes bibliographical references.SpringerYounas, MuhammadAwan, IrfanGrønli, Tor-Morten2024-11-13T08:27:56Z2024-11-13T08:27:56Z20232023-08-03Conference Paper / Proceedinginfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject9783031397646http://hdl.handle.net/10725/16294https://doi.org/10.1007/978-3-031-39764-6_8Yakhni, S., Tekli, J., Mansour, E., & Chbeir, R. (2023, August). Fuzzy Data Deduplication at Edge Nodes in Connected Environments. In International Conference on Mobile Web and Intelligent Information Systems (pp. 112-128). Cham: Springer Nature Switzerland.http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.phphttps://link.springer.com/chapter/10.1007/978-3-031-39764-6_8enLecture Notes in Computer ScienceLNCS 13977info:eu-repo/semantics/openAccessoai:laur.lau.edu.lb:10725/162942024-11-13T08:58:26Z |
| spellingShingle | Fuzzy Data Deduplication at Edge Nodes in Connected Environments Yakhni, Sylvana Ambient intelligence -- Congresses Mobile computing -- Congresses |
| status_str | publishedVersion |
| title | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| title_full | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| title_fullStr | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| title_full_unstemmed | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| title_short | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| title_sort | Fuzzy Data Deduplication at Edge Nodes in Connected Environments |
| topic | Ambient intelligence -- Congresses Mobile computing -- Congresses |
| url | http://hdl.handle.net/10725/16294 https://doi.org/10.1007/978-3-031-39764-6_8 http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php https://link.springer.com/chapter/10.1007/978-3-031-39764-6_8 |