Hadoop Extensions for Distributed Computing on Reconfigurable Active SSD Clusters

In this article, we propose new extensions to Hadoop to enable clusters of reconfigurable active solid-state drives (RASSDs) to process streaming data from SSDs using FPGAs. We also develop an analytical model to estimate the performance of RASSD clusters running under Hadoop. Using the Hadoop RASSD...

Full description

Saved in:
Bibliographic Details
Main Author: Kaitoua, Abdul Rahman (author)
Other Authors: Hajj, Hazem (author), Saghir, Mazen A.R. (author), Artail, Hassan (author), Akkary, Haitham (author), Awad, Mariette (author), Sharafeddine, Mageda (author), Mershad, Khaleel (author)
Format: article
Published: 2014
Online Access:http://hdl.handle.net/10725/15388
https://doi.org/10.1145/2608199
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://dl.acm.org/doi/abs/10.1145/2608199
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this article, we propose new extensions to Hadoop to enable clusters of reconfigurable active solid-state drives (RASSDs) to process streaming data from SSDs using FPGAs. We also develop an analytical model to estimate the performance of RASSD clusters running under Hadoop. Using the Hadoop RASSD platform and network simulators, we validate our design and demonstrate its impact on performance for different workloads taken from Stanford's Phoenix MapReduce project. Our results show that for a hardware acceleration factor of 20×, compute-intensive workloads processing 153MB of data can run up to 11× faster than a standard Hadoop cluster.