An Email based Offline Download Manager for Large Distributed File System using Hadoop MapReduce Framework

International Journal of Computer Science and Engineering
© 2014 by SSRG - IJCSE Journal
Volume 1 Issue 10
Year of Publication : 2014
Authors : Pradeep H K, Rohitaksha K, Abhilash C B

pdf
How to Cite?

Pradeep H K, Rohitaksha K, Abhilash C B, "An Email based Offline Download Manager for Large Distributed File System using Hadoop MapReduce Framework," SSRG International Journal of Computer Science and Engineering , vol. 1,  no. 10, pp. 1-5, 2014. Crossref, https://doi.org/10.14445/23488387/IJCSE-V1I10P110

Abstract:

People are using P2P (Peer to Peer) network for sharing and transferring digital content contains video, audio, and any other data files over the internet from different part of the world. The general peer to peer file sharing protocols was designed to work optimally in the case that all the peers have an end node on the internet. The end peers should capable of delivering the contents in proper way with limited time constraint. To solve this problem, a method is proposed a new approach that will be implemented using email’s as a medium for data transfer and load balancing with the help of Hadoop- MapReduce framework and moreover if we use systems like Gmail and Yahoo then most of the mails would be transferred internally and with more efficiency, thus improving the overall efficiency of the internet. MapReduce is a framework which is pioneered by Goggle for distributed programming. It includes user specified Map and Reduce functions which process inputs in the form of key/value pairs. Along with the MapReduce paradigm, Hadoop also implements HDFS which is known as Hadoop distributed file system.

Keywords:

Peer to peer (P2P), Hadoop, MapReduce, HDFS

References:

[1] MailZoro Email Based P2P File Sharing. Ajit D Dhiwal1, Sudip Gautam2, Akshay K Singh3, Vijay K. Chaurasiya 4 IIITAllahabad, India, 211011. 
[2] A Scalable Two-Phase Top-Down Specialization Approach for DataAnonymization using MapReduce on Cloud Xuyun Zhang, Laurence T. Yang, Senior Member, IEEE, Chang Liu, Jinjun Chen, Member, IEEE. 
[3] MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat. 
[4] The Hadoop Distributed File System Konstantin Shvachko, Hairong Kuang, Sanjay Radia, Robert Chansler Yahoo!Sunnyvale, California USA. 
[5] The Gnutella Protocol Specification v0.4 Clip2 Distributed Search Services. 
[6] THE BITTORRENT P2P FILE-SHARING SYSTEM: MEASUREMENTS AND ANALYSIS J.A. Pouwelse, P. Garbacki, D.H.J. Epema, H.J. Sips Department of Computer Science, Delft University of Technology, the Netherlands. 
[7] Parallel & distributed processing (ipdps), 2010 ieee international symposium on mapreduce programming with apache hadoop bhandarkar, m. ; yahoo! Inc., hadoop solutions architect. 
[8] Big Data analytics Singh, S. ; Bus. Analytics Div., IBM India Software Lab. (ISL), Pune, India ; Singh, N.