TY - JOUR TI - Data Analysis Using R and Hadoop AU - Amit Rajbanshi AU - Birendra Kumar Sah AU - C. K. Raina JO - International Journal of Scientific Research in Computer Science, Engineering and Information Technology PB - Technoscience Academy DA - 2017/12/31 PY - 2017 DO - https://doi.org/10.32628/IJSRCSEIT UR - https://ijsrcseit.com/CSEIT1726297 VL - 2 IS - 6 SP - 1093 EP - 1097 AB - Analyzing and managing huge information may be very hard exploitation classical means like electronic data service management systems or desktop package package packages for statistics and image. Instead, huge information desires huge clusters with an entire heap or even thousands of computing nodes. Official statistics is progressively} considering huge information for clarification new statistics as a results of huge information sources would possibly manufacture additional relevant and timely statistics than ancient sources. one of the package package tools successfully and wide unfold used for storage and method of huge information sets on clusters of artefact hardware is Hadoop. Hadoop framework contains libraries, a distributed file-system (HDFS), and a resource-management platform and implements a version of the MapReduce programming model for big scale process. throughout this paper we've got an inclination to analyze the possibilities of integration Hadoop with R that would be a stylish package package used for applied mathematics computing and information image. we've got an inclination to gift three ways in which of integration them: R with Streaming, Rhipe and RHadoop which we have a tendency to emphasize the advantages and downsides of each answer.