
Friday, 8 June 2018

Twitter data analysis and visualizations using the R language on top of the Hadoop platform

Abstract:

The main objective of the work presented in this paper was to design and implement a system for Twitter data analysis and visualization in the R environment using big data processing technologies. Our focus was to leverage existing big data processing frameworks, with their storage and computational capabilities, to support analytical functions implemented in the R language. We decided to build the backend on top of the Apache Hadoop framework, using Hadoop HDFS as the distributed filesystem and MapReduce as the distributed computation paradigm. RHadoop packages were then used to connect the R environment to the processing layer and to design and implement the analytical functions in a distributed manner. Visualizations were implemented on top of the solution as an RShiny application.
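
To make the MapReduce paradigm mentioned in the abstract more concrete, here is a minimal sketch of one analysis a system like this might run: counting hashtags across tweets. It is written in plain Python and runs in memory on one machine. It is not the authors' RHadoop code and uses no Hadoop API. The function names (`map_tweet`, `shuffle`, `reduce_counts`, `hashtag_counts`) and the sample tweets are hypothetical and exist only for illustration.

```python
import re
from collections import defaultdict

# Assumption: a hashtag is '#' followed by word characters.
HASHTAG = re.compile(r"#(\w+)")


def map_tweet(tweet):
    """Map step: emit a (hashtag, 1) pair for every hashtag in one tweet."""
    return [(tag.lower(), 1) for tag in HASHTAG.findall(tweet)]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key.

    In a real cluster the framework does this between map and reduce.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: sum the counts collected for one hashtag."""
    return key, sum(values)


def hashtag_counts(tweets):
    """Run map, shuffle and reduce in sequence over a list of tweet texts."""
    pairs = [pair for tweet in tweets for pair in map_tweet(tweet)]
    return dict(reduce_counts(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample data, not taken from the paper.
    sample = [
        "Learning #Hadoop and #R today",
        "#hadoop cluster is up",
        "No tags in this one",
    ]
    print(hashtag_counts(sample))
```

In the setup the abstract describes, the map and reduce functions would instead be written in R and submitted to Hadoop through the RHadoop packages, with the tweets stored in HDFS rather than in a local list.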
