Hadoop based Analysis and Visualization of Diabetes Data through Tableau
Due to rapid development of diverse healthcare practices, various procedures used in healthcare, produce data. This healthcare data has been scaled to a bigger size, thus, there is a dire need to analyze this scaled data in an efficient manner. Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of commodity computers. Analytical processing of big data with Hadoop is very helpful in performing significant actual-point in time analysis on massive amount of data and is capable to forecast an emergency situation. In this paper Hadoop driven analysis has been performed on a diabetes case study through comparison among Pig, Hive, and Tableau The superiority of Tableau is established as it allows users with minimal statistical background to visualize the results of analysis in an easy ‚??onbutton- click‚?? manner.
Hadoop, Pig, Hive, Tableau, Diabetes