Open Access Journal

ISSN : 2394-2320 (Online)

International Journal of Engineering Research in Computer Science and Engineering (IJERCSE)

Monthly Journal for Computer Science and Engineering

Hadoop Data Analysis on YouTube Statistics

Authors : Jatti Mounika 1, Nagaveni B. Biradar 2

Date of Publication : 18th April 2018

Abstract: Over the past decades, the analysis of structured data has seen remarkable progress. The principal objective of this project is to show how data produced by YouTube can be mined and used to make targeted, real-time decisions with the Hadoop framework. In this project, the dataset is gathered through the YouTube API and stored in the Hadoop Distributed File System (HDFS). The MapReduce algorithm is then applied to process the dataset and identify the top video categories, the top video uploaders, and the most viewed videos.
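
The category ranking described in the abstract can be illustrated with a small in-process sketch of the map, shuffle, and reduce phases. This is not the authors' implementation: the tab-separated record layout (video_id, uploader, category, views), the sample rows, and all function names are assumptions made for illustration, and the code runs locally in plain Python rather than on a Hadoop cluster.

```python
from collections import defaultdict

# Hypothetical record layout (an assumption, not taken from the paper):
# video_id <TAB> uploader <TAB> category <TAB> views
SAMPLE_RECORDS = [
    "v1\talice\tMusic\t1200",
    "v2\tbob\tComedy\t300",
    "v3\talice\tMusic\t4500",
    "v4\tcarol\tSports\t800",
    "v5\tbob\tMusic\t50",
    "malformed line without tabs",
]


def map_category(line):
    """Map phase: emit (category, 1) for each well-formed record."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 4:
        return  # skip malformed records instead of failing the whole job
    yield fields[2], 1


def shuffle(pairs):
    """Shuffle phase: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_count(key, values):
    """Reduce phase: sum the counts emitted for one category."""
    return key, sum(values)


def top_categories(lines, n):
    """Run map, shuffle, and reduce, then keep the n most frequent categories."""
    pairs = (pair for line in lines for pair in map_category(line))
    counts = [reduce_count(key, values) for key, values in shuffle(pairs).items()]
    # Sort by descending count, breaking ties alphabetically for stable output.
    return sorted(counts, key=lambda kv: (-kv[1], kv[0]))[:n]


if __name__ == "__main__":
    for category, count in top_categories(SAMPLE_RECORDS, 3):
        print(f"{category}\t{count}")
```

Under the same assumed layout, emitting `(fields[1], 1)` in the map step would rank uploaders, and emitting `(fields[0], int(fields[3]))` would rank videos by view count. On an actual cluster the shuffle step is performed by the framework between the map and reduce tasks.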

