Author : P.Sushma 1
Date of Publication :25th April 2018
Abstract: We live in a digitalized world today. An enormous amount of data is generated from every digital service we use. This enormous amount of generated data is called Big Data. According to Wikipedia, Big data is a word for data sets that are enormous in size or compound that traditional data supervision application software is pathetic to compact with them [5].Big data defies embrace receiving data, data storage, data analysis, search, sharing, transfer, visualization, querying, updating and information privacy. Google‘s video streaming services, YouTube, is one of the best examples of services which produces a huge quantity of data in a very short period. Data mining of such an enormous quantity of data is performed using Hadoop and MapReduce to measure performance. Hadoop is a system which delivers a consistent collective storage. The storage is provided by HDFS (Hadoop Distributed File System) and analysis by MapReduce. MapReduce is a programming model and an associated implementation for processing large data sets. This paper presents big data analysis on Youtube using Hadoop and MapReduce techniques.
Reference :
-
- Webster, John. "MapReduce: Simplified Data Processing on Large Clusters", "Search Storage",2004. Retrieved on 25 March 2013. https://static.googleusercontent.com/media/research.goog le.com/en//archive/mapreduce-osdi04.pdf.
- Bibliography: Big Data Analytics: Methods and Applications by SaumyadiptaPyne, B.L.S. PrakasaRao, S.B. Rao.
- YOUTUBE COMPANY STATISTICS. https://www.statisticbrain.com/youtube-statistics/.
- Youtube.com @2017. YouTube for media. https://www.youtube.com/yt/about/press/
- Big data;Wikipedia https://en.wikipedia.org/wiki/Big_data
- Kallerhoff,Phillip. ―Big Data and Credit Unions: Machine Learning in Member Transactions .
- Marr,Barnard.―Why only one of the 5 Vs of big data really matters http://www.ibmbigdatahub.com/blog/why-only-one-5-vsbig-data-really-matters.
- 2016. Information. "Chapter 1 - Big Data Overview". Big Data: Concepts, Methodologies, Tools, and Applications, Volume I. IGI Global. http://common.books24x7.com/toc.aspx?bookid=114 046
- Apache Hadoophttp://hadoop.apache.org/
- How To Analyze Big Data With HadoopTechnologies ; 3pillarglobal.com. 2017. https://www.3pillarglobal.com/insights/analyze-big-datahadoop-technologies
- J. Dean, S. Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, in:OSDI‘04, 6th Symposium on Operating Systems Design and Implementation, Sponsored by USENIX, in cooperation with ACM SIGOPS, 2004, pp. 137– 150.