计算机科学
大数据
数据仓库
多样性(控制论)
分析
数据库
体积热力学
过程(计算)
数据库事务
非结构化数据
软件
数据挖掘
数据科学
操作系统
人工智能
物理
量子力学
作者
T A Ashwitha,Anisha P Rodrigues,Niranjan N. Chiplunkar
标识
DOI:10.1109/csitss.2017.8447828
摘要
In today's world there is a huge growth in data. This data is generated from variety of sources like social media, industry, transaction records, cell phone, GPS signals etc. It is difficult and challenging to store such a huge amount data in traditional data warehouse. Big Data is the dataset with 3 V's that are Volume, Variety and Velocity and difficult to store and process using traditional database management systems. Big Data Analytics is the way of processing the large amount of data. Hadoop is a popular open source software which is very useful in analyzing the larger data. Hadoop provides several tools for this purpose like Hive, Pig, Hbase, Cassandra etc. In this paper, we have used Hadoop framework. For the analysis of movie dataset Hive tool is used with Hadoop framework. We have got significant improvement in processing time for analyzing dataset compared to traditional system.
科研通智能强力驱动
Strongly Powered by AbleSci AI