What is BigData in Layman's Language
BigData Problem
Hey Everyone.!
So I've just started a research on data like how much data does same company works per day. What I found what left me in raising some questions.
I searched on the internet about home much data does Facebook has to deal with per day and from techcrunch.com I found out that facebook actually deals with over 500+ TerraBytes(TBs) of data per day. This made me wonder okay it deals with it but how and where does it store and many more questions came up in my mind.
These Questions that I just raised here are termed as BIGDATA PROBLEMS. As Facebook needs to interact with 500TBs a day, it needs some huge hard disks. It can get those hardwares simply by investing lots of money but th ( problem will not get solved here. Due to very large storages and Very much of data incoming, processing those data will not be fast as Harddisks are not that much faster. It will take a couple of Days just to read the data. Now think of this, if you have a requirement of some particular data on Facebook and it will reach you after 4days(2 days for reading and 2 for writing), will you be using Facebook anymore? This problem I stated here is a subproblem of BigData which is known has velocity problem. The other problem is that how to get that much amount of storage, which is known has volume problem.
To solve this velocity problem The Tech Guys came up with the Distributed Storage Cluster System, in which The data is stored simultaneously in small parts in different storage hardwares. This will speed the process by no.of storage devices you use. More the storage devices more speed you will get. By getting n no. of storage units you will solve both volume and velocity problem.