The benefits and pitfalls, for different Big Data analyzing, can be an intense topic of conversation spanning a wide range of industries. Now, we are overtaken by the large amount of data, which is not piling up like government files, but yet needs to be maintained.
Here, we are talking about that data which is so large and complex that it is inadequate to handle and deal with that. So here, the challenges that I noticed, are very common but are inevitable. While looking at surveys via surfing over the internet, it has come up that some of the organizations are messed up with their own data. They want to escape out of it but can't. The data challenges like "Capturing , Analyzing , Search and Sharing etc" all are being faced by these organizations.
So what is Big Data? Spending so much time on social networks, it's now human nature to upload something to express your views and feelings, etc. These sometimes can be images, videos, articles, news, etc. Daily, on an hourly basis, it's being uploaded and usually the same stuff is uploaded at the speed of 1000 per minute (general quotes of Development team of Facebook for the pic uploading).
How will the social networking developer manage databases and servers to maintain and store this data? This Big Data is not only media information, but also any kind of unstructured data.
We, sometimes, there is SQL to handle the structured data, and the data which is not structured will be converted into structural form. But, a contradiction is here. How much SQL will come in the market for this? It's just not the warehousing queries concept but bringing the smart analyzing techniques to work on this.
Yes! Then, Hadoop comes. Hadoop is a framework designed to work over the Big Data and to drop down the complexity of it. The working of Big Data is not like SQL, it's way faster than that. Hadoop brings the idea of maintaining the blocks of the data of a fixed size, so that it can be dealt with easily. These blocks may be further maintained over any fashion, but the main focus is on operating. The "Yet Another Resource Negotiator" (called as YARN) is very unique and beneficial Cluster Management Technique.
YARN is a part of HADOOP along with other master and slave nodes.
The HADOOP framework lets the market sector analyze and solve the issues for the product marketing. Now, they are able to advertise their product to those who had shown them interest while surfing over the internet. This removes unwanted ads from your browser.
Also, you can try it over YouTube if you have any concerns. Try opening some advertisements like shopping sites or any product more than twice. The YouTube advertising team will get your static I.P. with the area of interest for products and you will be able to see it while seeing a video.
This is a short blog about Hadoop and Big Data just for showing you and educating you about the technology.