What is BIG DATA
As Name suggests BIG Data is huge data created due to IOT, Apps and real time application data colelction
• Walmart handles more than 1 million customer transactions every hour.
• Facebook stores, accesses, and analyzes 30+ Petabytes of user generated data.
• 230+ millions of tweets are created every day.
• More than 5 billion people are calling, texting, tweeting and browsing on mobile phones worldwide.
The three different formats of big data are:
1. Structured: Organised data format with a fixed schema. Ex: RDBMS
2. Semi-Structured: Partially organised data which does not have a fixed format. Ex: XML, JSON
3. Unstructured: Unorganised data with an unknown schema. Ex: Audio, video files etc.
Main Challanges
• Validity: correctness of data
• Variability: dynamic behaviour
• Volatility: tendency to change in time
• Vulnerability: vulnerable to breach or attacks
• Visualization: visualizing meaningful usage of data
Solutions -
HadoopPigHiveCAssandraSparkKafkaetc


Comments
Post a Comment