Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
WHAT IS BIG DATA?
What is Big Data?
• There are humungous amount of data, available which have a
lot of meaningful insights – they need to b...
Three V’s of Big Data
Three V’s of Big Data
THE CHALLENGE
Background
Shorter Time to React
• Data that enters your organization and has some kind of value
for a limited window of time
• This ...
Data Economics
• Why Volume is good ?
– No individual record is particularly valuable
– Having every record is incredibly ...
Data Storage
Schema Structured Un Structured
Storage Medium RDBMS Filers
Storage Reliability Very reliable Very reliable
P...
BIG DATA’S APPROACH
Big Data Approach
Big Data refer to
technologies that
can capture, process
and analyze data.
No SQL Database Types
• Key-value store
– Key can be custom or auto generated
– Value can be complex objects like XML, BLO...
No SQL Database Types
• Document database
– Designed to store, retrieve & manage document
oriented information; expands on...
Analytical Database
• An analytical database is a type of database built to store,
manage, and consume big data.
• Optimiz...
BIG DATA USE CASE
PATTERNS
Preprocess & Store
• Scenario
– Data getting continuously generated in large volume
– Need to pre-process before loading i...
Real Time Actions
• Scenario
– Manage actions to be taken
on continuously changing
data in real time
Credit Card Issuer
Sears – Competes on Big Data
• They have data of over 100 million customers, which they
analyse to offer real-time, releva...
THE FUTURE OF BIG
DATA
Compound Annual Growth Rate
IDC Report Analysis
Careers in Big Data
THE END
Next : Hadoop
What is Big DATA | Hadoop online training
What is Big DATA | Hadoop online training
Upcoming SlideShare
Loading in …5
×

of

What is Big DATA | Hadoop online training Slide 1 What is Big DATA | Hadoop online training Slide 2 What is Big DATA | Hadoop online training Slide 3 What is Big DATA | Hadoop online training Slide 4 What is Big DATA | Hadoop online training Slide 5 What is Big DATA | Hadoop online training Slide 6 What is Big DATA | Hadoop online training Slide 7 What is Big DATA | Hadoop online training Slide 8 What is Big DATA | Hadoop online training Slide 9 What is Big DATA | Hadoop online training Slide 10 What is Big DATA | Hadoop online training Slide 11 What is Big DATA | Hadoop online training Slide 12 What is Big DATA | Hadoop online training Slide 13 What is Big DATA | Hadoop online training Slide 14 What is Big DATA | Hadoop online training Slide 15 What is Big DATA | Hadoop online training Slide 16 What is Big DATA | Hadoop online training Slide 17 What is Big DATA | Hadoop online training Slide 18 What is Big DATA | Hadoop online training Slide 19 What is Big DATA | Hadoop online training Slide 20 What is Big DATA | Hadoop online training Slide 21 What is Big DATA | Hadoop online training Slide 22 What is Big DATA | Hadoop online training Slide 23 What is Big DATA | Hadoop online training Slide 24 What is Big DATA | Hadoop online training Slide 25
Upcoming SlideShare
Hadoop demo ppt
Next
Download to read offline and view in fullscreen.

3 Likes

Share

Download to read offline

What is Big DATA | Hadoop online training

Download to read offline

we provide best Hadoop devlopment and hadoop admin online training.

Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment.

hadoop training, hadoop online training, hadoop training in bangalore, hadoop training in hyderabad, best hadoop training institutes, hadoop online training in chicago, hadoop training in mumbai, hadoop training in pune, hadoop training institutes ameerpet

Related Books

Free with a 30 day trial from Scribd

See all

Related Audiobooks

Free with a 30 day trial from Scribd

See all

What is Big DATA | Hadoop online training

  1. 1. WHAT IS BIG DATA?
  2. 2. What is Big Data? • There are humungous amount of data, available which have a lot of meaningful insights – they need to be analysed • Existing Online Transaction Processing (OLTP) and Business Intelligence (BI) are not easily scalable considering cost, effort, and manageability aspect. • It is not just volume, but also the variety and velocity of data. • Big data is a terminology that refers to challenges that we are facing due to exponential volume, variety and velocity of data.
  3. 3. Three V’s of Big Data
  4. 4. Three V’s of Big Data
  5. 5. THE CHALLENGE
  6. 6. Background
  7. 7. Shorter Time to React • Data that enters your organization and has some kind of value for a limited window of time • This window usually shuts well before the data has been transformed and loaded into a data warehouse for deeper analysis. • The higher the volumes of data entering your organization per second, the bigger your challenge.
  8. 8. Data Economics • Why Volume is good ? – No individual record is particularly valuable – Having every record is incredibly valuable • Why storage decision is important ? • How much value can I extract from every byte of data verses the cost to save that data? – If value > cost – then keep it online, on DB or filer – If cost > value – I discard it or archive on tape (expensive to throw data)
  9. 9. Data Storage Schema Structured Un Structured Storage Medium RDBMS Filers Storage Reliability Very reliable Very reliable Processing ability Very reliable unstructured schema poses challenges Location of processing SQL queries pull data to server Random means to retrieve sense Impact of data increase Cost increases linearly Cost increases linearly Support for Big Data No No
  10. 10. BIG DATA’S APPROACH
  11. 11. Big Data Approach Big Data refer to technologies that can capture, process and analyze data.
  12. 12. No SQL Database Types • Key-value store – Key can be custom or auto generated – Value can be complex objects like XML, BLOB, JSON etc – Popular : DynamoDB, Azure Table Store (ATS), Riak • Column store – Data is stored as families of columns; high scalability with very high performance architecture – Examples : HBase, Cassandra, Vertica and Hypertable
  13. 13. No SQL Database Types • Document database – Designed to store, retrieve & manage document oriented information; expands on key-value store – Example: MongoDB, CouchDB • Graph database – Designed for data that whose relations are well represented in graphs, usually with nodes connected to edges – Examples : Neo4J and Polyglot
  14. 14. Analytical Database • An analytical database is a type of database built to store, manage, and consume big data. • Optimized for processing advanced analytics that involves highly complex queries on terabytes of data and complex statistical processing, data mining, and NLP (natural language processing). • Examples of analytical databases are Vertica (acquired by HP), Aster Data (acquired by Teradata), Greenplum (acquired by EMC), and so on.
  15. 15. BIG DATA USE CASE PATTERNS
  16. 16. Preprocess & Store • Scenario – Data getting continuously generated in large volume – Need to pre-process before loading into target systems
  17. 17. Real Time Actions • Scenario – Manage actions to be taken on continuously changing data in real time
  18. 18. Credit Card Issuer
  19. 19. Sears – Competes on Big Data • They have data of over 100 million customers, which they analyse to offer real-time, relevant offers to their customers. • The solution was 3 years in the making, which included programming that would capture, analyze, and report on customer activity at an individual level, across all 4,000 locations. • Sears has a Hadoop cluster of 300-nodes that is populated with over 2 petabytes of structure customer transaction data, sales data and supply chain data. • Results: Sears achieved an active member base in the 8 digits, exceeding the projected 36 month membership target in 17 months.
  20. 20. THE FUTURE OF BIG DATA
  21. 21. Compound Annual Growth Rate IDC Report Analysis
  22. 22. Careers in Big Data
  23. 23. THE END Next : Hadoop
  • salesforce-training

    Dec. 30, 2014
  • hadoop-training

    Dec. 30, 2014
  • suresh575

    Dec. 30, 2014

we provide best Hadoop devlopment and hadoop admin online training. Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. hadoop training, hadoop online training, hadoop training in bangalore, hadoop training in hyderabad, best hadoop training institutes, hadoop online training in chicago, hadoop training in mumbai, hadoop training in pune, hadoop training institutes ameerpet

Views

Total views

654

On Slideshare

0

From embeds

0

Number of embeds

2

Actions

Downloads

9

Shares

0

Comments

0

Likes

3

×