2. About Perficient
Perficient is a leading information technology consulting firm serving clients throughout North America. We help clients implement business-driven technology solutions that integrate business processes, improve worker productivity, increase customer loyalty, and create a more agile enterprise that can better respond to new business opportunities.
3. Perficient Profile
• Founded in 1997
• Public, NASDAQ: PRFT
• 2012 revenue of $327 million
• Major market locations throughout North America: Atlanta, Austin, Boston, Charlotte, Chicago, Cincinnati, Cleveland, Columbus, Dallas, Denver, Detroit, Fairfax, Houston, Indianapolis, Minneapolis, New Orleans, New York, Northern California, Philadelphia, Southern California, St. Louis, Toronto, and Washington, D.C.
• Global delivery centers in China, Europe, and India
• ~2,000 colleagues
• Dedicated solution practices
• ~85% repeat business rate
• Alliance partnerships with major technology vendors
• Multiple vendor/industry technology and growth awards
4. Our Solutions Expertise
Business Solutions
• Business Intelligence
• Business Process Management
• Customer Experience and CRM
• Enterprise Performance Management
• Enterprise Resource Planning
• Experience Design (XD)
• Management Consulting
Technology Solutions
• Business Integration/SOA
• Cloud Services
• Commerce
• Content Management
• Custom Application Development
• Education
• Information Management
• Mobile Platforms
• Platform Integration
• Portal & Social
5. Speakers
Randall Gayle
• Data Management Director for Perficient
• 30+ years of data management experience
• Helps companies develop solutions around master data management, data quality, data governance, and data integration
• Provides data management expertise to industries including oil and gas, financial services, banking, healthcare, government, retail, and manufacturing
John Haddad
• Senior Director of Big Data Product Marketing for Informatica
• 25+ years of experience developing and marketing enterprise applications
• Advises organizations on Big Data best practices from a management and technology perspective
6. Interesting Facts about BIG Data
1. It took from the dawn of civilization to the year 2003 for the world to generate 1.8 zettabytes (about 10^12 gigabytes) of data. By 2011, the same amount was generated every two days on average.
2. If you stacked a pile of CD-ROMs on top of one another until you reached the current global storage capacity for digital information (about 295 exabytes), it would stretch 80,000 km beyond the moon.
3. Every hour, enough information is consumed by internet traffic to fill 7 million DVDs. Side by side, they'd scale Mount Everest 95 times.
4. 247 billion e-mail messages are sent each day; up to 80% of them are spam.
5. 48 hours of video are uploaded to YouTube every minute, resulting in 8 years' worth of digital content each day.
6. The world's data doubles every two years.
7. There are nearly as many bits of information in the digital universe as there are stars in our actual universe.
8. There are 30 billion pieces of content shared on Facebook every day and 750 million photos uploaded every two days.
7. Agenda
• Innovation vs. Cost
• PowerCenter Big Data Edition
• What else does Informatica offer for Big Data?
• What Are Customers Doing with Informatica and Big Data?
• Next Steps
• Q&A
9–10. How do you balance innovation and cost?
• Innovation: Business (CEO and VP/Director of Sales & Marketing, Customer Service, Product Development)
• Cost: IT (CIO and VP/Director of Information Management, BI / Data Warehousing, Enterprise Architecture)
11–13. Business is connecting innovation to Big Data
• Financial Services: Risk & Portfolio Analysis, Investment Recommendations, Fraud Detection
• Retail & Telco: Proactive Customer Engagement, Location-Based Services
• Media & Entertainment: Online & In-Game Behavior, Customer Cross/Up-Sell
• Manufacturing: Connected Vehicle, Predictive Maintenance
• Public Sector: Health Insurance Exchanges, Public Safety, Tax Optimization
• Healthcare & Pharma: Predicting Patient Outcomes, Total Cost of Care, Drug Discovery
14–16. IT is struggling with the cost of Big Data
• Growing data volume is quickly consuming capacity
• Need to onboard, store, & process new types of data
• High expense and lack of big data skills
20–21. [Chart] Without PowerCenter Big Data Edition, most time goes to data preparation (parse, profile, cleanse, transform, match), leaving little time for data analysis; with PowerCenter Big Data Edition, preparation time shrinks and the time available for analysis grows.
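To make the five preparation steps named on the chart concrete (parse, profile, cleanse, transform, match), here is a minimal Python sketch. Every function, field, and value is a hypothetical illustration, not an Informatica API:

```python
# Toy pipeline for the five preparation steps: parse, profile,
# cleanse, transform, match. All names are invented for illustration.

raw_records = ["alice,NY, 100", "bob,,250", "ALICE,ny,100"]

def parse(line):
    name, city, amount = (f.strip() for f in line.split(","))
    return {"name": name, "city": city, "amount": amount}

def profile(records):
    # Count missing values per field to surface quality issues early.
    fields = records[0].keys()
    return {f: sum(1 for r in records if not r[f]) for f in fields}

def cleanse(record):
    # Standardize casing and default empty fields.
    return {
        "name": record["name"].lower(),
        "city": record["city"].upper() or "UNKNOWN",
        "amount": record["amount"],
    }

def transform(record):
    record["amount"] = int(record["amount"])
    return record

def match(records):
    # Group records that appear to describe the same entity.
    groups = {}
    for r in records:
        groups.setdefault((r["name"], r["city"]), []).append(r)
    return groups

parsed = [parse(l) for l in raw_records]
nulls = profile(parsed)      # one missing city in this sample
clean = [transform(cleanse(r)) for r in parsed]
matched = match(clean)       # the two "alice" rows group together
```

Even in this toy, the preparation code dwarfs any analysis step, which is the point the chart makes.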
22. Informatica + Hadoop
PowerCenter Developers are Now Hadoop Developers
Data sources: transactions, OLTP, OLAP; social media, web logs; machine/device, scientific; documents and emails
Processing on Hadoop: Archive, Profile, Parse, ETL, Cleanse, Match
Delivery: analytics & operational dashboards, mobile apps, real-time alerts
23. The Vibe Virtual Data Machine
• Transformation Library: defines logic
• Optimizer: deploys most efficiently based on data, logic, and execution environment
• Executor: run-time physical execution
• Connectors: connectivity to data sources
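The "define logic once, execute anywhere" idea behind these components can be sketched conceptually: mappings are declared against a transformation library, compiled, and handed to an interchangeable executor. This is a toy model under stated assumptions, not the actual Vibe implementation:

```python
# Conceptual toy of a virtual data machine: logic is declared once
# and run by a pluggable executor. Names are invented for illustration.

TRANSFORM_LIBRARY = {        # prebuilt transforms the engine can call
    "strip": str.strip,
    "upper": str.upper,
}

def compile_mapping(step_names):
    # "Optimizer" role: resolve declared steps against the library.
    return [TRANSFORM_LIBRARY[name] for name in step_names]

class LocalExecutor:
    # "Executor" role: run-time physical execution, here an
    # in-process loop; a cluster executor could run the same mapping.
    def run(self, steps, records):
        out = []
        for rec in records:
            for step in steps:
                rec = step(rec)
            out.append(rec)
        return out

mapping = compile_mapping(["strip", "upper"])       # defined once...
result = LocalExecutor().run(mapping, ["  ca", "ny "])  # ...run here
```

Swapping `LocalExecutor` for a different engine without touching `mapping` is the "map once, deploy anywhere" claim in miniature.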
24. Vibe Virtual Data Machine: Map Once. Deploy Anywhere.
Information solutions and data services built on the VDM: Data Integration, Data Quality, Master Data Management, Information Lifecycle, Information Exchange, and 3rd-party solutions, plus infrastructure services and role-based tools.
Deploy anywhere: Desktop, Server, Hadoop, Cloud, Data Virtualization, Data Integration Hub, embedded DQ in apps.
25. PowerCenter Big Data Edition: The Safe On-Ramp to Big Data
Big Transaction Data
• Online Transaction Processing (OLTP): Oracle, DB2, Ingres, Informix, Sybase, SQL Server, …
• Online Analytical Processing (OLAP) & DW appliances: Teradata, Red Brick, Essbase, Sybase IQ, Netezza, Exadata, HANA, Greenplum, DATAllegro, Aster Data, Vertica, ParAccel, …
• Cloud: Salesforce.com, Concur, Google App Engine, Amazon, …
Big Interaction Data
• Social media & web data: Facebook, Twitter, LinkedIn, YouTube, web applications, blogs, discussion forums, communities, partner portals, …
• Other interaction data: clickstream, image/text, scientific, genomic/pharma, medical, medical device, sensors/meters, RFID tags, CDR/mobile, …
Big Data Processing
26. PowerCenter Big Data Edition: The Safe On-Ramp to Big Data
The same landscape of big transaction data and big interaction data, now handled by the Vibe™ virtual data machine. Key capabilities:
• Universal Data Access
• High-Speed Data Ingestion and Extraction
• ETL on Hadoop
• Profiling on Hadoop
• Complex Data Parsing on Hadoop
• Entity Extraction and Data Classification on Hadoop
• No-Code Productivity
• Business-IT Collaboration
• Unified Administration
27. PowerCenter Big Data Edition: Lower Costs
Data sources (transactions, OLTP, OLAP; social media, web logs; machine/device, scientific; documents and emails) feed the EDW and data marts over a traditional grid.
• Optimize processing on low-cost hardware
• Increase productivity up to 5X
28. PowerCenter Big Data Edition: Minimize Risk
• Run on a traditional grid
• Deploy on-premise or in the cloud
• Quickly staff projects with trained experts
• Map Once. Deploy Anywhere™
29. PowerCenter Big Data Edition: Innovate Faster
Sources (transactions, OLTP, OLAP; social media, web logs; machine/device, scientific; documents and emails) feed analytics & operational dashboards, mobile apps, and real-time alerts.
• Onboard and analyze any type of data to gain big data insights
• Discover insights faster through rapid development and collaboration
• Operationalize big data insights to generate new revenue streams
30. Poll Question #1
What are your plans for Hadoop? (select one)
• Currently using Hadoop
• Plan to implement Hadoop in 3-6 months
• Plan to implement Hadoop in 6-12 months
• No plans for Hadoop
32–33. Lower Data Management Costs
[Chart: database size vs. time for the enterprise data warehouse, split into active and inactive data; archiving inactive data to low-cost storage preserves performance as transactions, OLTP, and OLAP data keep flowing in]
• Identify dormant data
• Archive inactive data to low-cost storage
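The archiving idea on this slide, flagging rows whose last access falls outside a retention window and moving them to low-cost storage, can be sketched as follows. The one-year threshold, dates, and table layout are illustrative assumptions:

```python
# Sketch of dormant-data archiving: rows untouched for more than a
# retention window move from the warehouse to low-cost storage.
from datetime import date, timedelta

DORMANT_AFTER = timedelta(days=365)   # illustrative retention window
TODAY = date(2013, 6, 1)

warehouse = [
    {"id": 1, "last_access": date(2013, 5, 20)},
    {"id": 2, "last_access": date(2011, 1, 15)},
    {"id": 3, "last_access": date(2012, 2, 1)},
]
low_cost_storage = []

def archive_inactive(rows, today):
    active = []
    for row in rows:
        if today - row["last_access"] > DORMANT_AFTER:
            low_cost_storage.append(row)   # archive dormant row
        else:
            active.append(row)             # keep hot data in the EDW
    return active

warehouse = archive_inactive(warehouse, TODAY)
```

In a real deployment the "last access" signal would come from usage monitoring rather than a column, but the partition into hot and archived data is the same.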
35. Minimize Risk
Dynamic data masking and data virtualization let the EDW, ODS, and MDM hub serve BI reports and dashboards without extra copies of the data.
• Avoid copies of data and augment the data warehouse using data virtualization
• Role-based, fine-grained secure access
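A minimal sketch of role-based dynamic masking as described here: the same record is returned differently depending on the caller's role, with no masked copy stored. The roles, fields, and masking rule are invented for illustration:

```python
# Toy dynamic data masking: apply per-role masking rules at read time.
# Roles and rules are illustrative assumptions.

def mask_ssn(value):
    # Keep only the last four digits visible.
    return "***-**-" + value[-4:]

MASKING_RULES = {
    "analyst": {"ssn": mask_ssn},   # analysts see masked SSNs
    "auditor": {},                  # auditors see everything
}

def fetch(record, role):
    # Fine-grained access: mask each field per the caller's role.
    rules = MASKING_RULES[role]
    return {k: rules.get(k, lambda v: v)(v) for k, v in record.items()}

row = {"name": "Ada", "ssn": "123-45-6789"}
```

Because masking happens on read, the underlying row stays unchanged and no second, masked dataset has to be maintained.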
37. Innovate Faster With Big Data
Data governance lifecycle: Discover, Define, Apply, Measure and Monitor.
Discover
• Data discovery
• Data profiling
• Data inventories
• Process inventories
• CRUD analysis
• Capabilities assessment
Define
• Business glossary creation
• Data classifications
• Data relationships
• Reference data
• Business rules
• Data governance policies
• Other dependent policies
Apply
• Automated rules
• Manual rules
• End-to-end workflows
• Business/IT collaboration
Measure and Monitor
• Proactive monitoring
• Operational dashboards
• Reactive operational DQ audits
• Dashboard monitoring/audits
• Data lineage analysis
• Program performance
• Business value/ROI
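The "Apply" stage above (automated rules feeding business/IT collaboration) might look like this in miniature; rule names, fields, and routing are illustrative assumptions:

```python
# Toy data-governance "Apply" stage: run automated quality rules over
# records and route failures for review. Rules are illustrative.

RULES = {
    "amount_positive": lambda r: r["amount"] > 0,
    "country_known": lambda r: r["country"] in {"US", "CA", "MX"},
}

def apply_rules(records):
    passed, failed = [], []
    for rec in records:
        # Collect the names of every rule this record violates.
        violations = [name for name, rule in RULES.items() if not rule(rec)]
        (failed if violations else passed).append((rec, violations))
    return passed, failed

data = [
    {"amount": 50, "country": "US"},
    {"amount": -5, "country": "ZZ"},
]
passed, failed = apply_rules(data)
```

The failed queue, with its list of violated rules, is what a steward would review in the collaboration step; the passed queue flows on unattended.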
38. Innovate Faster With Big Data
• Enrich master data to proactively engage customers & improve products and services
39. Innovate Faster With Big Data
• Analyze data in real-time using event-based processing and proactive monitoring
Event flow: customer business rules, social data, geo-location data, and transaction data are combined to trigger alerts and merchant offers.
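Event-based processing of the kind this slide depicts can be sketched as a per-event rule evaluation over a transaction stream; the thresholds, fields, and rules are invented for illustration:

```python
# Toy event-based processing: evaluate each incoming transaction
# against customer rules to emit an alert or a merchant offer.
# Thresholds and event fields are illustrative assumptions.

def on_event(event, home_city, alert_threshold=1000):
    # Rule 1: large transaction far from home triggers a fraud alert.
    if event["amount"] >= alert_threshold and event["city"] != home_city:
        return ("alert", "verify transaction")
    # Rule 2: transaction near a partner merchant triggers an offer.
    if event.get("near_merchant"):
        return ("offer", event["near_merchant"])
    return ("ok", None)

stream = [
    {"amount": 2500, "city": "Lagos", "near_merchant": None},
    {"amount": 12, "city": "Austin", "near_merchant": "coffee-co"},
]
decisions = [on_event(e, home_city="Austin") for e in stream]
```

A production system would run such rules continuously against the event stream rather than over a finished list, but the per-event decision logic is the same shape.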
40. Poll Question #2
What other data management technologies are you considering within the next 12 months? (check all that apply)
• Data archiving
• Data masking
• Data virtualization
• Data quality
• Data discovery
• MDM
• Real-time event-based processing
42. Large Government Agency: Flexible architecture to support rapid changes
The Challenge: Data volumes growing at 3-5 times over the next 2-3 years.
The Solution:
• Manage data integration and load of 10+ billion records from multiple disparate data sources
• Flexible data integration architecture to support changing business requirements in a heterogeneous data management environment
Environment: mainframe, RDBMS, and unstructured data feed the EDW and data warehouses over a traditional grid, with data virtualization serving business reports.
43. Large Global Financial Institution: Lower costs of Big Data projects
The Challenge: Data warehouse exploding with over 200 TB of data; user activity generating up to 5 million queries a day, impacting query performance.
The Result:
• Saved $20M, plus $2-3M ongoing, through archiving & optimization
• Reduced project timeline from 6 months to 2 weeks
• Improved performance by 25%
• Return on investment in less than 6 months
Environment: ERP, CRM, custom sources, and interaction data feed the EDW, with inactive data moved to archive storage, serving business reports.
44. Large Global Financial Institution: Lower costs and minimize risk
The Challenge: Increasing demand for faster data-driven decision making and analytics as data volumes and processing loads rapidly increase.
The Result:
• Cost-effectively scale performance
• Lower hardware costs
• Increased agility by standardizing on one data integration platform
• Leverage new data sources for faster innovation
Environment: web logs and RDBMS sources processed in near real-time on a traditional grid, feeding data marts and the data warehouse.
45. Large Global Automotive Manufacturer: Create innovative products and services
The Challenge: Collect data in real-time from all cars by end of the year for the "Connected Car" program.
The Result: helps enable the goals of the connected vehicle program:
• Embedding mobile technologies to enhance customer experience
• Predictive maintenance and improved fuel efficiency
• On-call roadside assistance and automatic service scheduling
Environment: the Connected Vehicle program feeds complex event processing and the EDW, serving business reports.
47. What should you be doing?
• Tomorrow
– Identify a business goal where data can have a significant impact
– Identify the skills you need to build a big data analytics team
• 3 months
– Identify and prioritize the data you need to achieve your business
goals
– Put a business plan and reference architecture together to optimize
your enterprise information management infrastructure
– Execute a quick win big data project with measurable ROI
• 1 year
– Extend data governance to include more data and more types of data that impact the business
– Consider a shared-services model to promote best practices and
further lower infrastructure and labor costs
Speaker notes: Cost saving / control of the growing data environment; data management cost optimization; business-specific big data analytics; big data integration to support analytics and new data products and services.
Challenges & problems customers are facing with Big Data:
• Growing data volumes, expensive data warehouse upgrades
• Variety of data, onboarding new types of data
• Lack of Big Data skills
• Building the business case for a big data project
• Don't know where to begin
• Regulatory compliance and security (e.g., data privacy, data sharing)
Speaker notes: There are several challenges related to Big Data analytics. As data volumes continue to grow, how can you continue to meet your SLAs for existing projects while controlling costs? It is estimated that Big Transaction Data alone is growing at 50-60% per year. Application databases are growing to the point where not only are hardware and software costs rising, but application performance is adversely affected. Data warehouses are also growing too fast, using up the capacity of current infrastructure investments. And with Big Interaction Data exploding, who can afford to store all this information in their enterprise data warehouse? One financial institution estimated that it costs $180K to manage 1 TB of data in their data warehouse over a 3-year period. As more and more users demand information, organizations also experience a proliferation of data marts that further increases hardware and database costs; a large healthcare insurance provider had over 30,000 data marts and spreadmarts across the company. With data volumes growing exponentially, it is becoming difficult to process all the data required for the data warehouse during the nightly batch windows. If you continue to just throw expensive hardware and database licenses at the Big Data problem, your costs will spiral out of control. More and more organizations would like to leverage the massive amounts of interaction data, such as social media and machine device data, to attract and retain customers, improve business operations, and sharpen their competitive advantage. But because so much of this data is multi-structured and generated at a rate akin to drinking from a fire hose, they find that accessing, storing, and processing interaction data can be extremely difficult. Another challenge is that with so much new data being generated and stored, it is difficult for organizations to find, understand, and trust the data.
Speaker notes: You don't want expensive data scientists ($300K FTE) doing this work. At JPMC, hand coding took 3 weeks where Informatica took 3 days. In a recent InformationWeek article, "Meet The Elusive Data Scientist," Catalin Ciobanu, a physicist who spent ten years at Fermi National Accelerator Laboratory (Fermilab) and is now senior manager of BI at Carlson Wagonlit Travel, said: "70% of my value is an ability to pull the data, 20% of my value is using data-science methods and asking the right questions, and 10% of my value is knowing the tools." DJ Patil, Data Scientist in Residence at Greylock Partners (formerly Chief Data Scientist at LinkedIn), states in his book "Data Jujitsu" that "80% of the work in any data project is in cleaning the data." In a recent study that surveyed 35 data scientists across 25 companies (Kandel et al., "Enterprise Data Analysis and Visualization: An Interview Study," IEEE Visual Analytics Science and Technology (VAST), 2012), data scientists expressed their frustration in preparing data for analysis. One said: "I spend more than half my time integrating, cleansing, and transforming data without doing any actual analysis. Most of the time I'm lucky if I get to do any 'analysis' at all." Another noted that "most of the time once you transform the data … the insights can be scarily obvious." 44% of big data projects are cancelled, versus 25% for IT projects in general, and many more fail to achieve project objectives (Infochimps/SSWUG Enterprise Big Data Survey 2012; Synamic Markets Enterprise IT Survey 2008; http://www.slideshare.net/infochimps/top-strategies-for-successful-big-data-projects). Why do projects fail? Business reasons: inaccurate scope (not enough time, deadlines busted), non-cooperation between departments, and lack of the right talent and expertise. Technical reasons: technical or roll-out roadblocks in gathering data from different sources, and finding and understanding tools, platforms, and technologies.
Speaker notes (use the two-slide version of this):
• Lower costs: lower HW/SW costs; optimized end-to-end performance; rich pre-built connectors and a library of transforms for ETL, data quality, parsing, and profiling.
• Increased productivity: up to 5x productivity gains with a no-code visual development environment; no need for Hadoop expertise for data integration.
• Proven path to innovation: 5,000+ customers, 500+ partners, 100,000+ trained Informatica developers; enterprise scalability, security, and support.
Speaker notes: The Vibe VDM works by receiving a set of instructions that describe the data source(s) from which it will extract data; the rules and flow by which that data will be transformed, analyzed, masked, archived, matched, or cleansed; and ultimately where that data will be loaded when the processing is finished. Vibe consists of a number of fundamental components (see Figure 2):
• Transformation Library: a collection of useful, prebuilt transformations that the engine calls to combine, transform, cleanse, match, and mask data. For those familiar with PowerCenter or Informatica Data Quality, this library is represented by the icons that the developer can drag and drop onto the canvas to perform actions on data.
• Optimizer: compiles data processing logic into an internal representation to ensure effective resource usage and efficient run time, based on data characteristics and execution environment configurations.
• Executor: a run-time execution engine that orchestrates the data logic using the appropriate transformations. The engine reads/writes data from an adapter or directly streams the data from an application.
• Connectors: Informatica's connectivity extensions provide data access from various data sources. This is what allows Informatica Platform users to connect to almost any data source or application for use by a variety of data movement technologies and modes, including batch, request/response, and publish/subscribe.
Speaker notes: The Vibe virtual data machine, although critical, is not sufficient by itself to solve the wide spectrum of data integration challenges. Vibe lets you master complexity and change, and it makes all data accessible. But in many places where data lives, especially some of the emerging data sources, the data is unfiltered, unstandardized, uncleansed, and unrelated; some of it is even unnecessary. It takes a considerable amount of work and expertise to understand how to transform raw data into information that provides insight and value. So in addition to the enabling capabilities that Vibe delivers, you also need to layer on the data services and information solutions of a fully integrated information platform that ensures that data is:
• Complete: Insight comes from a complete picture, not from fragments. You have to integrate the data fragments so you are looking at a whole (a whole person, account, product, business process, organization, or nation) rather than pieces or parts.
• Timely: Different consumers and different use cases require data at different times and frequencies. You want one platform that accelerates the delivery of data when, where, and how it is needed, whether via messaging, bulk delivery, or a virtual view.
• Trusted: If data is incomplete, inaccurate, or unrelated, it is not of much use. You need data quality services that let you diagnose problems and then cleanse the data in a sustainable, efficient way.
• Authoritative: You also need master data management services to master the data and relationships that constitute the "whole" for your key business entities, even as the data fragments feeding into the "whole" continually change.
• Actionable: Ultimately, data needs to serve a user, whether human or machine. The platform needs to help the user understand when to pay attention to an event, investigate an issue, or act.
• Secure: With the exponential rise in combinations of people accessing data across different systems, the potential for a security breach also rises exponentially. You must be able to secure data consistently and universally, no matter where it resides or how it is used.
But it is not sufficient for an information platform to merely have a long checklist of information services. Only an information platform powered by a VDM provides the interoperability required to easily combine services on the fly to meet your specific business requirements. Only an information platform powered by a VDM can provide the right tools and capabilities for the simplest entry-level uses to the most complex cross-enterprise initiatives, allowing you to share work across that entire span without recoding. And only an information platform powered by a VDM has the flexibility to be deployed stand-alone in the data center, as a cloud service, or embedded into applications, middleware infrastructure, and devices.
Informatica announced the launch of the PowerCenter Big Data Edition at Hadoop World with general availability in December.The PowerCenter Big Data Edition provides a proven path to innovation that lowers data management costs with benefits that include:Bringing innovative products and services to market faster and improve business operationsReducing big data management costs while handling growing data volumes and complexity Realizing performance and costs benefits by expanding adoption of Hadoop across projects Minimizing risk by investing in proven data integration software that hides the complexity of emerging technologiesPowerCenter Big Data Edition Key Features include:Universal Data AccessYour IT team can access all types of big transaction data, including RDBMS, OLTP, OLAP, ERP, CRM, mainframe, cloud, and others. You can also access all types of big interaction data, including social media data, log files, machine sensor data, Web sites, blogs, documents, emails, and other unstructured or multi-structured data. High-Speed Data Ingestion and ExtractionYou can access, load, replicate, transform, and extract big data between source and target systems or directly into Hadoop or your data warehouse. High performance connectivity through native APIs to source and target systems with parallel processing ensures high-speed data ingestion and extraction.No-Code ProductivityRemoving hand-coding within Hadoop through the visual Informatica development environment. Develop and scale data flows with no specialized hand-coding in order to maximize reuse. 
Users can build once and deploy anywhere.

Unlimited Scalability. Your IT organization can process all types of data at any scale—from terabytes to petabytes—with no specialized coding on distributed computing platforms such as Hadoop.

Optimized Performance for Lowest Cost. Based on data volumes, data types, latency requirements, and available hardware, PowerCenter Big Data Edition deploys big data processing on the highest-performance and most cost-effective data processing platforms. You get the most out of your current investments and capacity whether you deploy data processing on SMP machines, traditional grid clusters, distributed computing platforms like Hadoop, or data warehouse appliances.

ETL on Hadoop. This edition provides an extensive library of prebuilt transformation capabilities on Hadoop, including data type conversions and string manipulations, high-performance cache-enabled lookups, joiners, sorters, routers, aggregations, and many more. Your IT team can rapidly develop data flows on Hadoop using a codeless graphical development environment that increases productivity and promotes reuse.

Profiling on Hadoop. Data on Hadoop can be profiled through the Informatica developer tool and a browser-based analyst tool. This makes it easy for developers, analysts, and data scientists to understand the data, identify data quality issues earlier, collaborate on data flow specifications, and validate mapping transformations and rules logic.

Design Once and Deploy Anywhere. ETL developers can focus on the data and transformation logic without having to worry about where the ETL process is deployed—on Hadoop or traditional data processing platforms. Developers can design once, without any specialized knowledge of Hadoop concepts and languages, and easily deploy data flows on Hadoop or traditional systems.
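To make the transformation library concrete, here is a minimal sketch in plain Python of the cache-enabled lookup and aggregation pattern described above. The function names and sample records are hypothetical; in a real PowerCenter data flow these steps are built graphically and pushed down to Hadoop rather than hand-coded.

```python
# Sketch of a lookup-then-aggregate data flow (hypothetical names and data).
from collections import defaultdict

def lookup_join(transactions, customers):
    """Cache-enabled lookup: enrich each transaction with the customer's region."""
    cache = {c["id"]: c["region"] for c in customers}  # build the lookup cache once
    return [{**t, "region": cache.get(t["cust_id"], "UNKNOWN")} for t in transactions]

def aggregate_by(rows, key, value):
    """Aggregator: sum the `value` field grouped by the `key` field."""
    totals = defaultdict(float)
    for r in rows:
        totals[r[key]] += r[value]
    return dict(totals)

customers = [{"id": 1, "region": "NA"}, {"id": 2, "region": "EU"}]
transactions = [{"cust_id": 1, "amount": 10.0}, {"cust_id": 2, "amount": 5.0},
                {"cust_id": 1, "amount": 2.5}]
enriched = lookup_join(transactions, customers)
print(aggregate_by(enriched, "region", "amount"))  # {'NA': 12.5, 'EU': 5.0}
```

The point of the sketch is the shape of the flow, not the code: building the lookup cache once and streaming rows through it is what a graphical data flow generates for you.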
Complex Data Parsing on Hadoop. This edition makes it easy to access and parse complex, multi-structured, unstructured, and industry-standard data such as Web logs, JSON, XML, and machine device data. Prebuilt parsers for market data and industry standards like FIX, SWIFT, ACORD, HL7, HIPAA, and EDI are also available, licensed separately.

Entity Extraction and Data Classification on Hadoop. Using a list of keywords or phrases, entities related to your customers and products can be easily extracted and classified from unstructured data such as emails, social media data, and documents. You can enrich master data with insights into customer behavior or product information such as competitive pricing.

Mixed Workflows. Your IT team can easily coordinate, schedule, monitor, and manage all interrelated processes and workflows across your traditional and Hadoop environments to simplify operations and meet your SLAs. You can also drill down into individual Hadoop jobs.

High Availability. This edition provides 24x7 high availability with seamless failover, flexible recovery, and connection resilience. When it comes time to develop new products and services using big data insights, you can rest assured that they will scale and be available 24x7 for mission-critical operations.
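The keyword-driven entity extraction and classification described above can be illustrated with a toy Python sketch. The category names and keyword lists are invented for the example; a production implementation would run this logic at scale on Hadoop.

```python
# Toy keyword-based classifier for unstructured text (hypothetical categories).
KEYWORDS = {
    "churn_risk": ["cancel", "switch provider", "unsubscribe"],
    "pricing": ["discount", "price match", "cheaper"],
}

def classify(text):
    """Return the sorted category labels whose keywords appear in the text."""
    text_l = text.lower()
    return sorted({label
                   for label, phrases in KEYWORDS.items()
                   for p in phrases if p in text_l})

print(classify("I may cancel my plan; a competitor offered a discount."))
# ['churn_risk', 'pricing']
```

Simple substring matching like this is only a starting point; real entity extraction also handles phrase boundaries, misspellings, and context.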
PowerCenter Big Data Edition runs ETL, parsing, data quality, profiling, and NLP natively; with tools such as Talend and Pentaho you must hand-code MapReduce jobs. PowerCenter Big Data Edition reduces big data costs: your IT team can manage twice the data volume with your existing analytics environment, and you can offload data from your warehouse and source systems and offload processing to low-cost commodity hardware.

High-Speed Data Ingestion and Extraction. Load, process, and extract big data across heterogeneous environments to optimize the end-to-end flow of data between Hadoop and traditional data management infrastructure.

Near-Universal Data Access and Comprehensive ETL on Hadoop. Reliably access a variety of data types and sources using a rich library of prebuilt ETL transforms, for both transaction and interaction data, that run on Hadoop or traditional grid infrastructure.
By moving away from hand-coding to proven data integration productivity tools, you triple your productivity—you no longer need an army of developers. This edition provides unified administration for all data integration projects. You can build once and deploy anywhere, which keeps costs down by optimizing data processing utilization across both existing data platforms and emerging technologies like Hadoop.

No-Code Development Environment. Removes hand-coding within Hadoop through a visual development environment. Develop and scale data flows with no specialized hand-coding in order to maximize reuse.

Virtual Data Machine. Build transformation logic once, and deploy at any scale on Hadoop or traditional ETL grid infrastructure.

At a TDWI Big Data Summit last summer, eHarmony presented their Informatica Hadoop implementation. An audience member asked, "How many new resources did you need to hire to implement this on Hadoop?" The Director of IT at eHarmony answered, "None."
Informatica® PowerCenter® Big Data Edition is the safe on-ramp to big data that works with both emerging technologies and traditional data management infrastructure. With this edition, your IT organization can rapidly create innovative products and services by integrating and analyzing new types and sources of data. It provides a proven path of innovation while lowering big data management costs and minimizing risk.

With big data you don't always know what you are looking for. Instead of being given requirements from the business for a report, you are tasked with a business goal, such as increasing customer acquisition and retention or improving fraud detection. With this goal in mind and a wealth of big transaction data, big interaction data, and big data processing technologies, how can you achieve the goal cost-effectively?

Consider an online retailer with several big data projects at various stages of implementation to increase customer acquisition and retention, increase profitability, and improve fraud detection. Since we don't necessarily have a well-defined set of requirements, we need to create a sandbox environment where data science teams can play and experiment with big data. A team of data scientists, analysts, developers, architects, and LOB stakeholders collaborates within the sandbox to discover insights that will achieve the goals of each project. This requires us to access and ingest, in this case, customer transaction information from the ERP and CRM systems, web logs from our online store, social data from Twitter, Facebook, and blogs, and geo-location data from mobile devices.

The data science team goes through an iterative process of accessing, preparing, profiling, and analyzing data sets to discover patterns and insights that could help achieve the business goals of the project. However, what many people fail to acknowledge is that 70-80% of this work is accessing and preparing the data sets for analysis.
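The profiling step in the iterative loop above can be illustrated with a toy column profile. A real profiling tool computes much more (value patterns, domains, rule conformance), and the field names here are invented for the example.

```python
# Toy data profile for one column: row count, nulls, distinct values, top value.
from collections import Counter

def profile_column(rows, column):
    """Summarize a single column of a list-of-dicts dataset."""
    values = [r.get(column) for r in rows]
    non_null = [v for v in values if v not in (None, "")]
    counts = Counter(non_null)
    return {
        "rows": len(values),
        "nulls": len(values) - len(non_null),      # empty strings count as nulls here
        "distinct": len(counts),
        "most_common": counts.most_common(1)[0] if counts else None,
    }

rows = [{"state": "TX"}, {"state": "TX"}, {"state": ""}, {"state": "MO"}]
print(profile_column(rows, "state"))
# {'rows': 4, 'nulls': 1, 'distinct': 2, 'most_common': ('TX', 2)}
```

Even a summary this small surfaces the kinds of data quality issues (unexpected nulls, stray values) that the text says consume 70-80% of a data science project.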
This includes parsing, transforming, and integrating a variety of disparate data sets coming from different platforms, in different formats, and at different latencies. DJ Patil, Data Scientist in Residence at Greylock (a VC firm), stated in his book: "Good data scientists understand, in a deep way, that the heavy lifting of cleanup and preparation isn't something that gets in the way of solving the problem: it is the problem."

The data science team may discover a few insights, which they then need to test, validate, and measure for business impact. They might apply techniques such as A/B testing to determine which algorithms and data flows produce the best results for a stated goal, such as increasing customer share of wallet with next-best-offer recommendations, increasing profitability through pricing optimization, or identifying trends and reducing false positives in fraud detection.

Once organizations overcome the hurdles of accessing, preparing, and integrating data sets to discover these insights, they then face the challenge of operationalizing them.
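The A/B testing mentioned above ultimately comes down to comparing two conversion rates. A minimal two-proportion z-test (standard statistics, not a feature of any particular product; the sample numbers are invented) looks like this:

```python
# Two-proportion z-test: is variant B's conversion rate significantly different
# from variant A's? (Invented sample sizes and conversion counts.)
import math

def two_proportion_z(conv_a, n_a, conv_b, n_b):
    """Return the z-score for the difference in conversion rates B - A."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    p = (conv_a + conv_b) / (n_a + n_b)                 # pooled conversion rate
    se = math.sqrt(p * (1 - p) * (1 / n_a + 1 / n_b))   # standard error of the difference
    return (p_b - p_a) / se

z = two_proportion_z(200, 10_000, 260, 10_000)  # A: 2.0% converts, B: 2.6%
print(abs(z) > 1.96)  # True -> significant at roughly the 95% level
```

In practice teams run many such comparisons across algorithms and data flows, which is why the text stresses being able to rebuild and redeploy flows quickly.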
This is where organizations seem to struggle in turning insights into real business value. To turn an insight into business value, it must be delivered reliably to the point of use, whether that is a report, an enterprise or web app, or part of an automated workflow. For example:
• The fraud department needs to be notified in real time if fraud is suspected, or if there is a spike in a particular region that is seeing an upward trend in fraud.
• Customers shopping on an eCommerce website need to see next best offers in real time as they click through the site.
• The customer service rep needs to know immediately whether a customer is likely to churn when that customer calls or files an online complaint.
• Pricing optimization needs to be delivered directly to the sales rep via a CRM mobile app, based on customer location, purchase history, demographics, and so on.

Too many organizations end up rebuilding and hand-coding the data flows created during design and analysis when it comes time to deploy to production. Informatica is a metadata-driven development environment that provides near-universal data access and hundreds of prebuilt parsers, transformations, and data quality rules; the data flows created during design and analysis can be easily extended and deployed for production use. Another benefit is that datasets and data flows can easily be shared across projects. This helps an organization be agile and rapidly innovate with big data. Note, however, that the datasets used in design and analysis may not have been optimized for a production data flow.
PowerCenter Big Data Edition provides highly scalable, high-throughput data integration to handle all volumes of data processing with high performance. By separating design from deployment, as we have seen, it enables organizations to maintain a consistent and repeatable process; reuse data flows to maximize productivity; scale performance on high-performance distributed computing platforms; ensure 24x7 operations with high availability; stay flexible as data volumes continue to grow and new data sources are added; deliver and analyze data at various latencies; and support and maintain solutions more easily than hand-coding, all while controlling costs and optimizing processing cost-effectively.
Subset production data and mask sensitive data in non-production systems
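The masking idea in the line above can be sketched in a few lines of Python. This is a toy illustration, not how a test data management product implements masking; the salt value and field formats are placeholders, and real tools offer policy-driven, format-preserving masking.

```python
# Toy data-masking helpers for non-production environments (placeholder rules).
import hashlib

def mask_ssn(ssn):
    """Keep only the last four digits of an SSN-formatted string."""
    return "***-**-" + ssn[-4:]

def pseudonymize(value, salt="demo-salt"):
    """Deterministic pseudonym: equal inputs map to equal tokens,
    so joins across masked tables still line up."""
    return hashlib.sha256((salt + value).encode()).hexdigest()[:10]

print(mask_ssn("123-45-6789"))  # ***-**-6789
```

Determinism is the key design choice here: masking must hide the value while preserving referential integrity across the subsetted tables.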
But once you discover these treasures of data, can you trust their origins and the transformations that have been applied? On this island we have found a treasure of valuable data mixed with the hazardous waste of bad data. How can we extract the gold and get rid of the waste? Do you know where the treasure of data came from? Is it authentic? In order to trust the data, you need to know where it came from and what was done to it. It is also unfortunate that data management teams end up recreating, over and over, datasets that have already been normalized, cleansed, and curated, using up a lot of storage and resources.

I'd like to recommend that you commit to data governance to improve your business processes, decisions, and interactions. We talk about managing data as an asset, but what does that really mean? You need a process to effectively govern your data so you can deliver trusted and reliable data. First you need to determine the cost of bad data; for example, the cost of having bad customer addresses or duplicate parts could run to millions. The process starts iteratively with the discovery and definition of data, so that you know what you have in terms of data definitions, domains, relationships, business rules, and so on. For example, one company had people showing up to meetings with different numbers related to claims payments. The problem wasn't in the data; the problem was that the people were from three different departments with three different definitions of the data. The business and IT therefore require a process and tools to efficiently collaborate and to automate steps that continuously improve the quality of data over time. Managing data governance effectively and supporting continuous improvement requires KPI dashboards, proactive monitoring, and a clear ROI.
Enrich master data with customer behavior insights, relationships, and key influencers so you can proactively engage with customers and increase upsell/cross-sell opportunities
• Identify unused and unnecessary data to drive data retention policies
• Assess data usage and performance metrics to focus optimization
• Archived 13 TB of data in the first 2 months and continue to retire data monthly
• Phase 2: Offload data and processing to Hadoop
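The retention-policy step above amounts to flagging data that has not been touched within the retention window. A toy sketch (table names, access dates, and the one-year threshold are all invented for illustration):

```python
# Flag tables whose last access falls outside the retention window.
from datetime import date, timedelta

def retention_candidates(tables, today, max_idle_days=365):
    """Return names of tables not accessed within the retention window."""
    cutoff = today - timedelta(days=max_idle_days)
    return [t["name"] for t in tables if t["last_accessed"] < cutoff]

tables = [{"name": "orders_2009", "last_accessed": date(2011, 1, 5)},
          {"name": "orders_2013", "last_accessed": date(2013, 3, 1)}]
print(retention_candidates(tables, date(2013, 4, 1)))  # ['orders_2009']
```

In practice the usage metrics would come from query logs or the database catalog rather than a hand-built list, but the policy check itself is this simple.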
Why Informatica:
• Ease of use for developers and administrators
• Easy to scale performance, at comparatively lower cost, using PowerCenter grid with commodity hardware
• Could standardize on one data integration platform for all data movement use cases and requirements
• Comprehensive data integration platform for big data processing, batch and real-time data movement, metadata management, profiling, test data management, and protecting sensitive data

The Challenge: The company is growing fast, with data volumes and processing loads increasing and ever-growing demand for data-driven decision making, analytics, and reporting. It was unable to scale legacy systems due to cost as well as time factors (not being plug and play), and needed to standardize on a single platform and vendor to meet its various data-related needs: ETL, metadata, masking, subsetting, and real time.

The Result: Ability to scale easily by adding incremental nodes in comparatively short time periods, and reduced hardware cost due to a commodity hardware stack.

Phase 1: PowerCenter Grid and HA implementation
• Several site-facing OLTP Oracle DBs, several Oracle data marts, and a petabyte-scale Teradata EDW
• Transactional data, behavioral data, and web logs
• Process a few terabytes of incremental data every day through PowerCenter Grid
• Implemented a single-domain, dual-data-center PowerCenter Grid (primary vs. DR); currently active/passive, it will eventually become active/active and expand with further node additions
• Commodity Linux machines with 64 GB of memory and a shared NFS file system mounted across all nodes within a data center
• Multiple Integration Services assigned to the grid, with the repository DB running on a dedicated DB

Grid requirements:
• Highly available data movement/data integration environment
• Ability to scale horizontally without having to extensively re-architect application design
• Ability to automatically load balance
• Ability to recover automatically in case of system errors

Phase 2: Grow the PowerCenter grid to increase processing capacity to meet growing data volumes and reduced processing times.

Current benefits: ability to scale easily by adding incremental nodes in comparatively short time periods, and reduced hardware cost due to the commodity hardware stack.

Future benefits:
• Expect to reduce the time to perform impact/lineage analysis once the metadata solution is implemented
• Expect to reuse profiling information once the profiling solution is implemented
• Expect to perform more comprehensive testing much faster once masking/subsetting is implemented
• Expect to reduce batch loads from 30 minutes to a few seconds for fraud detection once Ultra Messaging is implemented
• Participating in the PowerCenter on Hadoop beta testing program

Today the company uses Perl scripts to process web logs and move results into Teradata. It is currently looking at utilizing Hadoop for various text and log data mining and analysis capabilities, such as risk monitoring, behavior tracking, and various marketing-related activities. The company believes that using Hadoop for low-cost big data analysis and processing alongside the Informatica grid's capability to deliver mission-critical data to its data marts would be complementary, while allowing it to maintain metadata and other operational capabilities within a single integrated platform.
• Continuously collect all data from all cars
• By the end of the year, all cars will transmit data to a central Teradata data warehouse
• Real-time data integration using PowerCenter, CDC, and CEP
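Change data capture (CDC), mentioned above, streams only the rows that changed instead of reloading everything. A naive snapshot-diff illustrates the event types a CDC feed emits; real CDC products read the database transaction log rather than diffing snapshots, and the car-telemetry rows here are invented.

```python
# Naive snapshot-diff CDC: emit insert/update/delete events keyed by row id.
def capture_changes(previous, current):
    """Compare two snapshots (dicts of key -> row) and emit change events."""
    events = []
    for key, row in current.items():
        if key not in previous:
            events.append(("insert", key, row))
        elif previous[key] != row:
            events.append(("update", key, row))
    for key, row in previous.items():
        if key not in current:
            events.append(("delete", key, row))
    return events

before = {1: {"speed": 60}, 2: {"speed": 80}}
after = {1: {"speed": 65}, 3: {"speed": 50}}
print(capture_changes(before, after))
# [('update', 1, {'speed': 65}), ('insert', 3, {'speed': 50}), ('delete', 2, {'speed': 80})]
```

Log-based CDC avoids the cost of building snapshots at all, which is what makes the real-time loading described above feasible at fleet scale.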