Big Data for Startups
Ngon Pham - Lead Engineer & CTO at Lana
Content
● Introduction
● 10 practical problems & solutions
● Q & A
Introduction
● How to use social platforms to
empower your startup
○ Facebook
○ YouTube
○ Foursquare
Problem 1
● How to integrate Facebook user information
into your product
● Solution
○ Offer Facebook sign-in and request the relevant permissions
○ Use the access token to build a user profile for your
product: Bio, Statuses, Events, Groups, Likes…
○ Demo
Problem 2
● How to integrate Facebook into your vertical
eCommerce site for social marketing
● Solution
○ Integrate Facebook sign-in and ask for a token to analyze
the user's bio and likes, then compute a user interest score
(how interested they are in your category)
○ Encourage them to like your page and share your
products based on their interest score
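One way to read the interest score is as the share of a user's page likes that fall into categories relevant to your store. The sketch below assumes that reading. The function names, the like records with a `category` key, and the 0.3 threshold are all hypothetical and not taken from the talk.

```python
def interest_score(likes, target_categories):
    """Fraction of a user's likes whose category matches the store's vertical.

    `likes` is assumed to be a list of dicts with a "category" key,
    already fetched with the user's token. Returns a value in [0, 1].
    """
    if not likes:
        return 0.0
    targets = {c.lower() for c in target_categories}
    hits = sum(
        1 for like in likes
        if (like.get("category") or "").lower() in targets
    )
    return hits / len(likes)


def should_prompt_share(score, threshold=0.3):
    """Hypothetical rule: only nudge users whose score clears the threshold."""
    return score >= threshold
```

A weighted variant (for example, counting recent likes more heavily) would fit the same interface.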
Problem 3
● How to analyze Facebook users and do
targeted marketing even if you don’t know how
to code
● Solution
○ Use http://www.ocommunity.net/
○ Facebook ad targeting by user ID
○ Demo
Problem 4
● How to crawl all Foursquare places for a
specific country
● Solution
○ Use a simple spreading algorithm
○ Demo: Vietnam, Singapore, Indonesia, Cambodia
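The slides do not spell out the spreading algorithm. One plausible reading is a breadth-first expansion over a grid: start from a seed cell inside the country, query each cell for places, and spread to neighbouring cells only while results keep coming back. The sketch below assumes that reading. `search_cell` is a hypothetical callable standing in for whatever venue-search request you use, and the bounding box and step size are illustrative.

```python
from collections import deque


def crawl_places(seed, bbox, step, search_cell):
    """Breadth-first spread over a lat/lng grid, collecting places by ID.

    seed        -- (row, col) grid cell to start from
    bbox        -- (min_lat, min_lng, max_lat, max_lng) of the country
    step        -- cell size in degrees
    search_cell -- hypothetical callable(lat, lng, step) -> iterable of
                   place dicts with an "id" key, for the cell whose
                   south-west corner is (lat, lng)
    """
    min_lat, min_lng, max_lat, max_lng = bbox
    rows = int((max_lat - min_lat) / step) + 1
    cols = int((max_lng - min_lng) / step) + 1

    places = {}
    seen = {seed}
    frontier = deque([seed])
    while frontier:
        row, col = frontier.popleft()
        lat, lng = min_lat + row * step, min_lng + col * step
        found = list(search_cell(lat, lng, step))
        for place in found:
            places[place["id"]] = place  # dedupe by ID
        if not found:
            continue  # empty cell (e.g. open sea): do not spread from it
        # Spread to the four neighbouring cells still inside the box.
        for d_row, d_col in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (row + d_row, col + d_col)
            if 0 <= nxt[0] < rows and 0 <= nxt[1] < cols and nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return places
```

Stopping at empty cells keeps the number of requests down, at the cost of missing places that are isolated from the seed by empty cells. Using several seeds is one way to reduce that risk.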
Problem 5
● How to use Foursquare as a social layer to
empower your review site: Travel, Food...
● Solution
○ Match Foursquare places to your database
(manually or semi-automatically)
○ Continuously pull updates from Foursquare for:
Number of check-ins, Photos, Tips...
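A semi-automatic match could pair each Foursquare place with the closest-named record within a small radius, and send low-confidence pairs to manual review. The sketch below is written under those assumptions. The record layout (`name`, `lat`, `lng`), the 200 m radius, and the 0.85 similarity threshold are hypothetical.

```python
import math
from difflib import SequenceMatcher


def haversine_km(lat1, lng1, lat2, lng2):
    """Great-circle distance between two points in kilometres."""
    r = 6371.0
    p1, p2 = math.radians(lat1), math.radians(lat2)
    d_lat = p2 - p1
    d_lng = math.radians(lng2 - lng1)
    a = (math.sin(d_lat / 2) ** 2
         + math.cos(p1) * math.cos(p2) * math.sin(d_lng / 2) ** 2)
    return 2 * r * math.asin(math.sqrt(a))


def best_match(place, records, max_km=0.2, auto_threshold=0.85):
    """Return (record, similarity, needs_review), or None if nothing is nearby.

    Only records within `max_km` of the place are considered; among those,
    the one with the most similar name wins. Matches below `auto_threshold`
    are flagged for manual review instead of being accepted automatically.
    """
    best = None
    for rec in records:
        dist = haversine_km(place["lat"], place["lng"], rec["lat"], rec["lng"])
        if dist > max_km:
            continue
        sim = SequenceMatcher(
            None, place["name"].lower(), rec["name"].lower()
        ).ratio()
        if best is None or sim > best[1]:
            best = (rec, sim)
    if best is None:
        return None
    return best[0], best[1], best[1] < auto_threshold
```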
Problem 6
● How to crawl all videos in a specific
language on YouTube
● Solution
○ Use a language detection library:
https://code.google.com/p/language-detection/
○ Crawl using the Related Video API provided by YouTube
○ Demo
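The crawl above can be sketched as a breadth-first walk along related-video links that keeps only videos detected as the target language. `fetch_related` and `detect_language` are hypothetical callables, for instance thin wrappers around an API client and a language detection library. Expanding only matching videos is an assumption about how to keep the crawl focused, not something the slides state.

```python
from collections import deque


def crawl_by_language(seeds, target_lang, fetch_related, detect_language,
                      limit=1000):
    """Breadth-first crawl along related-video links, keeping one language.

    fetch_related   -- hypothetical callable(video_id) -> iterable of
                       {"id": ..., "title": ...} dicts
    detect_language -- hypothetical callable(text) -> language code
    Only videos in the target language are expanded further, on the
    assumption that their related videos tend to share the language.
    """
    seen = set(seeds)
    frontier = deque(seeds)
    kept = []
    while frontier:
        video_id = frontier.popleft()
        for video in fetch_related(video_id):
            if video["id"] in seen:
                continue
            seen.add(video["id"])
            if detect_language(video["title"]) == target_lang:
                kept.append(video)
                frontier.append(video["id"])
                if len(kept) == limit:
                    return kept
    return kept
```

Language detection on short titles is unreliable, so in practice the title would likely be combined with the description before detection.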
Problem 7
● How to build a classified video app on top of
YouTube
● Solution
○ Crawl all videos & develop some filtering algorithms
○ Some useful tools for cleaning / sampling data:
Google Refine, R…
○ Demo
Problem 8
● How to scale processes across machines?
● Solution
○ Use a queue server to divide the workload:
https://www.rabbitmq.com/
Problem 9
● How to export processed data for end-users
as a website
● Solution
○ Use WordPress
■ Data model is very simple: ~12 tables
■ Tons of plugins and themes
○ Demo
Problem 10
● How to export processed data for Business
Intelligence / Visualization
● Solution
○ Export processed data to MySQL
○ Use Pentaho / JasperReports to hook in
○ Demo
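The MySQL export step above can be sketched as a bulk insert through a Python DB-API 2.0 connection, after which a BI tool can read the table directly. The `places` table and its columns are hypothetical. The demo at the bottom uses the standard-library `sqlite3` module as a stand-in for a MySQL connection so that it runs without a server. Common MySQL drivers use `%s` as the parameter marker, while `sqlite3` uses `?`.

```python
import sqlite3


def export_places(conn, rows, placeholder="%s"):
    """Bulk-insert processed rows through any DB-API 2.0 connection.

    placeholder -- parameter marker of the driver: "%s" for the common
                   MySQL drivers, "?" for sqlite3 (the stand-in used below).
    The `places` table and its columns are hypothetical.
    """
    cur = conn.cursor()
    cur.execute(
        "CREATE TABLE IF NOT EXISTS places ("
        "id VARCHAR(64) PRIMARY KEY, name VARCHAR(255), checkins INTEGER)"
    )
    marks = ", ".join([placeholder] * 3)
    cur.executemany(
        f"INSERT INTO places (id, name, checkins) VALUES ({marks})",
        [(r["id"], r["name"], r["checkins"]) for r in rows],
    )
    conn.commit()


if __name__ == "__main__":
    # Stand-in demo: an in-memory SQLite database instead of MySQL.
    demo = sqlite3.connect(":memory:")
    export_places(
        demo,
        [{"id": "a", "name": "Cafe", "checkins": 12}],
        placeholder="?",
    )
    print(demo.execute("SELECT * FROM places").fetchall())
```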
Q & A
