SlideShare ist ein Scribd-Unternehmen logo
1 von 27
Downloaden Sie, um offline zu lesen
Scaling	
  MongoDB	
  with	
  
  Sharding	
  –	
  A	
  Case	
  Study	
  



  Presented	
  by	
  
  Yash	
  Badiani	
  and	
  Rahul	
  Nair	
  
  	
  
  	
  
  	
  
  	
  

CIGNEX	
  Datamatics	
  Con1idential	
          www.cignex.com	
  
About	
  CIGNEX	
  Datamatics	
  



                                A	
  subsidiary	
  of	
  Datamatics	
  Global	
  Services	
  
                                                          Limited	
  
                                                             	
  




CIGNEX	
  Datamatics	
  Con1idential	
                  www.cignex.com	
                        2	
  
Introduction	
  of	
  Datamatics	
  (DGSL)	
  
     •  Mission	
                                                                 Strategic	
  Alliances	
  
               –  Experts	
  in	
  improving	
  
                  Enterprise	
  productivity	
  	
  
                  through	
  	
  Process	
  Engineering	
  &	
  	
  
                  Information	
  Management	
  
                  Solutions	
  

     •  Key	
  Highlights	
  
               –  Founded	
  in	
  1975	
  

               –  Publicly	
  listed	
  in	
  India	
  

               –  Annual	
  consolidated	
  revenue	
  of	
  
                  US$100	
  Million	
  

               –  Fortune	
  500	
  clients	
  

               –  4,400+	
  employees	
  across	
  22	
  
                  of1ices	
  in	
  9	
  countries	
  


CIGNEX	
  Datamatics	
  Con1idential	
                       www.cignex.com	
                                  3	
  
What	
  Does	
  CIGNEX	
  Datamatics	
  Do?	
  

Since	
  2000,	
  making	
  Open	
  Source	
  work	
  for	
  
the	
  enterprise	
  through	
  adoption	
  and	
  
integration	
  to:	
                                                          Portal	
  
                                                                              Solutions	
                       Content	
  	
  
          •  Address	
  business	
  goals	
                                                                     Solutions	
  

          •  Increase	
  business	
  velocity	
  

          •  Lower	
  the	
  cost	
  of	
  doing	
  business	
  

          •  Reduce	
  TCO	
  
                                                                                              Big	
  Data	
  
          •  Gain	
  competitive	
  advantage	
                                               Solutions	
  




                                      400+	
  implementations	
  worldwide	
  across	
  industries	
  

CIGNEX	
  Datamatics	
  Con1idential	
                         www.cignex.com	
                                                   4	
  
Where	
  We	
  Can	
  Help	
  You	
  
            	
  SOLUTIONS	
  
                                                                                                                                    •      Intranet	
  	
          •  S o c i a l	
  
                                                  Portals	
                                  Liferay,	
  Drupal,	
  JBoss,	
        • 
                                                                                                                                    • 
                                                                                                                                           Extranet	
  
                                                                                                                                           EAI	
  
                                                                                                                                                                        Collabora>on	
  
                                                                                                                                                                   •  Mobile	
  Portals	
  
                                                  User	
  eXperience	
  	
                           ZK,	
  HTML5,	
  
                                                                                                                                    •      SOA	
                   	
  
                                                  PlaRorm	
                                           MuleSoW	
  



                                                                                              Alfresco,	
  Adobe	
  CQ,	
  	
       •      WCM	
  
                                                  Content	
                                     Drupal,	
  Magento,	
  	
  
                                                                                                                                    •      DM	
                   •  E-­‐Commerce	
  
                                                  Enterprise	
  Content	
                                                           •      RM	
                   •  E-­‐learning	
  
                                                                                           	
  JBoss,	
  Moodle,	
  EphesoW,	
      •      CMS	
                  •  ERP	
  
                                                  Management	
  
                                                                                                                                    •      DAM	
                  •  Imaging	
  
                                                                                                        Liferay	
  
                                                                                                                                    	
                            	
  	
  	
  	
  Solu>ons	
  
                                                                                                             	
  
                                                                                   	
                                               •  Analy>cs	
                •  DW	
  -­‐	
  BI	
  
                                                                                          Hadoop,	
  	
  MongoDB,	
  Neo4j,	
  
                                                  Big	
  Data	
                                                                            • 
                                                                                                                                           • 
                                                                                                                                                Mobile	
  
                                                                                                                                                Social	
  
                                                                                                                                                                 •  Log	
  Processing	
  
                                                                                                    Flume,	
  Hive	
  	
                                            and	
  Analysis	
  	
  
                                                  Making	
  Data	
  Work	
                                                                 •    Web	
            •  Enterprise	
  
                                                                                           Solr,	
  	
  Pentaho,	
  JaspersoW	
            •    Real-­‐>me	
        Search	
  
                                                                                                                                    	
  
            	
  SERVICES	
  
                           UI,	
  	
  Development	
  ,	
  Integra>on,	
  	
  Customiza>on,	
  	
  Migra>on	
  ,	
  Tes>ng,	
  	
  	
  Training	
  ,	
  	
  Support	
  (24*7)	
  

                                                                     Managed	
  Cloud	
  Services	
  -­‐	
  Develop,	
  Deploy,	
  Manage	
  
                                      VAR/Annual	
  Product	
  Subscrip>on	
  -­‐	
  Liferay,	
  Alfresco,	
  Cloudera	
  Hadoop,	
  MongoDB	
  	
  
                                                                    Extended	
  Development	
  Center	
  –	
  Center	
  of	
  Excellence	
  	
  



CIGNEX	
  Datamatics	
  Con1idential	
                                                       www.cignex.com	
                                                                                    5	
  
About	
  the	
  Presenters	
  
     •  Yash	
  Badiani	
  is	
  the	
  Big	
  Data	
  Practice	
  Lead	
  at	
  CIGNEX	
  Datamatics	
  and	
  
              focuses	
  on	
  Big	
  Data	
  Technologies	
  including	
  MongoDB	
  &	
  Hadoop.	
  He	
  
              has	
  worked	
  extensively	
  on	
  large	
  Data	
  warehousing	
  &	
  Business	
  
              Intelligence	
  projects	
  with	
  tools	
  such	
  as	
  Business	
  Objects,	
  Microsoft	
  SQL	
  
              Server,	
  Microstrategy,	
  IBM	
  Cognos.	
  	
  
     	
  	
  
     •  Gaurav	
  Khambhala	
  works	
  at	
  CIGNEX	
  Datamatics	
  as	
  Technical	
  Lead.	
  
              He	
  is	
  the	
  senior	
  member	
  of	
  the	
  PHP	
  Practice	
  at	
  CIGNEX	
  Datamatics	
  and	
  
              is	
  involved	
  on	
  various	
  technology	
  initiatives	
  like	
  Big	
  Data	
  where	
  he	
  
              focuses	
  on	
  integration	
  of	
  PHP	
  with	
  NoSQL	
  sources	
  like	
  MongoDB.	
  He	
  
              has	
  a	
  wide	
  industry	
  experience	
  in	
  software	
  development	
  &	
  
              management	
  in	
  Open	
  Source	
  technologies	
  such	
  as	
  Drupal	
  &	
  Moodle	
  




CIGNEX	
  Datamatics	
  Con1idential	
                 www.cignex.com	
                                                      6	
  
Agenda	
  

       •      CIGNEX	
  Datamatics	
  –	
  Introduction	
  &	
  Offerings	
  
       •      Use	
  Case	
  &	
  Database	
  Requirements	
  
       •      Challenges	
  with	
  Traditional	
  Databases	
  
       •      Why	
  MongoDB?	
  
       •      Solution	
  	
  
                –  Approach	
  
                –  Architecture	
  and	
  Hardware	
  Sizing	
  
       •  Scaling	
  with	
  Sharding	
  
                –  Sharding	
  Basics	
  
                –  Sharding	
  –	
  Choosing	
  the	
  RIGHT	
  Shard	
  Key	
  
                –  Benchmarking	
  with	
  Results	
  
       •  Key	
  Takeaways	
  	
  


CIGNEX	
  Datamatics	
  Con1idential	
                 www.cignex.com	
            7	
  
Big	
  Data	
  Practice	
  At	
  CIGNEX	
  Datamatics	
  
             Brief	
  Snapshot	
  
          •  ~40	
  employee	
  Big	
  Data	
  Practice	
                     Technology	
  Partnership	
  
             focused	
  on	
  Hadoop,	
  MongoDB,	
  Neo4j,	
  
             Solr	
  

          •  Professionals	
  formally	
  trained	
  /	
  
             certi1ied	
  from	
  Cloudera	
  and	
  10gen	
  

          •  Expertize	
  in	
  Hadoop	
  Eco-­‐System	
  
             (HBase,	
  Pig,	
  Hive,	
  Flume,	
  Sqoop,	
  
             Oozie,	
  Zookeeper)	
  

          •  Strong	
  partnerships:	
  
               •  System	
  Integration	
  partners	
  
                  with	
  Cloudera	
  for	
  CDH	
  
               •  Global	
  partner	
  with	
  10gen	
  for	
  
                  MongoDB	
  –	
  multiple	
  webinars	
  
                  on	
  different	
  solutions	
  



CIGNEX	
  Datamatics	
  Con1idential	
                   www.cignex.com	
                                     8	
  
Our	
  Offerings	
  –	
  Big	
  Data	
  



                                                                                      Support	
  &	
  
            Consulting	
                    Implementation	
  
                                                                                       Training	
  



    Consulting	
                           Implementation	
                    Support	
  &	
  Training	
  
    •  Business	
  Analysis	
  	
          •  UI	
  Development	
              •  DBA	
  Support	
  
    •  Technology	
  Evaluation	
          •  Application	
  Integration	
     •  Application	
  Support	
  
    •  Architecture	
  	
                  •  Customization	
                  •  Enhancements	
  
    •  Design	
  Framework	
               •  Migration	
                      •  24*7	
  Production	
  
    •  Cluster	
  sizing	
                 •  Testing	
                           Support(Tier	
  1/2/3)	
  
    •  Deployment	
  planning	
            •  Performance	
  Tuning	
          •  Trainings	
  
    •  Proof-­‐of-­‐Concept	
  
    •  Health	
  Check	
  
    •  Performance	
  
       Benchmarking	
  




CIGNEX	
  Datamatics	
  Con1idential	
                 www.cignex.com	
                                        9	
  
Use	
  Case	
  

        	
  
                                                                                                                           Load	
  
                                Users	
                                Devices	
                                                                                                 Database	
  
                                                                                                                          Balancer	
  



        	
  




                                                                                                                                                    Data	
  Storage	
  
                                                                                                     App.	
  Layer	
  
      End	
  Users	
  




                                                     Devices	
  




                         7	
  Million	
  Users	
                   8	
  devices	
  /	
  user	
                           Load	
  Balancer	
                               mongoDB	
  cluster	
  
                         Spread	
  Across	
                        Home/OfMice/                                          Receives	
  	
  high	
                           Sharding	
  
                         Geography	
                               Anywhere	
                                            volume	
  of	
                                   Replication	
  with	
  
                                                                                                                         concurrent	
  CRUD	
                             Automatic	
  
                                                                                                                         requests	
                                       Failover	
  
                                                                                                                         Routes	
  request	
                              Indexes	
  
                                                                                                                         trafMic	
  to	
  DB	
  
                                                                                                                         cluster	
                                        	
  




CIGNEX	
  Datamatics	
  Con1idential	
                                                     www.cignex.com	
                                                                                         10	
  
Database	
  Requirements	
  


           Flexibility	
  	
                                                High	
  
           in	
  Schema	
                                               Performance	
  



                                                Agility	
  in	
  	
  
                                           Development	
  	
  
                                           &	
  Deployment	
  



          Availability	
                                                 Enterprise	
  	
  
                                                                        Level	
  Support	
  




CIGNEX	
  Datamatics	
  Con1idential	
       www.cignex.com	
                                  11	
  
Limitations	
  of	
  RDBMS	
  




       Support	
  limited	
  to	
               Manage	
  only	
  Structured	
                RDBMS	
  doesn’t	
  scale	
                Feature	
  rich	
  but	
  slow	
  
          	
  terabytes	
                                  Data	
                                inherently	
                                performance	
  
                  	
  




                                                                                                                   $	
  



  Complex	
  to	
  Shard/Partition	
             Limitations	
  in	
  scaling	
  High	
           Specialized	
  Hardware	
  -­‐	
     Vertical	
  Scaling	
  expensive	
  
 due	
  to	
  maintenance	
  of	
  schema	
     volume	
  of	
  concurrent	
  CRUD	
                    Expensive	
                      and	
  dif1icult	
  to	
  scale	
  




                      RDBMS	
  can’t	
  manage	
  all	
  dimensions	
  	
  of	
  data	
  with	
  speed	
  &	
  at	
  lower	
  cost.	
  


CIGNEX	
  Datamatics	
  Con1idential	
                                       www.cignex.com	
                                                                                 12
Why	
  MongoDB?	
  

        	
  Flexibility	
  	
                                                                                           High	
  	
  
         in	
  Schema	
                                                                                             Performance	
  


            •  Easy	
  integration	
                                                                                      •  Concurrent	
  CRUD	
  	
  
            •  Ease	
  of	
  schema	
  	
                                                                                 •  Fast	
  Updates	
  
                                                               Agility	
  in	
  	
  
            	
  	
  	
  	
  	
  	
  	
  design	
                                                                          •  Write	
  distribution	
  	
  
                                                          Development	
  	
  
            •  Document	
  oriented	
  	
                                                                                 	
  	
  	
  	
  	
  	
  	
  with	
  Sharding	
  
                                                      	
  &	
  Deployment	
  
            	
  	
  	
  	
  	
  	
  	
  storage	
  

                   Schema	
  free	
                           •  Programming	
  	
                                       Indexes	
  &	
  Sharding	
  
                                                              	
  	
  	
  	
  	
  	
  	
  Language	
  drivers	
  
                                                              •  Shorter	
  Dev	
  cycle	
  
                                                              •  Faster	
  deployment	
  
                                                                                                                    Enterprise	
  
       Availability	
                                                                                                 Level	
  
                                                                                                                     Support	
  
                                                                      Driver	
  Support	
  
            •      Automatic	
  failover	
                                                                             •  Global	
  Coverage	
  
            •      Redundancy	
                                                                                        •  24x7	
  Support	
  
            •      100%	
  uptime	
                                                                                    •  Ease	
  of	
  	
  
            	
                                                                                                         	
  	
  	
  	
  	
  	
  	
  maintenance	
  


                     Replication	
                                                                                       Strong	
  Community	
  

CIGNEX	
  Datamatics	
  Con1idential	
                                     www.cignex.com	
                                                                                  13	
  
Solution:	
  Approach	
  

                	
         Schema	
  
                                              • Schema	
  Design	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  
                                              • Collections	
  and	
  Field	
  De1initions	
  


                                              • Document	
  Size	
  
                       Database	
  Size	
     • Total	
  expected	
  data	
  size	
  


                                              • Frequency	
  of	
  CRUD	
  operations	
  
       	
   Concurrent	
  Load	
              • Read/Write	
  ratio	
  


                                              • Automatic	
  Failover	
  
                        Availability	
        • Replication	
  and	
  Backup	
  


                                              • Working	
  Set	
  
                          Indexing	
          • Access	
  Patterns	
  


                                              • Horizontal	
  Scaling	
  
                          Sharding	
          • Query	
  Performance	
  


                                              • Cluster	
  sizing	
  
               Hardware	
  Sizing	
           • RAM	
  and	
  Disk	
  storage	
  



CIGNEX	
  Datamatics	
  Con1idential	
                                                          www.cignex.com	
                                                                                           14	
  
Solution:	
  Architecture	
  
                                                            Con1ig	
  Servers	
                                                             Shard	
  1	
  




                                            mongos	
  
                                                                                                                           mongod	
  




                          Server	
  
                           App	
  
                                                                                                                           Primary	
         Mongod	
  
                                                         mongod	
            mongod	
                                          	
            Arbiter	
  
                                                                                                                           mongod	
  
                                                                                                                          Secondary	
  




                                            mongos	
  
                          Server	
                               mongod	
                                                      	
  
                           App	
  

                                                                                                                              	
            Shard	
  2	
  
                                                                                                                           mongod	
  
                                                                                                                           Primary	
         Mongod	
  
                                            mongos	
  
                          Server	
  



                                                                                                                              	
             Arbiter	
  
                           App	
  
         Balancer	
  




                                                                                                       Data	
  Tier	
  
          Load	
  	
  




                                                                                                                           mongod	
  
                                                                                                                          Secondary	
  
                                                                      Routed	
  Requests	
  from	
  
                                                                        mongos	
  to	
  shards	
  
                                            mongos	
  
                          Server	
  




                                                                                                                              	
  
                           App	
  




                                                                                                                                            Shard	
  3	
  
                                                                                                                           mongod	
  
                                                                                                                           Primary	
         Mongod	
  
                                                                                                                              	
             Arbiter	
  
                                            mongos	
  




                                                                                                                           mongod	
  
                          Server	
  
                           App	
  




                                                                                                                          Secondary	
  


                                                                                                                                            Shard	
  4	
  
                                                                                                                          mongod	
  
                                            mongos	
  
                          Server	
  




                                                                                                                          Primary	
  
                           App	
  	
  




                                                                                                                                             Mongod	
  
                                                                                                                                             Arbiter	
  
                                                                                                                           mongod	
  
                                                                                                                          Secondary	
  
                                App	
  Tier	
                              Routed	
  for	
  non-­‐
                                                                         sharded	
  collections	
                                         Replica	
  Set	
  
                                                                                                                          mongod	
  
                                                                                                                          Primary	
          Mongod	
  
                                                                                                                                             Arbiter	
  
                                                                                                                           mongod	
  
                                                                                                                          Secondary	
  

CIGNEX	
  Datamatics	
  Con1idential	
                           www.cignex.com	
                                                                              15	
  
Sharding	
  –	
  What	
  is	
  it?	
  

       •  Distributes	
  single	
  logical	
  database	
  system	
  across	
  clusters	
  

       •  Allows	
  to	
  partition	
  a	
  collection	
  across	
  #	
  of	
  mongod	
  
              instances(shards)	
  

       •  Advantages:	
  
                –  Increases	
  write	
  capacity	
  

                –  Ability	
  to	
  support	
  larger	
  working	
  sets	
  

                –  Raises	
  limits	
  of	
  data	
  size	
  beyond	
  a	
  single	
  node	
  

       	
  



CIGNEX	
  Datamatics	
  Con1idential	
                     www.cignex.com	
                      16	
  
Sharding	
  -­‐	
  Features	
  

       •  Range-­‐based	
  Data	
  Partitioning	
  

       •  Automatic	
  Data	
  volume	
  distribution	
  

       •  Transparent	
  query	
  routing	
  

       •  Horizontal	
  capacity	
  
                –  Additional	
  write	
  capacity	
  through	
  distribution	
  

                –  Right	
  shard	
  key	
  allows	
  expansion	
  of	
  working	
  set	
  


       	
  




CIGNEX	
  Datamatics	
  Con1idential	
                  www.cignex.com	
                      17	
  
Sharding	
  –	
  When	
  to	
  use?	
  


                                                             Your	
  data	
  set	
  approaches	
  or	
  exceeds	
  the	
  storage	
  
                                                             capacity	
  of	
  a	
  single	
  node	
  in	
  your	
  system	
  
     Storage	
  
      Drive	
  




    The	
  size	
  of	
  your	
  system’s	
  active	
  working	
  set	
  will	
  soon	
  
    exceed	
  the	
  capacity	
  of	
  the	
  maximum	
  amount	
  of	
  RAM	
  
    for	
  your	
  system	
  
                                                                                                RAM	
  
                                                                                                                                  Working	
  Set	
  




                                                                                  Your	
  system	
  has	
  a	
  large	
  amount	
  of	
  write	
  
                                                                                  activity,	
  a	
  single	
  MongoDB	
  instance	
  cannot	
  
       Storage	
                                                                  write	
  data	
  fast	
  enough	
  to	
  meet	
  demand,	
  and	
  all	
  
        Drive	
                                                                   other	
  approaches	
  have	
  not	
  reduced	
  contention	
  	
  	
  

CIGNEX	
  Datamatics	
  Con1idential	
                              www.cignex.com	
                                                                    18	
  
Shard	
  Keys	
  

     Shard	
  Keys:	
                                                •  	
  	
  The	
  ideal	
  shard	
  key	
  :	
  
     Exist	
  in	
  every	
  document	
  in	
  a	
  
     collection	
  that	
  MongoDB	
  uses	
  to	
                              –  Easily	
  divisible	
  which	
  makes	
  it	
  
     distribute	
  documents	
  among	
  the	
  
     shards	
  like	
  indexes,	
  they	
  can	
  be	
                              easy	
  for	
  MongoDB	
  to	
  distribute	
  
     either	
  a	
  single	
  1ield,	
  or	
  a	
  
     compound	
  key	
                                                              content	
  among	
  the	
  shards	
  

                                                                                –  Higher	
  “randomness”	
  

                                                                                –  Targeted	
  queries	
  

                                                                                –  May	
  need	
  to	
  be	
  computed	
  




CIGNEX	
  Datamatics	
  Con1idential	
                     www.cignex.com	
                                                          19	
  
Choosing	
  Right	
  Shard	
  Key	
  

       Different	
  approach	
  for	
  Shard	
  Keys	
  	
  

       •  Approach	
  1:	
  Random	
  Key	
  	
  –	
  UserId	
  

       •  Approach	
  2:	
  Coarsely	
  ascending	
  key	
  +	
  Random	
  Key	
  –	
  	
  
              YearMonth	
  +	
  UserId	
  




       	
  


CIGNEX	
  Datamatics	
  Con1idential	
        www.cignex.com	
                                20	
  
Benchmarking	
  /	
  Load	
  Testing	
  Approach	
  
 Automated	
  scripts	
  with	
  varied	
  load	
  	
  




 	
  




CIGNEX	
  Datamatics	
  Con1idential	
     www.cignex.com	
     21	
  
Results	
  -­‐	
  INSERTS	
  

                                                                                           Approach	
  1	
  
                                                                                      Over	
  80	
  million	
  documents	
  inserted	
  
                                                                                      with	
  a	
  decreasing	
  threshold	
  over	
  10	
  
                                                                                      million	
  




                                                                                           Approach	
  2	
  
                                                                                      Over	
  225	
  million	
  documents	
  inserted	
  at	
  
                                                                                      a	
  stable	
  rate	
  of	
  6000	
  documents/sec	
  




Benchmarks	
  done	
  on	
  8GB	
  Test	
  H/W	
  Machines	
  

CIGNEX	
  Datamatics	
  Con1idential	
                           www.cignex.com	
                                                              22	
  
Results	
  -­‐	
  UPDATES	
  

                                                                                          Approach	
  1	
  
                                                                                      Over	
  50	
  million	
  documents	
  updated	
  at	
  
                                                                                      avg.	
  400	
  documents/sec	
  




                                                                                          Approach	
  2	
  
                                                                                      Over	
  100	
  million	
  documents	
  updated	
  at	
  
                                                                                      as	
  high	
  as.	
  4000	
  documents/sec	
  




Benchmarks	
  done	
  on	
  8GB	
  Test	
  H/W	
  Machines	
  

CIGNEX	
  Datamatics	
  Con1idential	
                           www.cignex.com	
                                                        23	
  
Results	
  –	
  INSERT,	
  UPDATE	
  


                                                                                      Approach	
  2	
  
                                                                                      Simultaneous	
  INSERT	
  
                                                                                      >6000	
  documents/	
  second	
  
                                                                                      >70	
  million	
  records	
  




                                                                                      Simultaneous	
  UPDATE	
  
                                                                                      >6000	
  documents/	
  second	
  
                                                                                      >50	
  million	
  records	
  




Benchmarks	
  done	
  on	
  8GB	
  Test	
  H/W	
  Machines	
  

CIGNEX	
  Datamatics	
  Con1idential	
                           www.cignex.com	
                                         24	
  
Benchmarking	
  –	
  Sharding	
  Vs	
  Non	
  Sharding	
  


     Operation	
                                  Sharding	
  (YearMonth	
  +	
              Non-­‐Sharding	
  
                                                  UserId)	
  
     INSERTS	
                                    ~6000	
  docs/sec	
                        ~2900	
  docs/sec	
  
     UPDATES	
                                    ~4000	
  docs/sec	
                        ~620	
  updates/sec	
  
     INSERT	
  &	
                                ~6000	
  docs/sec	
  &	
                   ~2000	
  docs/sec	
  &	
  
     UPDATES	
                                    ~6100	
  docs/sec	
                        ~600	
  docs/sec	
  




Benchmarks	
  done	
  on	
  8GB	
  Test	
  H/W	
  Machines	
  

CIGNEX	
  Datamatics	
  Con1idential	
                                  www.cignex.com	
                                  25	
  
Key	
  Takeaways	
  

       •  Comprehensive	
  approach	
  on	
  Performance	
  Tuning	
  

       •  Plan	
  Early	
  for	
  Performance	
  

       •  MongoDB	
  scales	
  &	
  shines	
  

       •  Sharding	
  scales	
  INSERTS/UPDATES	
  vs.	
  Non	
  sharding	
  

       •  Sharding	
  with	
  Approach	
  2	
  (Coarsely	
  ascending	
  Key	
  +	
  Random	
  
              Key)	
  provides	
  sustained	
  results	
  &	
  better	
  utilization	
  of	
  the	
  RAM	
  	
  

       •  Different	
  set	
  of	
  server/s	
  for	
  NON-­‐Sharded	
  collections	
  

       •  Indexes	
  to	
  be	
  de1ined	
  carefully	
  

       •  Sharded	
  collections	
  to	
  have	
  minimal	
  number	
  of	
  indexes	
  


CIGNEX	
  Datamatics	
  Con1idential	
                www.cignex.com	
                                             26	
  
Thank	
  You.	
  Any	
  Questions	
  ?	
  

                                             Making	
  Open	
  Source	
  Work	
  	
  
                                      For	
  queries	
  reach	
  out	
  to	
  us	
  at	
  info@cignex.com	
  
       	
  
       	
  
       	
  
       	
  




CIGNEX	
  Datamatics	
  Con1idential	
                          www.cignex.com	
  

Weitere ähnliche Inhalte

Was ist angesagt?

Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...
Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...
Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...Emakina
 
Premium Website Hosting
Premium Website HostingPremium Website Hosting
Premium Website Hostingwebhostingguy
 
Internship Experience
Internship Experience Internship Experience
Internship Experience Amit Chaudhari
 
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase
 
Increasing Accuracy and Efficiency Through Seamless Integration
Increasing Accuracy and Efficiency Through Seamless IntegrationIncreasing Accuracy and Efficiency Through Seamless Integration
Increasing Accuracy and Efficiency Through Seamless IntegrationSAP Ariba
 
Scaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data DistributionScaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data DistributionScaleBase
 
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]Rhapsody Technologies, Inc.
 
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOutScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOutScaleBase
 
Coveo Search - Product Overview
Coveo Search - Product OverviewCoveo Search - Product Overview
Coveo Search - Product OverviewAmplexor
 
Deployment Planning for Success - #SPSBend
Deployment Planning for Success - #SPSBendDeployment Planning for Success - #SPSBend
Deployment Planning for Success - #SPSBendDavid Samoranski
 
E-commerce Technology for Safe money transaction over the net
E-commerce Technology for Safe money transaction over the netE-commerce Technology for Safe money transaction over the net
E-commerce Technology for Safe money transaction over the netRaman K. Attri
 
SunCorp Campaign Measurement
SunCorp Campaign MeasurementSunCorp Campaign Measurement
SunCorp Campaign MeasurementDatalicious
 
Etendez votre datacenter avec aws v4
Etendez votre datacenter avec aws v4Etendez votre datacenter avec aws v4
Etendez votre datacenter avec aws v4Amazon Web Services
 
Creating an RAD Authoratative Data Environment
Creating an RAD Authoratative Data EnvironmentCreating an RAD Authoratative Data Environment
Creating an RAD Authoratative Data Environmentanicewick
 
SharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the FieldSharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the FieldChris McNulty
 
Australia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance SolutionsAustralia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance SolutionsChris McNulty
 
Increase Agility & ROI: BPM in Business Support Systems
Increase Agility & ROI: BPM in Business Support SystemsIncrease Agility & ROI: BPM in Business Support Systems
Increase Agility & ROI: BPM in Business Support SystemsSrikanth Minnam
 
Rajesh Srinivasan As Director @ Bsa Corporation Wiring Harness Division
Rajesh Srinivasan As Director @ Bsa Corporation Wiring Harness DivisionRajesh Srinivasan As Director @ Bsa Corporation Wiring Harness Division
Rajesh Srinivasan As Director @ Bsa Corporation Wiring Harness Divisionsrinivasanrajesh
 
Using BrightWork for Project Management with SharePoint 2010 - from Atidan
Using BrightWork for Project Management with SharePoint 2010 - from AtidanUsing BrightWork for Project Management with SharePoint 2010 - from Atidan
Using BrightWork for Project Management with SharePoint 2010 - from AtidanDavid J Rosenthal
 

Was ist angesagt? (20)

Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...
Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...
Emakina Academy 4 - AJAX, Flash & Rich Internet Applications: harnessing the ...
 
Premium Website Hosting
Premium Website HostingPremium Website Hosting
Premium Website Hosting
 
Internship Experience
Internship Experience Internship Experience
Internship Experience
 
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL DatabaseScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
ScaleBase Webinar: Methods and Challenges to Scale Out a MySQL Database
 
Increasing Accuracy and Efficiency Through Seamless Integration
Increasing Accuracy and Efficiency Through Seamless IntegrationIncreasing Accuracy and Efficiency Through Seamless Integration
Increasing Accuracy and Efficiency Through Seamless Integration
 
Scaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data DistributionScaling MySQL: Benefits of Automatic Data Distribution
Scaling MySQL: Benefits of Automatic Data Distribution
 
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
CDM SIG: Fusion MDM for Customer Highlights [2010 OAUG Collaborate]
 
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOutScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
ScaleBase Webinar 8.16: ScaleUp vs. ScaleOut
 
Coveo Search - Product Overview
Coveo Search - Product OverviewCoveo Search - Product Overview
Coveo Search - Product Overview
 
Msp introduction short
Msp introduction shortMsp introduction short
Msp introduction short
 
Deployment Planning for Success - #SPSBend
Deployment Planning for Success - #SPSBendDeployment Planning for Success - #SPSBend
Deployment Planning for Success - #SPSBend
 
E-commerce Technology for Safe money transaction over the net
E-commerce Technology for Safe money transaction over the netE-commerce Technology for Safe money transaction over the net
E-commerce Technology for Safe money transaction over the net
 
SunCorp Campaign Measurement
SunCorp Campaign MeasurementSunCorp Campaign Measurement
SunCorp Campaign Measurement
 
Etendez votre datacenter avec aws v4
Etendez votre datacenter avec aws v4Etendez votre datacenter avec aws v4
Etendez votre datacenter avec aws v4
 
Creating an RAD Authoratative Data Environment
Creating an RAD Authoratative Data EnvironmentCreating an RAD Authoratative Data Environment
Creating an RAD Authoratative Data Environment
 
SharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the FieldSharePoint Performance - Tales from the Field
SharePoint Performance - Tales from the Field
 
Australia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance SolutionsAustralia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance Solutions
 
Increase Agility & ROI: BPM in Business Support Systems
Increase Agility & ROI: BPM in Business Support SystemsIncrease Agility & ROI: BPM in Business Support Systems
Increase Agility & ROI: BPM in Business Support Systems
 
Rajesh Srinivasan As Director @ Bsa Corporation Wiring Harness Division
Rajesh Srinivasan As Director @ Bsa Corporation Wiring Harness DivisionRajesh Srinivasan As Director @ Bsa Corporation Wiring Harness Division
Rajesh Srinivasan As Director @ Bsa Corporation Wiring Harness Division
 
Using BrightWork for Project Management with SharePoint 2010 - from Atidan
Using BrightWork for Project Management with SharePoint 2010 - from AtidanUsing BrightWork for Project Management with SharePoint 2010 - from Atidan
Using BrightWork for Project Management with SharePoint 2010 - from Atidan
 

Andere mochten auch

An Integrated Solution Approach
An Integrated Solution ApproachAn Integrated Solution Approach
An Integrated Solution ApproachCees W.M. Nieboer
 
App Sharding to Autosharding at Sailthru
App Sharding to Autosharding at SailthruApp Sharding to Autosharding at Sailthru
App Sharding to Autosharding at SailthruMongoDB
 
Mongo Sharding: Case Study
Mongo Sharding: Case StudyMongo Sharding: Case Study
Mongo Sharding: Case StudyWill Button
 
Масштабирование баз данных
Масштабирование баз данныхМасштабирование баз данных
Масштабирование баз данныхSQALab
 
Webinar: Building Your First App with MongoDB and Java
Webinar: Building Your First App with MongoDB and JavaWebinar: Building Your First App with MongoDB and Java
Webinar: Building Your First App with MongoDB and JavaMongoDB
 
NoSQL into E-Commerce: lessons learned
NoSQL into E-Commerce: lessons learnedNoSQL into E-Commerce: lessons learned
NoSQL into E-Commerce: lessons learnedLa FeWeb
 
Hardware Provisioning
Hardware Provisioning Hardware Provisioning
Hardware Provisioning MongoDB
 
Synchronise your data between MySQL and MongoDB
Synchronise your data between MySQL and MongoDBSynchronise your data between MySQL and MongoDB
Synchronise your data between MySQL and MongoDBGiuseppe Maxia
 
MongoDB, E-commerce and Transactions
MongoDB, E-commerce and TransactionsMongoDB, E-commerce and Transactions
MongoDB, E-commerce and TransactionsSteven Francia
 

Andere mochten auch (9)

An Integrated Solution Approach
An Integrated Solution ApproachAn Integrated Solution Approach
An Integrated Solution Approach
 
App Sharding to Autosharding at Sailthru
App Sharding to Autosharding at SailthruApp Sharding to Autosharding at Sailthru
App Sharding to Autosharding at Sailthru
 
Mongo Sharding: Case Study
Mongo Sharding: Case StudyMongo Sharding: Case Study
Mongo Sharding: Case Study
 
Масштабирование баз данных
Масштабирование баз данныхМасштабирование баз данных
Масштабирование баз данных
 
Webinar: Building Your First App with MongoDB and Java
Webinar: Building Your First App with MongoDB and JavaWebinar: Building Your First App with MongoDB and Java
Webinar: Building Your First App with MongoDB and Java
 
NoSQL into E-Commerce: lessons learned
NoSQL into E-Commerce: lessons learnedNoSQL into E-Commerce: lessons learned
NoSQL into E-Commerce: lessons learned
 
Hardware Provisioning
Hardware Provisioning Hardware Provisioning
Hardware Provisioning
 
Synchronise your data between MySQL and MongoDB
Synchronise your data between MySQL and MongoDBSynchronise your data between MySQL and MongoDB
Synchronise your data between MySQL and MongoDB
 
MongoDB, E-commerce and Transactions
MongoDB, E-commerce and TransactionsMongoDB, E-commerce and Transactions
MongoDB, E-commerce and Transactions
 

Ähnlich wie Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics

Realize Your Potential At Proteans
Realize Your Potential At ProteansRealize Your Potential At Proteans
Realize Your Potential At ProteansPreejith
 
Infosys – Cloud Business Value Architecture
Infosys – Cloud Business Value ArchitectureInfosys – Cloud Business Value Architecture
Infosys – Cloud Business Value ArchitectureInfosys
 
Sponsored Session: Driving the business case and user adoption for SharePoint...
Sponsored Session: Driving the business case and user adoption for SharePoint...Sponsored Session: Driving the business case and user adoption for SharePoint...
Sponsored Session: Driving the business case and user adoption for SharePoint...SPTechCon
 
Oracle CRM On Demand Product Strategy And Roadmap
Oracle CRM On Demand Product Strategy And RoadmapOracle CRM On Demand Product Strategy And Roadmap
Oracle CRM On Demand Product Strategy And RoadmapJerome Leonard
 
Oracle Staffing Practice
Oracle Staffing PracticeOracle Staffing Practice
Oracle Staffing Practiceguest5c9d51
 
Cloud Limitless 2012
Cloud Limitless 2012Cloud Limitless 2012
Cloud Limitless 2012apsheehan
 
TMA brochure Business Apps
TMA brochure Business AppsTMA brochure Business Apps
TMA brochure Business AppsTMA Solutions
 
Cloud India Overview Snippet
Cloud India Overview SnippetCloud India Overview Snippet
Cloud India Overview Snippetvamseet
 
Cloud Computing Landscape in India
Cloud Computing Landscape in IndiaCloud Computing Landscape in India
Cloud Computing Landscape in IndiaZinnov
 
Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...
Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...
Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...Day Software
 
Northridge Presentation Share Point Portal User Experience
Northridge Presentation Share Point Portal User ExperienceNorthridge Presentation Share Point Portal User Experience
Northridge Presentation Share Point Portal User Experienceleewmartin
 
Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...
Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...
Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...InSync2011
 
Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010
Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010
Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010Oracle BH
 
Agile 2012 Conference briefing deck for Analyst and Press
Agile 2012 Conference briefing deck for Analyst and Press Agile 2012 Conference briefing deck for Analyst and Press
Agile 2012 Conference briefing deck for Analyst and Press Laszlo Szalvay
 
Business Process Optimization with Enterprise SOA and AIA
Business Process Optimization with Enterprise SOA and AIABusiness Process Optimization with Enterprise SOA and AIA
Business Process Optimization with Enterprise SOA and AIABob Rhubart
 
Oracle cloud story short
Oracle cloud story   shortOracle cloud story   short
Oracle cloud story shortYuri Grinshteyn
 
Quality and-process-outsourcing
Quality and-process-outsourcingQuality and-process-outsourcing
Quality and-process-outsourcingbhauc
 
Datameer Analytics Solution
Datameer Analytics SolutionDatameer Analytics Solution
Datameer Analytics Solutiontempledf
 

Ähnlich wie Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics (20)

Realize Your Potential At Proteans
Realize Your Potential At ProteansRealize Your Potential At Proteans
Realize Your Potential At Proteans
 
Tae 2012
Tae  2012Tae  2012
Tae 2012
 
Infosys – Cloud Business Value Architecture
Infosys – Cloud Business Value ArchitectureInfosys – Cloud Business Value Architecture
Infosys – Cloud Business Value Architecture
 
Oracle
OracleOracle
Oracle
 
Sponsored Session: Driving the business case and user adoption for SharePoint...
Sponsored Session: Driving the business case and user adoption for SharePoint...Sponsored Session: Driving the business case and user adoption for SharePoint...
Sponsored Session: Driving the business case and user adoption for SharePoint...
 
Oracle CRM On Demand Product Strategy And Roadmap
Oracle CRM On Demand Product Strategy And RoadmapOracle CRM On Demand Product Strategy And Roadmap
Oracle CRM On Demand Product Strategy And Roadmap
 
Oracle Staffing Practice
Oracle Staffing PracticeOracle Staffing Practice
Oracle Staffing Practice
 
Cloud Limitless 2012
Cloud Limitless 2012Cloud Limitless 2012
Cloud Limitless 2012
 
TMA brochure Business Apps
TMA brochure Business AppsTMA brochure Business Apps
TMA brochure Business Apps
 
Cloud India Overview Snippet
Cloud India Overview SnippetCloud India Overview Snippet
Cloud India Overview Snippet
 
Cloud Computing Landscape in India
Cloud Computing Landscape in IndiaCloud Computing Landscape in India
Cloud Computing Landscape in India
 
Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...
Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...
Crown Partners: Achieving Marketing Nirvana - Campaign, Systems and Analytics...
 
Northridge Presentation Share Point Portal User Experience
Northridge Presentation Share Point Portal User ExperienceNorthridge Presentation Share Point Portal User Experience
Northridge Presentation Share Point Portal User Experience
 
Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...
Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...
Developer and Fusion Middleware 2 _Alex Peattie _ An introduction to Oracle S...
 
Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010
Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010
Oracle tech fmw-02-soa-suite-11g-neum-15.04.2010
 
Agile 2012 Conference briefing deck for Analyst and Press
Agile 2012 Conference briefing deck for Analyst and Press Agile 2012 Conference briefing deck for Analyst and Press
Agile 2012 Conference briefing deck for Analyst and Press
 
Business Process Optimization with Enterprise SOA and AIA
Business Process Optimization with Enterprise SOA and AIABusiness Process Optimization with Enterprise SOA and AIA
Business Process Optimization with Enterprise SOA and AIA
 
Oracle cloud story short
Oracle cloud story   shortOracle cloud story   short
Oracle cloud story short
 
Quality and-process-outsourcing
Quality and-process-outsourcingQuality and-process-outsourcing
Quality and-process-outsourcing
 
Datameer Analytics Solution
Datameer Analytics SolutionDatameer Analytics Solution
Datameer Analytics Solution
 

Mehr von MongoDB

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump StartMongoDB
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB
 

Mehr von MongoDB (20)

MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB AtlasMongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
 
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
 
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDBMongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
 
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
 
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series DataMongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
 
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 MongoDB SoCal 2020: MongoDB Atlas Jump Start MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB SoCal 2020: MongoDB Atlas Jump Start
 
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
 
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
 
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
 
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
 
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your MindsetMongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
 
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas JumpstartMongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
 
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
 
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
 
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
 
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep DiveMongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
 
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & GolangMongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
 
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
 
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
 

Kürzlich hochgeladen

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Kürzlich hochgeladen (20)

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics

  • 1. Scaling  MongoDB  with   Sharding  –  A  Case  Study   Presented  by   Yash  Badiani  and  Rahul  Nair           CIGNEX  Datamatics  Con1idential   www.cignex.com  
  • 2. About  CIGNEX  Datamatics   A  subsidiary  of  Datamatics  Global  Services   Limited     CIGNEX  Datamatics  Con1idential   www.cignex.com   2  
  • 3. Introduction  of  Datamatics  (DGSL)   •  Mission   Strategic  Alliances   –  Experts  in  improving   Enterprise  productivity     through    Process  Engineering  &     Information  Management   Solutions   •  Key  Highlights   –  Founded  in  1975   –  Publicly  listed  in  India   –  Annual  consolidated  revenue  of   US$100  Million   –  Fortune  500  clients   –  4,400+  employees  across  22   of1ices  in  9  countries   CIGNEX  Datamatics  Con1idential   www.cignex.com   3  
  • 4. What  Does  CIGNEX  Datamatics  Do?   Since  2000,  making  Open  Source  work  for   the  enterprise  through  adoption  and   integration  to:   Portal   Solutions   Content     •  Address  business  goals   Solutions   •  Increase  business  velocity   •  Lower  the  cost  of  doing  business   •  Reduce  TCO   Big  Data   •  Gain  competitive  advantage   Solutions   400+  implementations  worldwide  across  industries   CIGNEX  Datamatics  Con1idential   www.cignex.com   4  
  • 5. Where  We  Can  Help  You    SOLUTIONS   •  Intranet     •  S o c i a l   Portals   Liferay,  Drupal,  JBoss,   •  •  Extranet   EAI   Collabora>on   •  Mobile  Portals   User  eXperience     ZK,  HTML5,   •  SOA     PlaRorm   MuleSoW   Alfresco,  Adobe  CQ,     •  WCM   Content   Drupal,  Magento,     •  DM   •  E-­‐Commerce   Enterprise  Content   •  RM   •  E-­‐learning    JBoss,  Moodle,  EphesoW,   •  CMS   •  ERP   Management   •  DAM   •  Imaging   Liferay            Solu>ons       •  Analy>cs   •  DW  -­‐  BI   Hadoop,    MongoDB,  Neo4j,   Big  Data   •  •  Mobile   Social   •  Log  Processing   Flume,  Hive     and  Analysis     Making  Data  Work   •  Web   •  Enterprise   Solr,    Pentaho,  JaspersoW   •  Real-­‐>me   Search      SERVICES   UI,    Development  ,  Integra>on,    Customiza>on,    Migra>on  ,  Tes>ng,      Training  ,    Support  (24*7)   Managed  Cloud  Services  -­‐  Develop,  Deploy,  Manage   VAR/Annual  Product  Subscrip>on  -­‐  Liferay,  Alfresco,  Cloudera  Hadoop,  MongoDB     Extended  Development  Center  –  Center  of  Excellence     CIGNEX  Datamatics  Con1idential   www.cignex.com   5  
  • 6. About  the  Presenters   •  Yash  Badiani  is  the  Big  Data  Practice  Lead  at  CIGNEX  Datamatics  and   focuses  on  Big  Data  Technologies  including  MongoDB  &  Hadoop.  He   has  worked  extensively  on  large  Data  warehousing  &  Business   Intelligence  projects  with  tools  such  as  Business  Objects,  Microsoft  SQL   Server,  Microstrategy,  IBM  Cognos.         •  Gaurav  Khambhala  works  at  CIGNEX  Datamatics  as  Technical  Lead.   He  is  the  senior  member  of  the  PHP  Practice  at  CIGNEX  Datamatics  and   is  involved  on  various  technology  initiatives  like  Big  Data  where  he   focuses  on  integration  of  PHP  with  NoSQL  sources  like  MongoDB.  He   has  a  wide  industry  experience  in  software  development  &   management  in  Open  Source  technologies  such  as  Drupal  &  Moodle   CIGNEX  Datamatics  Con1idential   www.cignex.com   6  
  • 7. Agenda   •  CIGNEX  Datamatics  –  Introduction  &  Offerings   •  Use  Case  &  Database  Requirements   •  Challenges  with  Traditional  Databases   •  Why  MongoDB?   •  Solution     –  Approach   –  Architecture  and  Hardware  Sizing   •  Scaling  with  Sharding   –  Sharding  Basics   –  Sharding  –  Choosing  the  RIGHT  Shard  Key   –  Benchmarking  with  Results   •  Key  Takeaways     CIGNEX  Datamatics  Con1idential   www.cignex.com   7  
  • 8. Big  Data  Practice  At  CIGNEX  Datamatics   Brief  Snapshot   •  ~40  employee  Big  Data  Practice   Technology  Partnership   focused  on  Hadoop,  MongoDB,  Neo4j,   Solr   •  Professionals  formally  trained  /   certi1ied  from  Cloudera  and  10gen   •  Expertize  in  Hadoop  Eco-­‐System   (HBase,  Pig,  Hive,  Flume,  Sqoop,   Oozie,  Zookeeper)   •  Strong  partnerships:   •  System  Integration  partners   with  Cloudera  for  CDH   •  Global  partner  with  10gen  for   MongoDB  –  multiple  webinars   on  different  solutions   CIGNEX  Datamatics  Con1idential   www.cignex.com   8  
  • 9. Our  Offerings  –  Big  Data   Support  &   Consulting   Implementation   Training   Consulting   Implementation   Support  &  Training   •  Business  Analysis     •  UI  Development   •  DBA  Support   •  Technology  Evaluation   •  Application  Integration   •  Application  Support   •  Architecture     •  Customization   •  Enhancements   •  Design  Framework   •  Migration   •  24*7  Production   •  Cluster  sizing   •  Testing   Support(Tier  1/2/3)   •  Deployment  planning   •  Performance  Tuning   •  Trainings   •  Proof-­‐of-­‐Concept   •  Health  Check   •  Performance   Benchmarking   CIGNEX  Datamatics  Con1idential   www.cignex.com   9  
  • 10. Use  Case     Load   Users   Devices   Database   Balancer     Data  Storage   App.  Layer   End  Users   Devices   7  Million  Users   8  devices  /  user   Load  Balancer   mongoDB  cluster   Spread  Across   Home/OfMice/ Receives    high   Sharding   Geography   Anywhere   volume  of   Replication  with   concurrent  CRUD   Automatic   requests   Failover   Routes  request   Indexes   trafMic  to  DB   cluster     CIGNEX  Datamatics  Con1idential   www.cignex.com   10  
  • 11. Database  Requirements   Flexibility     High   in  Schema   Performance   Agility  in     Development     &  Deployment   Availability   Enterprise     Level  Support   CIGNEX  Datamatics  Con1idential   www.cignex.com   11  
  • 12. Limitations  of  RDBMS   Support  limited  to   Manage  only  Structured   RDBMS  doesn’t  scale   Feature  rich  but  slow    terabytes   Data   inherently   performance     $   Complex  to  Shard/Partition   Limitations  in  scaling  High   Specialized  Hardware  -­‐   Vertical  Scaling  expensive   due  to  maintenance  of  schema   volume  of  concurrent  CRUD   Expensive   and  dif1icult  to  scale   RDBMS  can’t  manage  all  dimensions    of  data  with  speed  &  at  lower  cost.   CIGNEX  Datamatics  Con1idential   www.cignex.com   12
  • 13. Why  MongoDB?    Flexibility     High     in  Schema   Performance   •  Easy  integration   •  Concurrent  CRUD     •  Ease  of  schema     •  Fast  Updates   Agility  in                  design   •  Write  distribution     Development     •  Document  oriented                  with  Sharding    &  Deployment                storage   Schema  free   •  Programming     Indexes  &  Sharding                Language  drivers   •  Shorter  Dev  cycle   •  Faster  deployment   Enterprise   Availability   Level   Support   Driver  Support   •  Automatic  failover   •  Global  Coverage   •  Redundancy   •  24x7  Support   •  100%  uptime   •  Ease  of                    maintenance   Replication   Strong  Community   CIGNEX  Datamatics  Con1idential   www.cignex.com   13  
  • 14. Solution:  Approach     Schema   • Schema  Design                                                                     • Collections  and  Field  De1initions   • Document  Size   Database  Size   • Total  expected  data  size   • Frequency  of  CRUD  operations     Concurrent  Load   • Read/Write  ratio   • Automatic  Failover   Availability   • Replication  and  Backup   • Working  Set   Indexing   • Access  Patterns   • Horizontal  Scaling   Sharding   • Query  Performance   • Cluster  sizing   Hardware  Sizing   • RAM  and  Disk  storage   CIGNEX  Datamatics  Con1idential   www.cignex.com   14  
  • 15. Solution:  Architecture   Con1ig  Servers   Shard  1   mongos   mongod   Server   App   Primary   Mongod   mongod   mongod     Arbiter   mongod   Secondary   mongos   Server   mongod     App     Shard  2   mongod   Primary   Mongod   mongos   Server     Arbiter   App   Balancer   Data  Tier   Load     mongod   Secondary   Routed  Requests  from   mongos  to  shards   mongos   Server     App   Shard  3   mongod   Primary   Mongod     Arbiter   mongos   mongod   Server   App   Secondary   Shard  4   mongod   mongos   Server   Primary   App     Mongod   Arbiter   mongod   Secondary   App  Tier   Routed  for  non-­‐ sharded  collections   Replica  Set   mongod   Primary   Mongod   Arbiter   mongod   Secondary   CIGNEX  Datamatics  Con1idential   www.cignex.com   15  
  • 16. Sharding  –  What  is  it?   •  Distributes  single  logical  database  system  across  clusters   •  Allows  to  partition  a  collection  across  #  of  mongod   instances(shards)   •  Advantages:   –  Increases  write  capacity   –  Ability  to  support  larger  working  sets   –  Raises  limits  of  data  size  beyond  a  single  node     CIGNEX  Datamatics  Con1idential   www.cignex.com   16  
  • 17. Sharding  -­‐  Features   •  Range-­‐based  Data  Partitioning   •  Automatic  Data  volume  distribution   •  Transparent  query  routing   •  Horizontal  capacity   –  Additional  write  capacity  through  distribution   –  Right  shard  key  allows  expansion  of  working  set     CIGNEX  Datamatics  Con1idential   www.cignex.com   17  
  • 18. Sharding  –  When  to  use?   Your  data  set  approaches  or  exceeds  the  storage   capacity  of  a  single  node  in  your  system   Storage   Drive   The  size  of  your  system’s  active  working  set  will  soon   exceed  the  capacity  of  the  maximum  amount  of  RAM   for  your  system   RAM   Working  Set   Your  system  has  a  large  amount  of  write   activity,  a  single  MongoDB  instance  cannot   Storage   write  data  fast  enough  to  meet  demand,  and  all   Drive   other  approaches  have  not  reduced  contention       CIGNEX  Datamatics  Con1idential   www.cignex.com   18  
  • 19. Shard  Keys   Shard  Keys:   •     The  ideal  shard  key  :   Exist  in  every  document  in  a   collection  that  MongoDB  uses  to   –  Easily  divisible  which  makes  it   distribute  documents  among  the   shards  like  indexes,  they  can  be   easy  for  MongoDB  to  distribute   either  a  single  1ield,  or  a   compound  key   content  among  the  shards   –  Higher  “randomness”   –  Targeted  queries   –  May  need  to  be  computed   CIGNEX  Datamatics  Con1idential   www.cignex.com   19  
  • 20. Choosing  Right  Shard  Key   Different  approach  for  Shard  Keys     •  Approach  1:  Random  Key    –  UserId   •  Approach  2:  Coarsely  ascending  key  +  Random  Key  –     YearMonth  +  UserId     CIGNEX  Datamatics  Con1idential   www.cignex.com   20  
  • 21. Benchmarking  /  Load  Testing  Approach   Automated  scripts  with  varied  load       CIGNEX  Datamatics  Con1idential   www.cignex.com   21  
  • 22. Results  -­‐  INSERTS   Approach  1   Over  80  million  documents  inserted   with  a  decreasing  threshold  over  10   million   Approach  2   Over  225  million  documents  inserted  at   a  stable  rate  of  6000  documents/sec   Benchmarks  done  on  8GB  Test  H/W  Machines   CIGNEX  Datamatics  Con1idential   www.cignex.com   22  
  • 23. Results  -­‐  UPDATES   Approach  1   Over  50  million  documents  updated  at   avg.  400  documents/sec   Approach  2   Over  100  million  documents  updated  at   as  high  as.  4000  documents/sec   Benchmarks  done  on  8GB  Test  H/W  Machines   CIGNEX  Datamatics  Con1idential   www.cignex.com   23  
  • 24. Results  –  INSERT,  UPDATE   Approach  2   Simultaneous  INSERT   >6000  documents/  second   >70  million  records   Simultaneous  UPDATE   >6000  documents/  second   >50  million  records   Benchmarks  done  on  8GB  Test  H/W  Machines   CIGNEX  Datamatics  Con1idential   www.cignex.com   24  
  • 25. Benchmarking  –  Sharding  Vs  Non  Sharding   Operation   Sharding  (YearMonth  +   Non-­‐Sharding   UserId)   INSERTS   ~6000  docs/sec   ~2900  docs/sec   UPDATES   ~4000  docs/sec   ~620  updates/sec   INSERT  &   ~6000  docs/sec  &   ~2000  docs/sec  &   UPDATES   ~6100  docs/sec   ~600  docs/sec   Benchmarks  done  on  8GB  Test  H/W  Machines   CIGNEX  Datamatics  Con1idential   www.cignex.com   25  
  • 26. Key  Takeaways   •  Comprehensive  approach  on  Performance  Tuning   •  Plan  Early  for  Performance   •  MongoDB  scales  &  shines   •  Sharding  scales  INSERTS/UPDATES  vs.  Non  sharding   •  Sharding  with  Approach  2  (Coarsely  ascending  Key  +  Random   Key)  provides  sustained  results  &  better  utilization  of  the  RAM     •  Different  set  of  server/s  for  NON-­‐Sharded  collections   •  Indexes  to  be  de1ined  carefully   •  Sharded  collections  to  have  minimal  number  of  indexes   CIGNEX  Datamatics  Con1idential   www.cignex.com   26  
  • 27. Thank  You.  Any  Questions  ?   Making  Open  Source  Work     For  queries  reach  out  to  us  at  info@cignex.com           CIGNEX  Datamatics  Con1idential   www.cignex.com