1. Designs of Experiment Project
B.Sc. ASA SOS NMIMS Bengaluru IV th.
Semester
Bengaluru Zomato
Dataset EDA
Report
Mentor:
Prof. Preeti Ravikiran
Submitted by:
Vidit Jain
75251219007
M013
2. 1
Introduction ...............................................................................................................2
About Dataset .........................................................................................................3
EDA with Visualization ............................................................................................4
Inferences .............................................................................................................10
Suggested Hypothesis ............................................................................................11
References .............................................................................................................12
CONTENTS
3. 2
Introduction
Background:
Zomato is an Indian restaurant aggregator and food delivery start-up founded by Deepinder
Goyal and Pankaj Chadha in 2008. Zomato provides information, menus, and user-reviews of
restaurants as well as food delivery options from partner restaurants in select cities.
• Industry: Online food ordering Retail
• Services: Restaurant Search & Discovery, Online Ordering, Table Reservations &
Management, POS Systems, Subscription Service
The basic idea of analyzing the Zomato dataset is to get a fair idea about the factors affecting
the establishment of different types of restaurant at different places in Bengaluru, aggregate
rating of each restaurant, Bengaluru being one such city has more than50,000 restaurants
with restaurants serving dishes from all over the world.
With each day new restaurants opening the industry has not been saturated yet and
the demand is increasing day by day. In spite of increasing demand, it however has
become difficult for new restaurants to compete with established restaurants. Most
of them serving the same food. Bengaluru being an IT capital of India. Most of the
people here are dependent mainly on the restaurant food as they do not have time
to cook for themselves. With such an overwhelming demand of restaurants it has
therefore become important to study the demography of a location.
4. 3
About Dataset
This is publicly available dataset available on Kaggle based upon restaurant listed on Zomato
website for Bengaluru city as on 15-3-2019 contributed by user: Himanshu Poddar. It could be
accessed here. The data was scraped from the website of Zomato using Python’s Selenium API.
All possible ways were tried to keep the data error free and have tried to achieve 100 percent
accuracy in the dataset.
Dataset Properties:
1. The data is in csv
format.
2. The total size of the
dataset is
approximately
547MB.
3. The dataset
contains 17
variables all of
which were scraped
from the Zomato
website. The
dataset contains
details of 51,717
restaurants in
Bengaluru listed as
on 15 March 2019.
5. 4
“Exploratory Data Analysis refers to the critical process of performing initial investigations on data
so as to discover patterns, to spot anomalies, to test hypothesis and to check assumptions with the
help of summary statistics and graphical representations”
For this project we employed Python as an environment for all programing and execution needs.
And used various Python Packages for our analysis such as:
1.Pandas 4. Matplotlib etc.
2.NumPy 5. Seaborn
3.Follium 6. Sklearn
For sake of faster and convenient understanding of our work this report only consist of derived
results and graphs, Code to all data processing, visualization and results could be found at our
Python Notebook here.
1.Top restaurants chains in Bengaluru
Exploratory Data Analysis
& Visualization
6. 5
2.How many of the restaurants do not accept online orders?
3.How many restaurants do not provide online table
reservation?
Exploratory Data Analysis
& Visualization