SlideShare uma empresa Scribd logo
1 de 16
Baixar para ler offline
Japanese
Restaurants in NYC
By Jiro Stenger
Capstone Project
The Battle of Neighborhoods
Part of the IBM Data Science Professional Certificate
Jan 2020
Japanese
Restaurants in NYC By Jiro Stenger
Introduction
• The following capstone project is part of the Coursera Applied Data Science Course, which is part of the IBM Data
Science Professional Certification.
• GitHub Repository:
https://github.com/jirostenger/Coursera_Capstone/blob/edae726cae77657a1627516ba484567dc71eba78/Battl
e%20of%20Neighborhoods%20Project%20Part%202.ipynb
• The idea is to think of a fictitious business problem of a client and to solve it using data & FourSquare API
• The goal of the following project is to find out the best locations for a Japanese restaurant in New York City
Jan 2020
Japanese
Restaurants in NYC By Jiro Stenger
Business Problem
Jan 2020
• A client wants to open a major Japanese restaurant in NYC
• The client is aware there is lots of competition in a city like NYC
• The investment, that is needed to open the restaurant is very high, so the client needs recommendation on the
location to choose for the venue
• He needs to know where the Japanese restaurants are located and what they serve
• After a little research, I found there is a neighborhood called "little Tokyo". It is supposed to be located in the
East Village between St. Mark's Place and 10th Street (Source: Beacon Hotel, 2018)
• I will attempt to prove, if there is actually a neighborhood in NYC with a high quantity of Japanese restaurants as
well as whether the location stated before is correct or not
Japanese
Restaurants in NYC By Jiro Stenger
Data
• I need data on boroughs and neighborhoods of New York City
• I need data of restaurants and their location with latitute and longitute as well as their menus
• i will get the data on neighborhoods, boroughs, latitudes and longitudes from the following
GeoJSON: https://cocl.us/new_york_dataset
• The data on neighborhood boundaries of NYC I will obtain from here: https://data.cityofnewyork.us/City-
Government/Borough-Boundaries/tqmj-j8zm
• Lastly, I will get the data on Japanese restaurants and the type of food they serve as well as their geo coordinates
via the FourSquare API and request library
Jan 2020
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
1. Collect the described data from the section Data
2. Clean and process it into a dataframe
3. Use FourSquare data to locate all the restaurants and filter down to the Japanese ones
4. Take a look at the neighborhoods and boroughs of NYC and locate the Japanese restaurants in the areas
5. Using matplotlib and folium, I will visualize our data to get a better understanding
6. Get the menus of the restaurants by FourSquare and add them to the dataframe
7. In the end I will use the data of the restaurants menu to cluster the restaurants using K-means.
Jan 2020
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• I imported some libraries
and defined functions
• They can be found in the
GitHub Repo
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• I got the NYC data & the result was a total of 306 neighborhoods
• Then I visualized the distribution of the neighborhoods per borough
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• I also got the location data of the
venues from the FourSquare API
combined with the neighborhood
and borough data from the
first data set
• See next slide for dataframe
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• List of the venues & location data from FourSquare
• 75 venues returned, but no restaurant in Bronx
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• Unlike expected, East Village aka little Tokyo is not among the top 5 neighborhoods
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• Japanese restaurant “Kyo Ya“ is actually
on 10th Street in East Village
• But it‘s the only one…
Japanese
Restaurants in NYC By Jiro Stenger
Methodology
Jan 2020
• Unfortunately it is not possible to get
menu data with a regular FS account
• It only returns an empty dataframe
• This is why I had to abort the attempt
to get menu data for clustering
Japanese
Restaurants in NYC By Jiro Stenger
Results
Jan 2020
Even though the project had to be aborted because of the missing permission of a regular Foursquare account, I
was able to achieve the following results throughout the project:
• The borough Manhatten consists of the least neighborhoods, but has the most Japanese restaurants
• There is no Japanese restaurant in the borough Bronx. This could be an interesting starting point for further
evaluation
• In East Village aka little Tokyo we expected to find the most Japanese restaurants. Surprisingly, there is only one
restaurant yet. There must be another reason for the neighborhood to be called little Tokyo then having
Japanese restaurants. Since there is only one restaurant yet, it could be an interesting area to propose to the
client
Japanese
Restaurants in NYC By Jiro Stenger
Discussion
Jan 2020
• Looking at the elaborated neighborhoods, there is a good chance that East Village could be an interesting
neighborhood for the client to open the major Japanese restaurant
• Like mentioned before though, a lot more research needs to be done to deliver a valuable recommendation for
the client
• Also a lot of other variables like demographic characteristics could be evaluated in a further study, but since this
project is fictitious, I will stop right here.
Japanese
Restaurants in NYC By Jiro Stenger
Conclusion
Jan 2020
• The project has been a good way to apply theorethical knowledge learned from the last courses of the Applied
Data Science Specialization
• Unfortunately FourSquare changed it's policy, so the access to valuable venue data was very limited
• Hence, the course could be updated, so participants do not rely on FourSquare data in future studies
Japanese
Restaurants in NYC By Jiro Stenger
Sources
Jan 2020
• Nippon.com, 2016: https://www.nippon.com/en/features/h00128/japanese-restaurants-on-the-rise-abroad.html
• Investopedia, 2019: https://www.investopedia.com/articles/personal-finance/012315/how-expensive-new-york-city-
really.asp
• Beacon Hotel, 2018: https://www.beaconhotel.com/blog/your-intro-to-all-things-japanese-in-nyc/

Mais conteúdo relacionado

Semelhante a Capstone Project - Battle of Neighborhoods

Networld - Our Data Journey (2016-09-29)
Networld - Our Data Journey (2016-09-29)Networld - Our Data Journey (2016-09-29)
Networld - Our Data Journey (2016-09-29)Patrick Ng
 
Coronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote CultureCoronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote CultureKoray Tugberk GUBUR
 
Design-led Approach to Big Data
Design-led Approach to Big Data Design-led Approach to Big Data
Design-led Approach to Big Data ChrysSullivan
 
'Driving app discovery outside mobile' - How to use other channels to drive d...
'Driving app discovery outside mobile' - How to use other channels to drive d...'Driving app discovery outside mobile' - How to use other channels to drive d...
'Driving app discovery outside mobile' - How to use other channels to drive d...App Promotion Summit Conference
 
See Your Raiser’s Edge Data from a Different Angle Using Pivot Reports
See Your Raiser’s Edge Data from a Different Angle Using Pivot ReportsSee Your Raiser’s Edge Data from a Different Angle Using Pivot Reports
See Your Raiser’s Edge Data from a Different Angle Using Pivot ReportsBlackbaud
 
Financial activities and npa of cooperative banl ltd
Financial activities and npa of cooperative banl ltd Financial activities and npa of cooperative banl ltd
Financial activities and npa of cooperative banl ltd NAMITHA SUDHAKAR
 
HNA Slides Show and Tell 14.10.21
HNA Slides Show and Tell 14.10.21 HNA Slides Show and Tell 14.10.21
HNA Slides Show and Tell 14.10.21 PAS_Team
 
REAL ESTATE SECTOR - GODREJ PROPERTIES
REAL ESTATE SECTOR- GODREJ PROPERTIESREAL ESTATE SECTOR- GODREJ PROPERTIES
REAL ESTATE SECTOR - GODREJ PROPERTIESAkshay Jain
 
Snag 'Em for Life, Tag 'Em in eTapestry
Snag 'Em for Life, Tag 'Em in eTapestrySnag 'Em for Life, Tag 'Em in eTapestry
Snag 'Em for Life, Tag 'Em in eTapestryBlackbaud
 
Fairfax County Land Development Time to Market
Fairfax County Land Development Time to MarketFairfax County Land Development Time to Market
Fairfax County Land Development Time to MarketFairfax County
 
Research Framework for the Cape Town Stadium V3
Research Framework for the Cape Town Stadium V3Research Framework for the Cape Town Stadium V3
Research Framework for the Cape Town Stadium V3Roslyn Bristow
 
Rehoboth Beach, DE - Funding the Proposed City Hall
Rehoboth Beach, DE - Funding the Proposed City Hall Rehoboth Beach, DE - Funding the Proposed City Hall
Rehoboth Beach, DE - Funding the Proposed City Hall rehobothbeachde
 
Live Mag SA: Presentation by Coded
Live Mag SA: Presentation by CodedLive Mag SA: Presentation by Coded
Live Mag SA: Presentation by CodedPenny Mathebula
 
Queries, Exports, Reports: Where to go in The Raiser's Edge
Queries, Exports, Reports: Where to go in The Raiser's EdgeQueries, Exports, Reports: Where to go in The Raiser's Edge
Queries, Exports, Reports: Where to go in The Raiser's EdgeBlackbaud
 

Semelhante a Capstone Project - Battle of Neighborhoods (20)

Networld - Our Data Journey (2016-09-29)
Networld - Our Data Journey (2016-09-29)Networld - Our Data Journey (2016-09-29)
Networld - Our Data Journey (2016-09-29)
 
Long Sohu
Long Sohu Long Sohu
Long Sohu
 
Coronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote CultureCoronavirus and Future of SEO: Digital Marketing and Remote Culture
Coronavirus and Future of SEO: Digital Marketing and Remote Culture
 
Design-led Approach to Big Data
Design-led Approach to Big Data Design-led Approach to Big Data
Design-led Approach to Big Data
 
'Driving app discovery outside mobile' - How to use other channels to drive d...
'Driving app discovery outside mobile' - How to use other channels to drive d...'Driving app discovery outside mobile' - How to use other channels to drive d...
'Driving app discovery outside mobile' - How to use other channels to drive d...
 
See Your Raiser’s Edge Data from a Different Angle Using Pivot Reports
See Your Raiser’s Edge Data from a Different Angle Using Pivot ReportsSee Your Raiser’s Edge Data from a Different Angle Using Pivot Reports
See Your Raiser’s Edge Data from a Different Angle Using Pivot Reports
 
Financial activities and npa of cooperative banl ltd
Financial activities and npa of cooperative banl ltd Financial activities and npa of cooperative banl ltd
Financial activities and npa of cooperative banl ltd
 
HNA Slides Show and Tell 14.10.21
HNA Slides Show and Tell 14.10.21 HNA Slides Show and Tell 14.10.21
HNA Slides Show and Tell 14.10.21
 
REAL ESTATE SECTOR - GODREJ PROPERTIES
REAL ESTATE SECTOR- GODREJ PROPERTIESREAL ESTATE SECTOR- GODREJ PROPERTIES
REAL ESTATE SECTOR - GODREJ PROPERTIES
 
Snag 'Em for Life, Tag 'Em in eTapestry
Snag 'Em for Life, Tag 'Em in eTapestrySnag 'Em for Life, Tag 'Em in eTapestry
Snag 'Em for Life, Tag 'Em in eTapestry
 
Fairfax County Land Development Time to Market
Fairfax County Land Development Time to MarketFairfax County Land Development Time to Market
Fairfax County Land Development Time to Market
 
Research Framework for the Cape Town Stadium V3
Research Framework for the Cape Town Stadium V3Research Framework for the Cape Town Stadium V3
Research Framework for the Cape Town Stadium V3
 
Rehoboth Beach, DE - Funding the Proposed City Hall
Rehoboth Beach, DE - Funding the Proposed City Hall Rehoboth Beach, DE - Funding the Proposed City Hall
Rehoboth Beach, DE - Funding the Proposed City Hall
 
Maple Heights Master Plan presentation
Maple Heights Master Plan presentationMaple Heights Master Plan presentation
Maple Heights Master Plan presentation
 
Wiki pizza copy
Wiki pizza   copyWiki pizza   copy
Wiki pizza copy
 
Live Mag SA: Presentation by Coded
Live Mag SA: Presentation by CodedLive Mag SA: Presentation by Coded
Live Mag SA: Presentation by Coded
 
Netflix in Nigeria 1
Netflix in Nigeria 1Netflix in Nigeria 1
Netflix in Nigeria 1
 
Netflix in Nigeria
Netflix in NigeriaNetflix in Nigeria
Netflix in Nigeria
 
Queries, Exports, Reports: Where to go in The Raiser's Edge
Queries, Exports, Reports: Where to go in The Raiser's EdgeQueries, Exports, Reports: Where to go in The Raiser's Edge
Queries, Exports, Reports: Where to go in The Raiser's Edge
 
Budget (1).ppt
Budget (1).pptBudget (1).ppt
Budget (1).ppt
 

Último

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...KarteekMane1
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 

Último (20)

6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
wepik-insightful-infographics-a-data-visualization-overview-20240401133220kwr...
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 

Capstone Project - Battle of Neighborhoods

  • 1. Japanese Restaurants in NYC By Jiro Stenger Capstone Project The Battle of Neighborhoods Part of the IBM Data Science Professional Certificate Jan 2020
  • 2. Japanese Restaurants in NYC By Jiro Stenger Introduction • The following capstone project is part of the Coursera Applied Data Science Course, which is part of the IBM Data Science Professional Certification. • GitHub Repository: https://github.com/jirostenger/Coursera_Capstone/blob/edae726cae77657a1627516ba484567dc71eba78/Battl e%20of%20Neighborhoods%20Project%20Part%202.ipynb • The idea is to think of a fictitious business problem of a client and to solve it using data & FourSquare API • The goal of the following project is to find out the best locations for a Japanese restaurant in New York City Jan 2020
  • 3. Japanese Restaurants in NYC By Jiro Stenger Business Problem Jan 2020 • A client wants to open a major Japanese restaurant in NYC • The client is aware there is lots of competition in a city like NYC • The investment, that is needed to open the restaurant is very high, so the client needs recommendation on the location to choose for the venue • He needs to know where the Japanese restaurants are located and what they serve • After a little research, I found there is a neighborhood called "little Tokyo". It is supposed to be located in the East Village between St. Mark's Place and 10th Street (Source: Beacon Hotel, 2018) • I will attempt to prove, if there is actually a neighborhood in NYC with a high quantity of Japanese restaurants as well as whether the location stated before is correct or not
  • 4. Japanese Restaurants in NYC By Jiro Stenger Data • I need data on boroughs and neighborhoods of New York City • I need data of restaurants and their location with latitute and longitute as well as their menus • i will get the data on neighborhoods, boroughs, latitudes and longitudes from the following GeoJSON: https://cocl.us/new_york_dataset • The data on neighborhood boundaries of NYC I will obtain from here: https://data.cityofnewyork.us/City- Government/Borough-Boundaries/tqmj-j8zm • Lastly, I will get the data on Japanese restaurants and the type of food they serve as well as their geo coordinates via the FourSquare API and request library Jan 2020
  • 5. Japanese Restaurants in NYC By Jiro Stenger Methodology 1. Collect the described data from the section Data 2. Clean and process it into a dataframe 3. Use FourSquare data to locate all the restaurants and filter down to the Japanese ones 4. Take a look at the neighborhoods and boroughs of NYC and locate the Japanese restaurants in the areas 5. Using matplotlib and folium, I will visualize our data to get a better understanding 6. Get the menus of the restaurants by FourSquare and add them to the dataframe 7. In the end I will use the data of the restaurants menu to cluster the restaurants using K-means. Jan 2020
  • 6. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • I imported some libraries and defined functions • They can be found in the GitHub Repo
  • 7. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • I got the NYC data & the result was a total of 306 neighborhoods • Then I visualized the distribution of the neighborhoods per borough
  • 8. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • I also got the location data of the venues from the FourSquare API combined with the neighborhood and borough data from the first data set • See next slide for dataframe
  • 9. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • List of the venues & location data from FourSquare • 75 venues returned, but no restaurant in Bronx
  • 10. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • Unlike expected, East Village aka little Tokyo is not among the top 5 neighborhoods
  • 11. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • Japanese restaurant “Kyo Ya“ is actually on 10th Street in East Village • But it‘s the only one…
  • 12. Japanese Restaurants in NYC By Jiro Stenger Methodology Jan 2020 • Unfortunately it is not possible to get menu data with a regular FS account • It only returns an empty dataframe • This is why I had to abort the attempt to get menu data for clustering
  • 13. Japanese Restaurants in NYC By Jiro Stenger Results Jan 2020 Even though the project had to be aborted because of the missing permission of a regular Foursquare account, I was able to achieve the following results throughout the project: • The borough Manhatten consists of the least neighborhoods, but has the most Japanese restaurants • There is no Japanese restaurant in the borough Bronx. This could be an interesting starting point for further evaluation • In East Village aka little Tokyo we expected to find the most Japanese restaurants. Surprisingly, there is only one restaurant yet. There must be another reason for the neighborhood to be called little Tokyo then having Japanese restaurants. Since there is only one restaurant yet, it could be an interesting area to propose to the client
  • 14. Japanese Restaurants in NYC By Jiro Stenger Discussion Jan 2020 • Looking at the elaborated neighborhoods, there is a good chance that East Village could be an interesting neighborhood for the client to open the major Japanese restaurant • Like mentioned before though, a lot more research needs to be done to deliver a valuable recommendation for the client • Also a lot of other variables like demographic characteristics could be evaluated in a further study, but since this project is fictitious, I will stop right here.
  • 15. Japanese Restaurants in NYC By Jiro Stenger Conclusion Jan 2020 • The project has been a good way to apply theorethical knowledge learned from the last courses of the Applied Data Science Specialization • Unfortunately FourSquare changed it's policy, so the access to valuable venue data was very limited • Hence, the course could be updated, so participants do not rely on FourSquare data in future studies
  • 16. Japanese Restaurants in NYC By Jiro Stenger Sources Jan 2020 • Nippon.com, 2016: https://www.nippon.com/en/features/h00128/japanese-restaurants-on-the-rise-abroad.html • Investopedia, 2019: https://www.investopedia.com/articles/personal-finance/012315/how-expensive-new-york-city- really.asp • Beacon Hotel, 2018: https://www.beaconhotel.com/blog/your-intro-to-all-things-japanese-in-nyc/