IT’S ALL ABOUT DATA & ANALYTICS
Different Profiles of Analytics
DATA SCIENCE
• Job titles: data scientist, chief scientist, senior analyst, director of analytics, etc.
• Industries: digital analytics, search technology, marketing, fraud detection,
astronomy, energy, healthcare, social networks, finance, forensics, security
(NSA), mobile, telecommunications, and weather forecasts.
• Projects: taxonomy creation (text mining, big data), clustering applied to big
data sets, recommendation engines, simulations, rule systems for statistical
scoring engines, root cause analysis, automated bidding, forensics, exoplanet
detection, and early detection of terrorist activity or pandemics.
• Main components: machine-to-machine communication and automation.
• Overlaps with computer science, statistics, machine learning, data mining,
operations research, and business intelligence.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
MACHINE LEARNING - Very popular computer science
discipline
• Part of data science and closely related to data mining. Machine learning is
about designing algorithms (like data mining), but the emphasis is on prototyping
algorithms for production mode and designing automated systems.
• Python is now a popular language for ML development projects.
• Core algorithms include clustering and supervised classification, rule systems,
and scoring techniques.
• Deep learning is a sub-domain close to artificial intelligence.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
DATA MINING
• Designing algorithms to extract insights from rather large and potentially
unstructured data (text mining), sometimes called Nugget Discovery.
• Techniques include pattern recognition, feature selection, clustering, and
supervised classification; it also encompasses a few statistical techniques.
• Data mining thus has some intersection with statistics, and it is a subset of
data science.
• Data miners use open-source tools and software such as RapidMiner.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
PREDICTIVE MODELING
• Predictive modeling projects occur in all industries across all disciplines.
• The aim is to predict the future from past data, usually but not always using
statistical modeling.
• Predictions often come with confidence intervals.
• Roots of predictive modeling are in statistical science.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
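To illustrate what a prediction that "comes with a confidence interval" can look like, here is a minimal Python sketch. It fits a straight-line trend to past values and attaches an approximate 95% interval to the next-period prediction. The sample data, the function name `forecast_with_interval`, and the use of a normal approximation in place of a t-distribution are assumptions made for illustration; none of them come from the cited source.

```python
from math import sqrt
from statistics import NormalDist, mean


def forecast_with_interval(history, steps_ahead=1, level=0.95):
    """Fit y = a + b*x by least squares and return (point, low, high).

    Assumes equally spaced observations and at least 3 data points.
    The interval uses a normal quantile, so it is only approximate
    for short series.
    """
    n = len(history)
    xs = range(n)
    x_bar, y_bar = mean(xs), mean(history)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, history)) / sxx
    intercept = y_bar - slope * x_bar

    # Residual standard error of the fitted line.
    residuals = [y - (intercept + slope * x) for x, y in zip(xs, history)]
    s = sqrt(sum(r * r for r in residuals) / (n - 2))

    # Standard error of a new observation at x_new.
    x_new = n - 1 + steps_ahead
    se = s * sqrt(1 + 1 / n + (x_new - x_bar) ** 2 / sxx)

    z = NormalDist().inv_cdf(0.5 + level / 2)
    point = intercept + slope * x_new
    return point, point - z * se, point + z * se


# Hypothetical past data (e.g. monthly sales).
point, low, high = forecast_with_interval([100, 104, 109, 115, 118, 124])
print(f"prediction: {point:.1f}, interval: [{low:.1f}, {high:.1f}]")
```

The interval widens as `steps_ahead` grows, which reflects the greater uncertainty of predictions further from the observed data.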
Different Profiles of Analytics
STATISTICS
• Losing ground to data science, industrial statistics, operations research, data
mining, and machine learning, where the same clustering, cross-validation and
statistical training techniques are used, albeit in a more automated way and on
bigger data.
• Many professionals who were called statisticians 10 years ago have seen their
job title changed to data scientist or analyst in the last few years.
• Modern sub-domains include statistical computing, statistical learning (closer to
machine learning), computational statistics (closer to data science), data-driven
(model-free) inference, sport statistics, and Bayesian statistics.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
INDUSTRIAL STATISTICS
• Statistics frequently performed by non-statisticians (engineers with good
statistical training), working on engineering projects such as yield optimization
or load balancing (system analysts). They use very applied statistics, and their
framework is closer to Six Sigma, quality control, and operations research than
to traditional statistics. Also found in the oil and manufacturing industries.
• Techniques used include time series, ANOVA, experimental design, survival
analysis, signal processing (filtering, noise removal, deconvolution), spatial
models, simulation, Markov chains, and risk and reliability models.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
ACTUARIAL SCIENCES
• A subset of statistics focusing on insurance (car, health, etc.).
• Uses survival models to predict when you will die and what your health
expenditures will be, based on your health status (smoker, gender, previous
diseases), in order to determine your insurance premiums.
• Also predicts extreme floods and weather events to determine premiums.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
HPC
• High performance computing, not a discipline per se, but should be of concern
to data scientists, big data practitioners, computer scientists and
mathematicians, as it can redefine the computing paradigms in these fields.
• HPC should not be confused with Hadoop and MapReduce: HPC is hardware-related,
whereas Hadoop is software-related (though it relies heavily on Internet
bandwidth and on server configuration and proximity).
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
OPERATIONS RESEARCH(OR)
• OR is about decision science and optimizing traditional business projects:
inventory management, supply chain, pricing. Practitioners make heavy use of Markov chain
models, Monte Carlo simulations, queuing and graph theory, and software
such as AIMS, Matlab or Informatica.
• Big, traditional companies use OR, whereas new and small ones (start-ups) use data
science to handle pricing, inventory management or supply chain problems.
• Car traffic optimization is a modern example of an OR problem, solved with
simulations, commuter surveys, sensor data and statistical modeling.
• OR has a significant overlap with Six Sigma, also solves econometric problems,
and has many practitioners and applications in the army and defense sectors.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
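As a rough illustration of the Monte Carlo simulations mentioned above, applied to an inventory management question, the sketch below estimates how likely a given stock level is to run out within a week. The demand model (1 to 20 units per day, uniformly at random), the function name `stockout_probability`, and all the numbers are hypothetical choices for the example, not figures from the source.

```python
import random


def stockout_probability(stock, days=7, trials=10_000, seed=0):
    """Estimate P(total demand over `days` exceeds `stock`) by simulation."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same estimate
    stockouts = 0
    for _ in range(trials):
        # Hypothetical demand model: 1 to 20 units per day, uniformly at random.
        demand = sum(rng.randint(1, 20) for _ in range(days))
        if demand > stock:
            stockouts += 1
    return stockouts / trials


# Compare a few candidate stock levels.
for stock in (60, 80, 100):
    print(stock, stockout_probability(stock))
```

A real study would replace the uniform demand model with one estimated from historical sales, but the structure (simulate many scenarios, count the bad outcomes) stays the same.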
Different Profiles of Analytics
ECONOMETRICS
• Econometrics is heavily statistical in nature, using time series models such as
auto-regressive processes.
• Also overlapping with operations research (itself overlapping with statistics!)
and mathematical optimization (simplex algorithm).
• Econometricians like ROC and efficiency curves.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
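For readers unfamiliar with auto-regressive processes, here is a minimal sketch of the simplest case, an AR(1) model in which each value is a constant multiple of the previous value plus noise. It assumes a zero-mean series with no intercept term; the function names and the toy data are illustrative only.

```python
def fit_ar1(series):
    """Least-squares estimate of phi in x[t] = phi * x[t-1] + noise.

    Assumes a zero-mean series (no intercept term).
    """
    pairs = list(zip(series, series[1:]))  # (previous, current) pairs
    numerator = sum(curr * prev for prev, curr in pairs)
    denominator = sum(prev * prev for prev, _ in pairs)
    return numerator / denominator


def forecast_next(series):
    """One-step-ahead forecast: phi times the last observed value."""
    return fit_ar1(series) * series[-1]


# Toy series that halves at every step, so phi should come out as 0.5.
history = [16.0, 8.0, 4.0, 2.0, 1.0]
print(fit_ar1(history), forecast_next(history))
```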
Different Profiles of Analytics
DATA ENGINEERING
• Performed by software engineers (developers) or architects (designers) in large
organizations (sometimes by data scientists in tiny companies).
• A sub-domain currently under attack is data warehousing, as this term is
associated with static, siloed conventional databases, data architectures, and
data flows, threatened by the rise of NoSQL, NewSQL and graph databases.
• Transforming these old architectures into new ones (only when needed), or
making them compatible with new ones, is a lucrative business.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
BUSINESS INTELLIGENCE
• Focuses on dashboard creation, metric selection, producing and scheduling data
reports (statistical summaries) sent by email or delivered/presented to
executives, competitive intelligence (analyzing third party data), as well as
involvement in database schema design (working with data architects) to collect
useful, actionable business data efficiently.
• The typical job title is Business Analyst.
• Some are more involved with marketing, product or finance (forecasting sales and
revenue).
• Some have learned advanced statistics such as time series, but most only use
(and need) basic stats and light analytics, relying on IT to maintain databases
and harvest data.
• BI and market research (but not competitive intelligence) are currently
experiencing a decline.
• Part of the decline is due to not adapting to new types of data (e.g. unstructured text)
that require engineering or data science techniques to process and extract value.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
DATA ANALYTICS
• This has been the new term for business statistics since at least 1995, and it covers a
large spectrum of applications including fraud detection, advertising mix
modeling, attribution modeling, sales forecasts, cross-selling optimization
(retail), user segmentation, churn analysis, computing the lifetime value of a
customer and the cost of acquisition, etc.
• Except in big companies, data analyst is a junior role; these practitioners have
much narrower knowledge and experience than data scientists.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
Different Profiles of Analytics
BUSINESS ANALYTICS
• Same as data analytics, but restricted to business problems only.
• Tends to have a bit more of a financial, marketing or ROI flavor.
Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
DATA SCIENTIST vs. DATA ANALYST
Resource: http://www.edureka.co/blog/difference-between-data-scientist-and-data-analyst/
DATA SCIENTIST vs. DATA ANALYST
• A “Data Analyst” focuses on the movement and interpretation of
data, typically with a focus on the past and present.
• A “Data Scientist”, by contrast, may be primarily responsible for
summarizing data in such a way as to provide forecasting, or
insight into the future based on the patterns identified from past and
current data.
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
DATA SCIENTIST vs. DATA ANALYST
CRISP-DM Process Model
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
DATA SCIENTIST vs. DATA ANALYST
• Business understanding – Determine Business Objectives, Assess Situation,
Determine Data Mining Goals, Produce Project Plan
• Data understanding – Collect Initial Data, Describe Data, Explore Data, Verify
Data Quality
• Data preparation – Select Data, Clean Data, Construct Data, Integrate Data
• Modeling – Select modeling technique, Generate Test Design, Build Model, Assess
Model
• Evaluation – Evaluate Results, Review Process, Determine Next Steps
• Deployment – Plan Deployment, Plan Monitoring and Maintenance, Produce
Final Report, Review Project
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
DATA SCIENTIST vs. DATA ANALYST
• A Data Scientist is often heavily involved in the cleaning and manipulation
of data to support their modeling needs as well as the building and
evaluating of model designs which are intended to help guide changes in
business decisions.
• On the other hand, a Data Analyst may spend their time exploring data to
support troubleshooting efforts or to generate ideas for useful reports to
pitch to the customer.
• In general, while Data Analysts tend to be more Business focused, Data
Scientists are often Mathematically focused.
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
DATA SCIENTIST vs. DATA ANALYST
• Data Analysts typically perform data migration and visualization roles that
focus on describing the past, while Data Scientists typically perform roles that involve
manipulating data and creating models to improve the future.
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
DATA SCIENTIST vs. DATA ANALYST
• Situation:
• A large provider of streaming entertainment and data services wants to
improve call center performance and extract tactical business value from
call center data
• Project Domain:
• Logged performance data from the firm’s proprietary hardware platform
and call center data tied to specific customers and device IDs
• In the above case, the Data Analyst and the Data Scientist would both use the data,
but in different ways. The Data Analyst would be concerned with
reporting metrics, such as average call time, while the Data Scientist
would be concerned with using the historical data to predict the future,
such as predicting call volumes for future months. Both roles are equally
important to the operation of the call center and help find solutions for
the center to run smoothly. The figure below details some solutions each
role creates.
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
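The contrast in the call center case can be sketched in a few lines of Python. The first function is the kind of descriptive metric attributed to the Data Analyst (average call time); the second is a deliberately simple forecast of the kind attributed to the Data Scientist (next month's call volume). The data, the function names, and the choice of a drift forecast (last value plus the average month-over-month change) are assumptions for illustration; the source does not say which methods the firm used.

```python
from statistics import mean

# Hypothetical call center data.
call_durations_sec = [300, 420, 180, 240]
monthly_call_volumes = [1000, 1100, 1200, 1300]


def average_call_time(durations):
    """Descriptive metric about the past: mean call duration in seconds."""
    return mean(durations)


def drift_forecast(volumes):
    """Naive forecast of the next value: last value plus average past change."""
    avg_change = (volumes[-1] - volumes[0]) / (len(volumes) - 1)
    return volumes[-1] + avg_change


print("average call time (s):", average_call_time(call_durations_sec))
print("forecast for next month:", drift_forecast(monthly_call_volumes))
```

In practice the forecasting side would account for seasonality and other drivers of call volume; the point here is only the difference between summarizing the past and projecting forward.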
ANALYSIS vs. ANALYTICS
Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
Current Analytics Scenario
• 2013
• Significant rise in Data Analytics and Big Data initiatives across all sectors
in India. The finance, telecom, e-commerce and retail sectors showed higher
investments in data-related tools and technologies.
• Industries like healthcare, auto and manufacturing also increased their
data-related spend.
• Hiring picked up considerably, but demand still ran well ahead of supply
because of the sheer lack of trained analytics
professionals.
Stages in Analytics
So…..
Tech Term              Stat/OR   Source of Data     Size of Data    Typical Software
Data Analysis          May be    Manual             Usually small   SPSS, SYSTAT, etc.
Business Intelligence  No        Business process   Usually large   Business Objects, MicroStrategy, Pentaho, etc.
Data Mining            May be    Business process   Very large      Clementine, e-miner, etc.
Analytics              Yes       Business process   Large           SAS, R, etc.
• Hence, a hallmark of Analytics is application of statistics/OR in industry
setting using business process data.
• SAS is a widely used tool. R (open source) is growing fast.
Current Analytics Scenario
• 2013 – Key Facts
• Though SAS remained a popular tool, trends show that R has been
increasing its share of the market. Several analytics companies use both
SAS and R, depending on project and client preferences.
• The Big Data tools Hadoop and MapReduce were industry favourites.
Companies are investing in building this capability, but are not using it
effectively yet.
• Hadoop skills were in demand coupled with SAS and R.
• Web analytics, text analytics and social media analytics began to gain
popularity
Current Analytics Scenario
• 2014
• Demand for analytics is currently driven by businesses and
organizations that want to up-skill their employees. We predict that
in 2014, MBA colleges will place more emphasis on analytics as part
of their regular curriculum to address the demand-supply gap for
analytics professionals.
• A number of specialized analytics courses (such as the ones offered
by the Great Lakes Institute of Management, Chennai and the
Indian School of Business, Hyderabad) already gained
popularity in 2013. We predict that many more such courses will be
launched in 2014.
Current Analytics Salary
• Average entry-level salaries have increased 27% since 2013, from Rs.
520 thousand to Rs. 660 thousand per annum.
• Typically, there is a 250% increase in salary from entry-level analyst
to manager.
• Managers in analytics command an annual salary upward of Rs. 1500
thousand.
• At senior levels, annual salaries are upward of Rs. 2500 thousand,
which is more than a 60% increase from a manager's salary.
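The percentage figures above follow the usual percent-increase formula, (new − old) / old × 100. The short sketch below applies it to two of the quoted pairs of numbers. Treating the "upward of" salaries as exact values is a simplifying assumption, and the function name `percent_increase` is just an illustrative choice.

```python
def percent_increase(old, new):
    """Percentage increase from `old` to `new`."""
    return (new - old) / old * 100


# Salary figures quoted in the slide, in Rs. thousand per annum.
entry_2013, entry_now = 520, 660
manager_floor, senior_floor = 1500, 2500  # "upward of" values, used as exact here

print(round(percent_increase(entry_2013, entry_now), 1))      # 26.9
print(round(percent_increase(manager_floor, senior_floor), 1))  # 66.7
```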
Let’s Conclude for Today
• Salaries will continue to increase and we will see professionals from
other sectors honing their analytics skills and switching careers.
• Analytics will permeate India Inc. so thoroughly that, 15 years from
now, we predict that professionals with no analytics and big
data skills will have no scope for growth.
• This seems harsh, but that is how dynamic data analytics is and how
pivotal data-backed decision making will become.
In God We Trust... All others must bring "Data"
Srikanth Ayithy
about.me/srikanthayithy

Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 

Analytics

  • 1. IT’S ALL ABOUT DATA & ANALYTICS
  • 2. Different Profiles of Analytics DATA SCIENCE • Job titles: data scientist, chief scientist, senior analyst, director of analytics, etc. • Industries: digital analytics, search technology, marketing, fraud detection, astronomy, energy, healthcare, social networks, finance, forensics, security (NSA), mobile, telecommunications, and weather forecasts. • Projects: taxonomy creation (text mining, big data), clustering applied to big data sets, recommendation engines, simulations, rule systems for statistical scoring engines, root cause analysis, automated bidding, forensics, exoplanet detection, and early detection of terrorist activity or pandemics. • Main components are machine-to-machine communications and automation. • Overlaps with computer science, statistics, machine learning, data mining, operations research, and business intelligence. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 3. Different Profiles of Analytics MACHINE LEARNING - Very popular computer science discipline • Part of data science and closely related to data mining. Machine learning is about designing algorithms (like data mining), but the emphasis is on prototyping algorithms for production mode and designing automated systems. • Python is now a popular language for ML development projects. • Core algorithms include clustering and supervised classification, rule systems, and scoring techniques. • A sub-domain close to artificial intelligence is deep learning. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 4. Different Profiles of Analytics DATA MINING • Designing algorithms to extract insights from rather large and potentially unstructured data (text mining), sometimes called nugget discovery. • Techniques include pattern recognition, feature selection, clustering, and supervised classification, along with a few statistical techniques. • Data mining thus has some intersection with statistics, and it is a subset of data science. • Data miners use open-source software such as RapidMiner. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 5. Different Profiles of Analytics PREDICTIVE MODELING • Predictive modeling projects occur in all industries across all disciplines. • Aims at predicting the future based on past data, usually but not always using statistical modeling. • Predictions often come with confidence intervals. • The roots of predictive modeling are in statistical science. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 6. Different Profiles of Analytics STATISTICS • Losing ground to data science, industrial statistics, operations research, data mining, and machine learning, where the same clustering, cross-validation and statistical training techniques are used, albeit in a more automated way and on bigger data. • Many professionals who were called statisticians 10 years ago have seen their job title changed to data scientist or analyst in the last few years. • Modern sub-domains include statistical computing, statistical learning (closer to machine learning), computational statistics (closer to data science), data-driven (model-free) inference, sport statistics, and Bayesian statistics. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 7. Different Profiles of Analytics INDUSTRIAL STATISTICS • Statistics frequently performed by non-statisticians (engineers with good statistical training), working on engineering projects such as yield optimization or load balancing (system analysts). They use very applied statistics, and their framework is closer to Six Sigma, quality control and operations research than to traditional statistics. Also found in oil and manufacturing industries. • Techniques used include time series, ANOVA, experimental design, survival analysis, signal processing (filtering, noise removal, deconvolution), spatial models, simulation, Markov chains, and risk and reliability models. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 8. Different Profiles of Analytics ACTUARIAL SCIENCES • Just a subset of statistics focusing on insurance (car, health, etc.). • Uses survival models: predicting when you will die and what your health expenditures will be, based on your health status (smoker, gender, previous diseases), to determine your insurance premiums. • Also predicts extreme floods and weather events to determine premiums. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 9. Different Profiles of Analytics HPC • High performance computing is not a discipline per se, but it should be of concern to data scientists, big data practitioners, computer scientists and mathematicians, as it can redefine the computing paradigms in these fields. • HPC should not be confused with Hadoop and MapReduce: HPC is hardware-related, Hadoop is software-related (though heavily relying on Internet bandwidth and server configuration and proximity). Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 10. Different Profiles of Analytics OPERATIONS RESEARCH (OR) • It is about decision science and optimizing traditional business projects: inventory management, supply chain, pricing. Practitioners make heavy use of Markov chain models, Monte Carlo simulations, queuing and graph theory, and software such as AIMS, Matlab or Informatica. • Big, traditional old companies use OR; new and small ones (start-ups) use data science to handle pricing, inventory management or supply chain problems. • Car traffic optimization is a modern example of an OR problem, solved with simulations, commuter surveys, sensor data and statistical modeling. • OR has a significant overlap with Six Sigma, also solves econometric problems, and has many practitioners and applications in the army and defense sectors. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 11. Different Profiles of Analytics ECONOMETRICS • Econometrics is heavily statistical in nature, using time series models such as auto-regressive processes. • Also overlapping with operations research (itself overlapping with statistics!) and mathematical optimization (simplex algorithm). • Econometricians like ROC and efficiency curves. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 12. Different Profiles of Analytics DATA ENGINEERING • Performed by software engineers (developers) or architects (designers) in large organizations (sometimes by data scientists in tiny companies). • A sub-domain currently under attack is data warehousing, as this term is associated with static, siloed conventional databases, data architectures, and data flows, threatened by the rise of NoSQL, NewSQL and graph databases. • Transforming these old architectures into new ones (only when needed), or making them compatible with new ones, is a lucrative business. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 13. Different Profiles of Analytics BUSINESS INTELLIGENCE Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 14. Different Profiles of Analytics BUSINESS INTELLIGENCE • Focuses on dashboard creation, metric selection, producing and scheduling data reports (statistical summaries) sent by email or delivered/presented to executives, competitive intelligence (analyzing third-party data), as well as involvement in database schema design (working with data architects) to collect useful, actionable business data efficiently. • The typical job title is Business Analyst. • Some are more involved with marketing, product or finance (forecasting sales and revenue). • Some have learned advanced statistics such as time series, but most only use (and need) basic stats and light analytics, relying on IT to maintain databases and harvest data. • BI and market research (but not competitive intelligence) are currently experiencing a decline. • Part of the decline is due to not adapting to new types of data (e.g. unstructured text) that require engineering or data science techniques to process and extract value. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 15. Different Profiles of Analytics DATA ANALYTICS • This has been the new term for business statistics since at least 1995, and it covers a large spectrum of applications including fraud detection, advertising mix modeling, attribution modeling, sales forecasts, cross-selling optimization (retail), user segmentation, churn analysis, computing the lifetime value of a customer and the cost of acquisition, etc. • Except in big companies, data analyst is a junior role; these practitioners have much narrower knowledge and experience than data scientists. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 16. Different Profiles of Analytics BUSINESS ANALYTICS • Same as data analysis, but restricted to business problems only. • Tends to have a bit more of a financial, marketing or ROI flavor. Resource: http://www.datasciencecentral.com/profiles/blogs/17-analytic-disciplines-compared
  • 17. DATA SCIENTIST vs. DATA ANALYST Resource: http://www.edureka.co/blog/difference-between-data-scientist-and-data-analyst/
  • 18. DATA SCIENTIST vs. DATA ANALYST • A “Data Analyst” focuses on the movement and interpretation of data, typically with a focus on the past and present. • Alternatively, a “Data Scientist” may be primarily responsible for summarizing data in such a way as to provide forecasting, or an insight into the future, based on the patterns identified from past and current data. Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 19. DATA SCIENTIST vs. DATA ANALYST CRISP-DM Process Model Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 20. DATA SCIENTIST vs. DATA ANALYST • Business understanding – Determine Business Objectives, Assess Situation, Determine Data Mining Goals, Produce Project Plan • Data understanding – Collect Initial Data, Describe Data, Explore Data, Verify Data Quality • Data preparation – Select Data, Clean Data, Construct Data, Integrate Data • Modeling – Select Modeling Technique, Generate Test Design, Build Model, Assess Model • Evaluation – Evaluate Results, Review Process, Determine Next Steps • Deployment – Plan Deployment, Plan Monitoring and Maintenance, Produce Final Report, Review Project Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 21. DATA SCIENTIST vs. DATA ANALYST • A Data Scientist is often heavily involved in the cleaning and manipulation of data to support their modeling needs as well as the building and evaluating of model designs which are intended to help guide changes in business decisions. • On the other hand, a Data Analyst may spend their time exploring data to support troubleshooting efforts or to generate ideas for useful reports to pitch to the customer. • In general, while Data Analysts tend to be more Business focused, Data Scientists are often Mathematically focused. Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 22. DATA SCIENTIST vs. DATA ANALYST Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 23. DATA SCIENTIST vs. DATA ANALYST • Data Analysts typically perform data migration and visualization roles that focus on describing the past; while Data Scientists typically perform roles manipulating data and creating models to improve the future. Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 24. DATA SCIENTIST vs. DATA ANALYST • Situation: • A large provider of streaming entertainment and data services wants to improve call center performance and extract tactical business value from call center data. • Project Domain: • Logged performance data from the firm’s proprietary hardware platform and call center data tied to specific customers and device IDs. • In the above case, the Data Analyst and the Data Scientist would both use data, but in different ways. The Data Analyst would be concerned with reporting metrics, such as average call time, while the Data Scientist would be concerned with using the historical data to predict the future, such as forecasting call volumes for future months. Both roles are equally important to the operation of the call center and help find solutions for the center to run smoothly. The figure below details some solutions each role creates. Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 25. ANALYSIS vs. ANALYTICS Resource: http://captechconsulting.com/blog/richard-rivera/data-scientist-vs-data-analyst
  • 26. Current Analytics Scenario • 2013 • Significant rise in data analytics and big data initiatives across all sectors in India. The finance, telecom, e-commerce and retail sectors showed higher investments in data-related tools and technologies. • Industries like healthcare, auto and manufacturing also increased their data-related spend. • Hiring picked up considerably, but the demand-supply gap remained large given the shortage of trained analytics professionals.
  • 28. So…
      Tech Term | Stat/OR | Source of Data | Size of Data | Typical Software
      Data Analysis | May be | Manual | Usually small | SPSS, SYSTAT etc.
      Business Intelligence | No | Business process | Usually large | Business Objects, MicroStrategy, Pentaho etc.
      Data Mining | May be | Business process | Very large | Clementine, e-miner etc.
      Analytics | Yes | Business process | Large | SAS, R etc.
    • Hence, a hallmark of analytics is the application of statistics/OR in an industry setting using business process data. • SAS is a widely used tool. R (open source) is growing fast.
  • 30. Current Analytics Scenario • 2013 – Key Facts • Though SAS remained a popular tool, trends show that R has been increasing its share of the market. Several analytics companies use both SAS and R, depending on project and client preferences. • The big data tools Hadoop and MapReduce were industry favourites. Companies are investing in building this capability, but are not using it effectively yet. • Hadoop skills were in demand, coupled with SAS and R. • Web analytics, text analytics and social media analytics began to gain popularity.
  • 31. Current Analytics Scenario • 2014 • Demand for analytics is currently driven by businesses and organizations that want to up-skill their employees. We predict that in 2014, MBA colleges will place more emphasis on analytics as part of their regular curriculum to address the demand-supply gap for analytics professionals. • A number of specialized analytics courses (such as the ones offered by the Great Lakes Institute of Management, Chennai, and the Indian School of Business, Hyderabad) have already gained popularity in 2013. We predict that many more such courses will be launched in 2014.
  • 36. Current Analytics Salary • Average entry-level salaries have increased 27% since 2013, from Rs. 520 thousand to Rs. 660 thousand per annum. • Typically, there is a 250% increase in salary from entry-level analyst to manager. • Managers in analytics command an annual salary upward of Rs. 1500 thousand. • At senior levels, annual salaries are upward of Rs. 2500 thousand, which is more than a 60% increase over a manager's salary.
  • 37. Let’s Conclude for Today • Salaries will continue to increase, and we will see professionals from other sectors honing their analytics skills and switching careers. • The scope of analytics will so permeate India Inc. that, 15 years from now, we predict that professionals with no analytics and big data skills will have no scope for growth. • This seems harsh, but that is how dynamic data analytics is and how pivotal data-backed decision making will become.
  • 38. In God We Trust... All Others Must Bring "Data" Srikanth Ayithy about.me/srikanthayithy
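
The percentages quoted on slide 36 can be checked with a few lines of arithmetic. The sketch below is only an illustration: the helper name `pct_increase` and the variable names are hypothetical, and the figures are the ones stated on the slide, in thousands of rupees per annum. The manager and senior figures are quoted as lower bounds ("upward of"), so the derived values are indicative only.

```python
def pct_increase(old: float, new: float) -> float:
    """Percentage increase going from `old` to `new`."""
    return (new - old) / old * 100.0


# Figures quoted on slide 36 (Rs. thousand per annum).
entry_2013 = 520      # average entry-level salary in 2013
entry_now = 660       # average entry-level salary at the time of the deck
manager_floor = 1500  # "upward of" figure for managers (a lower bound)
senior_floor = 2500   # "upward of" figure for senior levels (a lower bound)

# Entry-level growth since 2013: the slide quotes 27%.
entry_growth = pct_increase(entry_2013, entry_now)

# Senior level versus manager: the slide quotes "more than 60%".
senior_vs_manager = pct_increase(manager_floor, senior_floor)

# A 250% increase on the entry-level figure implies this manager salary,
# which sits above the quoted Rs. 1500 thousand lower bound.
implied_manager = entry_now * (1 + 250 / 100)

print(f"Entry-level growth since 2013: {entry_growth:.1f}%")
print(f"Senior vs. manager floor: {senior_vs_manager:.1f}%")
print(f"Manager salary implied by a 250% increase: Rs. {implied_manager:.0f} thousand")
```

Working from the two lower bounds alone, the senior-level floor is about two thirds higher than the manager floor, which agrees with the "more than 60%" statement on the slide.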