Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
data science @ The New York Times
chris.wiggins@columbia.edu
chris.wiggins@nytimes.com
@chrishwiggins
references: bit.ly/b...
data science @ The New York Times
data science @ The New York Times
“data science”
jobs, jobs, jobs
“data science”
jobs, jobs, jobs
data science: mindset & toolset
drew conway, 2010
modern history:
2009
modern history:
2009
“data science”
ancient history: 2001
“data science”
ancient history: 2001
data science
context
home schooled
B.A. & M.Sc. from Brown
PhD in topology
“By the end of late 1945, I was a
statistician rather than a topologist”
invented: “bit”
invented: “software”
invented: “FFT”
“the progenitor of data science.” - @mshron
“The Future of Data Analysis,” 1962
John W. Tukey
introduces:
“Exploratory data anlaysis”
Tukey 1965, via John Chambers
TUKEY BEGAT S WHICH BEGAT R
Tukey 1972
Tukey 1975
In 1975, while at Princeton, Tufte was asked to teach a
statistics course to a group of journalists who were vi...
TUKEY BEGAT VDQI
Tukey 1977
TUKEY BEGAT EDA
fast forward -> 2001
“The primary agents for change should be
university departments themselves.”
data science @ The New York Timeshistories
1. slow burn @Bell: as heretical
statistics (see also Breiman)
2. caught fire 2...
biology: 1892 vs. 1995
biology: 1892 vs. 1995
biology changed for good.
biology: 1892 vs. 1995
new toolset, new mindset
genetics: 1837 vs. 2012
ML toolset; data science mindset
genetics: 1837 vs. 2012
genetics: 1837 vs. 2012
ML toolset; data science mindset
arxiv.org/abs/1105.5821 ; github.com/rajanil/mkboost
data science: mindset & toolset
1851
news: 20th century
church state
church
church
church
news: 20th century
church state
news: 21st century
church state
engineering
1851 1996
newspapering: 1851 vs. 1996
example:
millions of views per hour2015
"...social activities generate large quantities of potentially
valuable data...The data were not generated for the
purpose...
"...social activities generate large quantities of potentially
valuable data...The data were not generated for the
purpose...
data science: the web
data science: the web
is your “online presence”
data science: the web
is a microscope
data science: the web
is an experimental tool
1851 1996
newspapering: 1851 vs. 1996 vs. 2008
2008
“a startup is a temporary organization in search of a
repeatable and scalable business model” —Steve Blank
every publisher is now a startup
every publisher is now a startup
news: 21st century
church state
engineering
news: 21st century
church state
engineering
learnings
learnings
- predictive modeling
- descriptive modeling
- prescriptive modeling
(actually ML, shhhh…)
- (supervised learning)
- (unsupervised learning)
- (reinforcement learning)
learnings
- predictive modeling
- descriptive modeling
- prescriptive modeling
cf. modelingsocialdata.org
predictive modeling, e.g.,
cf. modelingsocialdata.org
predictive modeling, e.g.,
“the funnel”
cf. modelingsocialdata.org
interpretable predictive modeling
supercoolstuff
cf. modelingsocialdata.org
interpretable predictive modeling
supercoolstuff
cf. modelingsocialdata.org
arxiv.org/abs/q-bio/0701021
optimization & learning, e.g.,
“How The New York Times Works “popular mechanics, 2015
optimization & prediction, e.g.,
“How The New York Times Works “popular mechanics, 2015
(some models)
(somemoneys)
recommendation as predictive modeling
recommendation as predictive modeling
bit.ly/AlexCTM
descriptive modeling, e.g,
cf. daeilkim.com ; import bnpy
modeling your audience
bit.ly/Hughes-Kim-Sudderth-AISTATS15
modeling your audience
(optimization, ultimately)
also allows insight+targeting as inference
modeling your audience
prescriptive modeling
prescriptive modeling
cf. modelingsocialdata.org
prescriptive modeling
aka “A/B testing”;
RCT
cf. modelingsocialdata.org
prescriptive modeling, e.g,
prescriptive modeling, e.g,
prescriptive modeling, e.g,
Reporting
Learning
Test
Optimizing
Exploredescriptive:
predictive:
prescriptive:
Reporting
Learning
Test
Optimizing
Exploredescriptive:
predictive:
prescriptive:
common requirements in
data science:
common requirements in
data science:
1. people
2. ideas
3. things
cf. John Boyd, USAF
data science: ideas
data skills
data science and…
- data engineering
- data embeds
- data product
- data multiliteracies
cf. “data scientists ...
data science: ideas
- new mindset > new toolset
data science: people
thanks to the data science team!
data science @ The New York Times
chris.wiggins@columbia.edu
chris.wiggins@nytimes.com
@chrishwiggins
references: bit.ly/b...
data science @NYT ; inaugural Data Science Initiative Lecture
data science @NYT ; inaugural Data Science Initiative Lecture
You’ve finished this document.
Download and read it offline.
Upcoming SlideShare
CSS Grid Layout
Next
Upcoming SlideShare
CSS Grid Layout
Next
Download to read offline and view in fullscreen.

Share

data science @NYT ; inaugural Data Science Initiative Lecture

Download to read offline

inaugural Data Science Initiative Lecture @ Brown University
2015-12-04
https://www.eventbrite.com/e/data-science-at-the-new-york-times-tickets-19490272931

Related Books

Free with a 30 day trial from Scribd

See all

data science @NYT ; inaugural Data Science Initiative Lecture

  1. data science @ The New York Times chris.wiggins@columbia.edu chris.wiggins@nytimes.com @chrishwiggins references: bit.ly/brown-refs
  2. data science @ The New York Times
  3. data science @ The New York Times
  4. “data science” jobs, jobs, jobs
  5. “data science” jobs, jobs, jobs
  6. data science: mindset & toolset drew conway, 2010
  7. modern history: 2009
  8. modern history: 2009
  9. “data science” ancient history: 2001
  10. “data science” ancient history: 2001
  11. data science context
  12. home schooled
  13. B.A. & M.Sc. from Brown
  14. PhD in topology
  15. “By the end of late 1945, I was a statistician rather than a topologist”
  16. invented: “bit”
  17. invented: “software”
  18. invented: “FFT”
  19. “the progenitor of data science.” - @mshron
  20. “The Future of Data Analysis,” 1962 John W. Tukey
  21. introduces: “Exploratory data anlaysis”
  22. Tukey 1965, via John Chambers
  23. TUKEY BEGAT S WHICH BEGAT R
  24. Tukey 1972
  25. Tukey 1975 In 1975, while at Princeton, Tufte was asked to teach a statistics course to a group of journalists who were visiting the school to study economics. He developed a set of readings and lectures on statistical graphics, which he further developed in joint seminars he subsequently taught with renowned statistician John Tukey (a pioneer in the field of information design). These course materials became the foundation for his first book on information design, The Visual Display of Quantitative Information
  26. TUKEY BEGAT VDQI
  27. Tukey 1977
  28. TUKEY BEGAT EDA
  29. fast forward -> 2001
  30. “The primary agents for change should be university departments themselves.”
  31. data science @ The New York Timeshistories 1. slow burn @Bell: as heretical statistics (see also Breiman) 2. caught fire 2009-now: as job description historical rant: bit.ly/data-rant
  32. biology: 1892 vs. 1995
  33. biology: 1892 vs. 1995 biology changed for good.
  34. biology: 1892 vs. 1995 new toolset, new mindset
  35. genetics: 1837 vs. 2012 ML toolset; data science mindset
  36. genetics: 1837 vs. 2012
  37. genetics: 1837 vs. 2012 ML toolset; data science mindset arxiv.org/abs/1105.5821 ; github.com/rajanil/mkboost
  38. data science: mindset & toolset
  39. 1851
  40. news: 20th century church state
  41. church
  42. church
  43. church
  44. news: 20th century church state
  45. news: 21st century church state engineering
  46. 1851 1996 newspapering: 1851 vs. 1996
  47. example: millions of views per hour2015
  48. "...social activities generate large quantities of potentially valuable data...The data were not generated for the purpose of learning; however, the potential for learning is great’’
  49. "...social activities generate large quantities of potentially valuable data...The data were not generated for the purpose of learning; however, the potential for learning is great’’ - J Chambers, Bell Labs,1993
  50. data science: the web
  51. data science: the web is your “online presence”
  52. data science: the web is a microscope
  53. data science: the web is an experimental tool
  54. 1851 1996 newspapering: 1851 vs. 1996 vs. 2008 2008
  55. “a startup is a temporary organization in search of a repeatable and scalable business model” —Steve Blank
  56. every publisher is now a startup
  57. every publisher is now a startup
  58. news: 21st century church state engineering
  59. news: 21st century church state engineering
  60. learnings
  61. learnings - predictive modeling - descriptive modeling - prescriptive modeling
  62. (actually ML, shhhh…) - (supervised learning) - (unsupervised learning) - (reinforcement learning)
  63. learnings - predictive modeling - descriptive modeling - prescriptive modeling cf. modelingsocialdata.org
  64. predictive modeling, e.g., cf. modelingsocialdata.org
  65. predictive modeling, e.g., “the funnel” cf. modelingsocialdata.org
  66. interpretable predictive modeling supercoolstuff cf. modelingsocialdata.org
  67. interpretable predictive modeling supercoolstuff cf. modelingsocialdata.org arxiv.org/abs/q-bio/0701021
  68. optimization & learning, e.g., “How The New York Times Works “popular mechanics, 2015
  69. optimization & prediction, e.g., “How The New York Times Works “popular mechanics, 2015 (some models) (somemoneys)
  70. recommendation as predictive modeling
  71. recommendation as predictive modeling bit.ly/AlexCTM
  72. descriptive modeling, e.g, cf. daeilkim.com ; import bnpy
  73. modeling your audience bit.ly/Hughes-Kim-Sudderth-AISTATS15
  74. modeling your audience (optimization, ultimately)
  75. also allows insight+targeting as inference modeling your audience
  76. prescriptive modeling
  77. prescriptive modeling cf. modelingsocialdata.org
  78. prescriptive modeling aka “A/B testing”; RCT cf. modelingsocialdata.org
  79. prescriptive modeling, e.g,
  80. prescriptive modeling, e.g,
  81. prescriptive modeling, e.g,
  82. Reporting Learning Test Optimizing Exploredescriptive: predictive: prescriptive:
  83. Reporting Learning Test Optimizing Exploredescriptive: predictive: prescriptive:
  84. common requirements in data science:
  85. common requirements in data science: 1. people 2. ideas 3. things cf. John Boyd, USAF
  86. data science: ideas
  87. data skills data science and… - data engineering - data embeds - data product - data multiliteracies cf. “data scientists at work”, ch 1
  88. data science: ideas - new mindset > new toolset
  89. data science: people
  90. thanks to the data science team!
  91. data science @ The New York Times chris.wiggins@columbia.edu chris.wiggins@nytimes.com @chrishwiggins references: bit.ly/brown-refs
  • AsgharAliGujjar

    Oct. 24, 2021
  • CisnerosOctavio

    Oct. 3, 2021
  • OdoteiKwaoOdoi

    Sep. 9, 2021
  • SheenaMaeTangcora

    Aug. 13, 2021
  • LauraVVanLaningham

    Jul. 3, 2021
  • AamirSuhail45

    May. 20, 2021
  • PasutPiyapasut

    May. 9, 2021
  • BatsaikhanJargalsaikhan

    Apr. 20, 2021
  • JoseS34

    Apr. 10, 2021
  • AbouCamara5

    Apr. 5, 2021
  • RoyalSbachl1

    Mar. 20, 2021
  • ssuser9b3840

    Mar. 15, 2021
  • 734003128

    Mar. 2, 2021
  • ranandhan

    Feb. 23, 2021
  • VaniaAquize

    Feb. 10, 2021
  • AhmedElhaga

    Feb. 1, 2021
  • HariyapuReddy

    Jan. 17, 2021
  • TagesuDagefe

    Dec. 23, 2020
  • KingshukSantra

    Dec. 20, 2020
  • virendrakorikori

    Dec. 12, 2020

inaugural Data Science Initiative Lecture @ Brown University 2015-12-04 https://www.eventbrite.com/e/data-science-at-the-new-york-times-tickets-19490272931

Views

Total views

206,061

On Slideshare

0

From embeds

0

Number of embeds

550

Actions

Downloads

382

Shares

0

Comments

0

Likes

530

×