Public policy analytics code and context for data science in government

Public Policy Analytics: Code & Context for Data Science in Government teaches readers how to address complex public policy problems with data and analytics using reproducible methods in R. Each of the eight chapters provides a detailed case study, showing readers: how to develop exploratory ind...

Full description

Bibliographic Details
Main Author: Steif, Ken
Format: eBook
Language:English
Published: Boca Raton CRC Press 2021
Series:Chapman & Hall/CRC data science series
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 07889nmm a2200589 u 4500
001 EB002067407
003 EBX01000000000000001207497
005 00000000000000.0
007 cr|||||||||||||||||||||
008 220922 ||| eng
020 |a 9781000401578 
020 |a 100305465X 
020 |a 9781003054658 
020 |a 100040157X 
020 |a 9781000401615 
020 |a 1000401618 
050 4 |a JK468.A8 
100 1 |a Steif, Ken 
245 0 0 |a Public policy analytics  |b code and context for data science in government  |c Ken Steif 
260 |a Boca Raton  |b CRC Press  |c 2021 
300 |a 1 online resource  |b illustrations 
505 0 |a Includes bibliographical references and index 
505 0 |a Lancaster development<br>2.1.1 The bid-rent model<br>2.1.2 Setup Lancaster data <br>2.2 Identifying areas inside & outside of the Urban Growth Area <br>2.2.1 Associate each inside/outside buffer with its respective town<br>2.2.2 Building density by town & by inside/outside the UGA <br>2.2.3 Visualize buildings inside & outside the UGA<br>2.3 Return to Lancaster's Bid Rent <br>2.4 Conclusion -- On boundaries <br>2.5 Assignment -- Boundaries in your community </p><p><strong>Intro to geospatial machine learning, Part 1 </strong><br>3.1 Machine learning as a Planning tool <br>3.1.1 Accuracy & generalizability <br>3.1.2 The machine learning process <br>3.1.3 The hedonic model <br>3.2 Data wrangling -- Home price & crime data <br>3.2.1 Feature Engineering --  
505 0 |a Predict risk </p><p><strong>People-based ML models</strong><br>6.1 Bounce to work<br>6.2 Exploratory analysis <br>6.3 Logistic regression<br>6.3.1 Training/Testing sets <br>6.3.2 Estimate a churn model <br>6.4 Goodness of Fit <br>6.4.1 Roc Curves <br>6.5 Cross-validation <br>6.6 Generating costs and benefits <br>6.6.1 Optimizing the cost/benefit relationship <br>6.7 Conclusion -- churn <br>6.8 Assignment -- Target a subsidy </p><p><strong>People-Based ML Models: Algorithmic Fairness</strong><br>7.1 Introduction <br>7.1.1 The spectre of disparate impact <br>7.1.2 Modeling judicial outcomes <br>7.1.3 Accuracy and generalizability in recidivism algorithms <br>7.2 Data and exploratory analysis <br>7.3 Estimate two recidivism models <br>7.3.1 Accuracy & Generalizability <br>7.4 What about the threshold?</p><p>7.5 Optimizing 'equitable' thresholds <br>7.6 Assignment -- Memo to the Mayor </p><p><br><strong>Predicting rideshare demand</strong><br>8.1 Introduction --  
505 0 |a ride share <br>8.2 Data Wrangling -- ride share <br>8.2.1 Lubridate<br>8.2.2 Weather data <br>8.2.3 Subset a study area using neighborhoods <br>8.2.4 Create the final space/time panel <br>8.2.5 Split training and test<br>8.2.6 What about distance features? <br>8.3 Exploratory Analysis -- ride share <br>8.3.1 Trip-Count serial autocorrelation <br>8.3.2 Trip-Count spatial autocorrelation <br>8.3.3 Space/time correlation? <br>8.3.4 Weather<br>8.4 Modeling and validation using purrr::map<br>8.4.1 A short primer on nested tibbles <br>8.4.2 Estimate a ride share forecast <br>8.4.3 Validate test set by time <br>8.4.4 Validate test set by space <br>8.5 Conclusion -- Dispatch<br>8.6 Assignment -- Predict bike share trips</p><p>Conclusion -- Algorithmic Governance </p><p>Index</p> 
505 0 |a <P>Preface </p><p>Introduction <br><br><strong>Indicators for Transit Oriented Development </strong><br>1.1 Why Start With Indicators? <br>1.1.1 Mapping & scale bias in areal aggregate data <br>1.2 Setup <br>1.2.1 Downloading & wrangling Census data <br>1.2.2 Wrangling transit open data <br>1.2.3 Relating tracts & subway stops in space <br>1.3 Developing TOD Indicators <br>1.3.1 TOD indicator maps <br>1.3.2 TOD indicator tables <br>1.3.3 TOD indicator plots <br>1.4 Capturing three submarkets of interest <br>1.5 Conclusion: Are Philadelphians willing to pay for TOD? <br>1.6 Assignment -- Study TOD in your city <br><br><br><strong>Expanding the Urban Growth Boundary</strong><br>2.1 Introduction --  
505 0 |a Measuring exposure to crime <br>3.2.2 Exploratory analysis: Correlation<br>3.3 Introduction to Ordinary Least Squares Regression <br>3.3.1 Our first regression model<br>3.3.2 More feature engineering & colinearity <br>3.4 Cross-validation & return to goodness of fit<br>3.4.1 Accuracy -- Mean Absolute Error <br>3.4.2 Generalizability -- Cross-validation <br>3.5 Conclusion -- Our first model <br>3.6 Assignment -- Predict house prices </p><p><strong>Intro to geospatial machine learning, Part 2</strong><br>4.1 On the spatial process of home prices <br>4.1.1 Setup & Data Wrangling <br>4.2 Do prices & errors cluster? The Spatial Lag<br>4.2.1 Do model errors cluster? -- Moran's I<br>4.3 Accounting for neighborhood <br>4.3.1 Accuracy of the neighborhood model <br>4.3.2 Spatial autocorrelation in the neighborhood model <br>4.3.3 Generalizability of the neighborhood model<br>4.4 Conclusion -- Features at multiple scales</p><p><strong>Geospatial risk modeling --  
505 0 |a Predictive Policing </strong><br>5.1 New predictive policing tools <br>5.1.1 Generalizability in geospatial risk models <br>5.1.2 From Broken Windows Theory to Broken Windows Policing <br>5.1.3 Setup <br>5.2 Data wrangling: Creating the fishnet<br>5.2.1 Data wrangling: Joining burglaries to the fishnet <br>5.2.2 Wrangling risk factors <br>5.3 Feature engineering -- Count of risk factors by grid cell <br>5.3.1 Feature engineering -- Nearest neighbor features <br>5.3.2 Feature Engineering -- Measure distance to one point <br>5.3.3 Feature Engineering -- Create the final-net <br>5.4 Exploring the spatial process of burglary <br>5.4.1 Correlation tests <br>5.5 Poisson Regression <br>5.5.1 Cross-validated Poisson Regression <br>5.5.2 Accuracy & Generalzability <br>5.5.3 Generalizability by neighborhood context<br>5.5.4 Does this model allocate better than traditional crime hotspots? <br>5.6 Conclusion -- Bias but useful? <br>5.7 Assignment --  
653 |a BUSINESS & ECONOMICS / Statistics / bisacsh 
653 |a Politics and government / Data processing / fast 
653 |a United States / fast / https://id.oclc.org/worldcat/entity/E39PBJtxgQXMWqmjMjjwXRHgrq 
653 |a Politique publique / Informatique 
653 |a Political planning / Data processing 
653 |a Political planning / Data processing / fast 
653 |a États-Unis / Politique et gouvernement / Informatique 
653 |a United States / Politics and government / Data processing 
653 |a COMPUTERS / Computer Graphics / General / bisacsh 
653 |a COMPUTERS / General / bisacsh 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
490 0 |a Chapman & Hall/CRC data science series 
015 |a GBC177054 
776 |z 9781000401578 
776 |z 9781003054658 
776 |z 9781000401615 
776 |z 9780367507619 
776 |z 9780367516253 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781000401615/?ar  |x Verlag  |3 Volltext 
082 0 |a 519.5 
082 0 |a 352.3/80285 
082 0 |a 330 
520 |a Public Policy Analytics: Code & Context for Data Science in Government teaches readers how to address complex public policy problems with data and analytics using reproducible methods in R. Each of the eight chapters provides a detailed case study, showing readers: how to develop exploratory indicators; understand spatial process' and develop spatial analytics; how to develop useful' predictive analytics; how to convey these outputs to non-technical decision-makers through the medium of data visualization; and why, ultimately, data science and Planning' are one and the same. A graduate-level introduction to data science, this book will appeal to researchers and data scientists at the intersection of data analytics and public policy, as well as readers who wish to understand how algorithms will affect the future of government