JNB Lab Solutions

5.11. JNB Lab Solutions#

Exercise 1a

Read in Homicide data and drop rows that have missing data.

	Date of Incident	Age	Gender	Race	Primary Cause	Residence City	Incident Zip Code
0	2017-02-26 10:48:00	23.0	Male	Black	MULTIPLE GUNSHOT WOUNDS	Chicago	60623.0

Extract the month and year of homicide incidents.

	Date of Incident	Age	Gender	Race	Primary Cause	Residence City	Incident Zip Code	new_date	year	month	day
0	2017-02-26 10:48:00	23.0	Male	Black	MULTIPLE GUNSHOT WOUNDS	Chicago	60623.0	2017-02-26	2017	2	26

Get Chicago data, select the columns [“Age”,“Gender”,“Race”,“Primary Cause”,“Incident Zip Code”,“year”,“month”,“day”] and use integer format

	Age	Gender	Race	Primary Cause	Incident Zip Code	year	month	day
0	23	Male	Black	MULTIPLE GUNSHOT WOUNDS	60623	2017	2	26

Get the number of Chicago homicides in July 2020.

Chicago Homicides In July 2020:  89

Get the number of homicides which occurred on each day of July between 2015 and 2019.

  0
  1
  3
  1
  4
dtype: int32

Get frequency distributions for the number of homicides in a day

  0.200000
  0.309677
  0.258065
  0.141935
  0.051613
  0.012903
  0.012903
  0.012903
dtype: float64

0.2

Do a Monte Carlo simulation for monthly homicide counts in July based on 31 random draws from the respective empirical distributions.

Make a histogram for July.

Text(0.5, 1.0, '10,000 Simulated July Homicide Counts based on 2015-2019 Data')

../../_images/4e313f928b8b238c07d155abbf6ea6f68487dff595b748c1067ce7f22a317069.png

Exercise 1b

b) Find the empirical p-value of getting at least 108 homicides (the number in July 2020.)

Probaility that a random draw of 31 days from July 2015-2020 distibution results in 85 or more homicides: 0.0

Exercise 2

a) Make a histogram of the difference July homicide count - May homicide count

../../_images/353aa8a91f383dbe32cccc8557bd17b98f676f62c4ca6f32718e1ad05cc57124.png

Find the probability that July will have at least n more homicides than May \((0\le n \le 20)\).

   0.7409
   0.7091
   0.6766
   0.6436
   0.6077
   0.5694
   0.5298
   0.4885
   0.4529
   0.4168
  0.3796
  0.3436
  0.3082
  0.2773
  0.2459
  0.2171
  0.1889
  0.1677
  0.1461
  0.1246
  0.1045
dtype: float64

b) Make a plot of these probabilities

c) Based on 2015-2019 data, there is a 10% chance of 20 more homicides in July than May.