
With my winter break winding down, I decided to spend some time refining my data science skills. I gathered LA crime data from Data.gov and analyzed it using Python, Apache Spark, Pandas, NumPy, Matplotlib (I later switched to Plotly), and the Jupyter Notebook. According to the data gathered from Data.gov, the records for 2011-2015 are complete, but the 2016 data appears to have stopped being maintained after July. From 2011 through July 2016, a total of 1,323,817 crimes were recorded.
The data for 2016 may be incomplete, but if we calculate the average over the days that were recorded, the daily average number of crimes in 2016 is clearly higher than in previous years. If we take that average, rounded down to 700 crimes per day, and apply it to the remaining 5 months of the year, it adds roughly 100,000 crimes, which would put the total for the year over 250,000 crimes reported.
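The extrapolation above is simple arithmetic, and a minimal sketch of it might look like the following. It assumes the last recorded day was July 31, 2016 (the data only says "after July"), and the function names are made up for illustration. The 700 per day figure is the rounded-down average mentioned above.

```python
from datetime import date

def days_recorded(last_recorded: date) -> int:
    """Days from January 1 through the last recorded day, inclusive."""
    return (last_recorded - date(last_recorded.year, 1, 1)).days + 1

def remaining_days(last_recorded: date) -> int:
    """Days left in the calendar year after the last recorded day."""
    return (date(last_recorded.year, 12, 31) - last_recorded).days

def projected_additional(daily_average: int, last_recorded: date) -> int:
    """Crimes expected in the unrecorded part of the year at a flat daily rate."""
    return daily_average * remaining_days(last_recorded)

# Assumption: records stop on July 31, 2016.
last_day = date(2016, 7, 31)

print(days_recorded(last_day))              # 213 days with data (2016 is a leap year)
print(remaining_days(last_day))             # 153 days without data
print(projected_additional(700, last_day))  # 107100 additional crimes
```

This is a flat-rate projection, so it ignores any seasonal pattern in the second half of the year.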
Where Did the Crimes Occur?
Every crime was reported with a corresponding area, for a total of 21 areas in Los Angeles County.
Which Areas Varied the Most?
Rank of Areas from Fewest Crimes to Most
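A ranking like this can be sketched by counting the reports per area and sorting the counts in ascending order. The rows and area labels below are toy placeholders, not values from the real dataset, and the `area` field name is an assumption about how the records are keyed.

```python
from collections import Counter

# Toy rows standing in for the real records; area labels are hypothetical.
reports = [
    {"area": "Area A"},
    {"area": "Area B"},
    {"area": "Area A"},
    {"area": "Area C"},
    {"area": "Area A"},
    {"area": "Area C"},
]

def rank_areas(rows, key="area"):
    """Return (area, count) pairs ordered from fewest reports to most."""
    counts = Counter(row[key] for row in rows)
    # Sort by count first, then by name so ties come out in a stable order.
    return sorted(counts.items(), key=lambda item: (item[1], item[0]))

ranking = rank_areas(reports)
for area, count in ranking:
    print(area, count)
```

With Pandas, the same idea is usually written as a `groupby` on the area column followed by a sort.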
What Are the 21 Areas?
The 21 areas span the entire LA County, as shown in the map below.
Crime Locations
2011 Locations
2012 Locations
2013 Locations
2014 Locations
2015 Locations
*The 2016 data was the only data set that did not include longitude and latitude coordinates.
Data Gathered From:
- https://catalog.data.gov/dataset/crimes-2012-2015
- https://catalog.data.gov/dataset/lapd-crime-and-collision-raw-data-for-2015
- https://catalog.data.gov/dataset/open-data-crm-2016
- https://catalog.data.gov/dataset/lapd-crime-and-collision-raw-data-for-2013
- https://catalog.data.gov/dataset/lapd-crime-and-collision-raw-data-for-2014
- https://catalog.data.gov/dataset/lapd-crime-and-collision-raw-data-for-2012
- https://catalog.data.gov/dataset/lapd-crime-and-collision-raw-data-for-2011