top of page

With my Winter Break winding down, I decided to spend some time refining my Data Science skills. I gathered LA Crime Data from Data.gov and analyzed the data using Python, Apache Spark, Pandas, NumPy, Matplot but later switched to Ploty, and the Jupyter Notebook. According to the Data gathered from Data.gov. For the years 2011-2015 the data was complete, but in the year 2016, it seems that they stopped maintaining their data after July. Therefore from 2011- July of 2016, there was a total of 1,323,817 crimes that occurred 

The data for 2016 may be incomplete, but if we calculate the average for the days that were recorded, we see that the daily average for # of crimes per day in the year 2016 is clearly higher than the previous years. If we use that to calculate the remaining 5 months of the year with an average rounded down to 700, it adds an additional 100,000 crimes which would put the total for the year over 250,000 crimes reported.  

Where Did the Crimes Occur ?​

Every Crime committed was reported with a corresponding area with a total of 21 Area in the Los Angeles County.

Which areas varied  the greatest ?

Rank Of Areas By Least Amount of Crimes To Most  

What are the 21 Areas ?

The 21 Area Span Across the entire LA County as shown in the map below. 

Crime Locations

 2011 Locations

 2012 Locations

 2013 Locations

 2014 Locations

 2015 Locations

*The 2016 Data was the only data to not include the Longitude and Latitude coordinates in their reporting 
bottom of page