Lazaro Martull

Dallas Crime Data: Predicting Crime Locations Using Machine Learning

Overview

This project builds machine learning models to predict crime incident locations (premise) using Dallas Police open data. The task is a multi-class classification problem where the target variable is the reported location type and the inputs are the crime type and category.

Data

Key fields (renamed):

Methodology

Models

Results (Highlights)

All Locations (2024 Holdout)

Holdout Accuracy:

AUC:

Top 5 Locations (2024 Holdout)

Filtering to the top five most common location classes significantly improved performance:

Holdout Accuracy:

Limitations & Future Work

Files