Abstract

Overcrowding is a major challenge to correction systems because the conventional forecasting techniques are inaccurate and inadequate in most cases. This paper will solve this by constructing and testing a machine learning-based model to predict facility-level overcrowding. With the use of the XGBoost Regressor model on a dataset comprising of U.S. correctional facilities, the research identified key structural drivers but showed that the static facility attributes alone have limited predictive power (r-square approx 0.18) The discussion shows that overcrowding is non-linear, and a complicated problem not confined to the facility characteristics but to the larger, non-measurable regional influences, of which geographical characteristics are a strong proxy. The results present a demonstration of scalable interpretable forecasting system, which allows transition to proactive, data-driven strategic planning and ensure safety and efficiency in the facilities.

Library of Congress Subject Headings

Prison administration--United States--Data processing; Prisons--Overcrowding--Forecasting--Data processing; Machine learning

Publication Date

12-2025

Document Type

Thesis

Student Type

Graduate

Degree Name

Professional Studies (MS)

Department, Program, or Center

Graduate Programs & Research

Advisor

Sanjay Modak

Advisor/Committee Member

Ayman Ibrahim

Campus

RIT Dubai

Plan Codes

PROFST-MS

Share

COinS