Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

California Flight Delay Analysis

Binder

DOI

Overview

This project analyzes and models flight delay behavior in California using data from the U.S. Department of Transportation. The main objectives are to look at airline and airport reliability, explore clustering structures among airlines and airports, and evaluate how well we can predict flight delays using statistical and machine learning methods.

Motivation

Because of the many recent flight delays due to the government shutdown (Shepardson & Ljunggren (2025)), we found it interesting to take a closer look at how delays behave in practice. This motivated us to analyze which airlines and airports perform best and whether delays can be predicted using historical data.

Dataset

The dataset used in this project is the U.S. Flight Delays Dataset (United States Department of Transportation (2015)), available on Kaggle:
https://www.kaggle.com/datasets/usdot/flight-delays

The data comes from from the Bureau of Transportation Statistics and contains on-time performance records of domestic U.S. flights across 14 airlines and 322 airports during 2015. Due to the large size of the dataset, a subset is used in this project focusing only on flights leaving from California airports.

Project Website

The project’s website can be accessed here:
https://ucb-stat-159-f25.github.io/final-group21/

Repository Structure

The repository is structured in the following way:

Setup

Clone the repository:

git clone https://github.com/UCB-stat-159-f25/final-group21.git
cd final-group21

Create and activate the environment:

conda env update -f environment.yml --name proj03 --prune
conda activate proj03

Install the IPython kernel:

python -m ipykernel install --user --name proj03 --display-name "IPython - proj03"

Usage

Create or update the environment:

make env

Run all notebooks in the project:

make all

Testing

Tests can be executed from the project root using:

PYTHONPATH=./ pytest

License

This project is licensed under the BSD 3-Clause License.

References
  1. Jakob Eriksen, keylajb, Keval, & Jimmy Butler. (2025). UCB-stat-159-f25/final-group21: Initial release. Zenodo. 10.5281/ZENODO.17971075
  2. Shepardson, D., & Ljunggren, D. (2025). US flight delays, cancellations accelerate as air traffic controller shortages surge. Reuters. https://www.reuters.com/sustainability/sustainable-finance-reporting/us-airlines-brace-third-day-government-mandated-flight-cuts-2025-11-09/
  3. United States Department of Transportation. (2015). U.S. Flight Delays and Performance. Kaggle. https://www.kaggle.com/datasets/usdot/flight-delays/