Project Overview
Using R as a tool and Trackman CSVs as my data, I was able to craft a report that analyzes the results of all NCAA pitches with similar shape and metrics to a target pitcher's pitches, providing context for how their stuff performs across college baseball.
Project Description
This report was created with the goal of evaluating a pitcher and how his stuff performs across NCAA baseball. Rather than simply looking at how a pitcher's own pitches perform, this analysis provides crucial context by comparing results to all similar pitches thrown throughout college baseball.
Why Pitch Similarity Matters
Understanding how similar pitches perform across NCAA baseball provides invaluable context for evaluating a pitcher's effectiveness. If a pitcher throws a slider with certain characteristics, this report shows how all sliders with those same characteristics have performed against college hitters. This helps answer critical questions: Is the pitch underperforming relative to similar offerings? Are the results in line with expectations based on the pitch's shape and velocity?
This comparative approach helps identify whether poor results are due to pitch execution, sequencing, or simply the inherent quality of the pitch itself. It also highlights when a pitch is overperforming, which may indicate excellent command or deception even if the raw metrics don't look elite.
Similarity Criteria
The report includes a table for each pitch type a pitcher throws, split by batter handedness. Pitches are considered "similar" if they fall within these ranges:
Performance Metrics Analyzed
NCAA-Benchmarked Color Coding
For better interpretability, statistics where I felt it would be useful for them to be compared to league averages are automatically color coded on a red to green scale where green is good for the pitcher. The coding is based on the rolling NCAA average and standard deviation for each statistic.
The color scales are different for each pitch type because, for example, league average whiff percentage for a fastball is lower than league average whiff percentage for a slider. Therefore, each pitch type is color coded based on its custom scale, providing pitch-type-specific context.
Strategic Applications
This report is invaluable for pitcher development and evaluation. It helps coaches and pitchers understand whether results are meeting expectations based on pitch characteristics. If a slider with good shape and velocity is underperforming compared to similar sliders across NCAA baseball, it may indicate issues with command, sequencing, or tunneling. Conversely, if results are exceeding the performance of similar pitches, it validates the pitcher's execution and deception.
Example Output: Iowa 2025 Season
The attached report includes all of the data from the 2025 Iowa baseball season, providing comprehensive similarity analysis for the entire pitching staff. The code can be used to analyze any Trackman data, making it adaptable for any team or league.