5/3/2025, 3:52 PM: ELO Sorting My Baseball PC
After a full morning elbow-deep in rankings, matchups, and tuning logic, I’ve landed on a system that’s finally doing what I need: making sense of my personal card collection using a dynamic, living rating model.
Why ELO?
I’m not trying to create some universal “value” system. This isn’t about market comps or PSA 10s. This is about how I feel about my cards—relative to each other. Sentimental weight, aesthetics, print run quirks, storylines—all of it can be surfaced if the system allows for fluid context. That’s what ELO does better than tier lists or 1–10 scoring. It gives each card the chance to rise, fall, surprise, or age like wine—based on real head-to-head outcomes.
The Current State
We now have 32 cards, each with its own rating, match count, and an assigned tier (S through D). Elo is calculated via a custom Google Sheets script, using a standard kFactor = 32, and ratings update automatically when matchups are marked "done".
The best cards right now:
Aaron Judge T210, Lindor Blue Atomic, and Nimmo Snapshots Auto — all sitting solidly in S-Tier with 10–15 matchups under their belts.
At the bottom:
Tony Gwynn Chrome, Vladdy Jr. Bowman, and J.P. Morgan Heritage have sunk to the D-Tier after repeated trials.
And in the middle? That’s where the most interesting movement happens. A-Tier and B-Tier are volatile, still tuning themselves.
Process Design
After some experimentation, I’ve adopted a two-phase pipeline for evaluating cards:
1. Stagger Phase (New Card Intake):
When I add new cards:
They all start at 1000.
Each gets 3 initial matchups: the lowest A/S-tier, the first B-tier above 1000, and the highest D-tier.
This gives me a fast read without exhausting the whole list.
2. Tuning Phase (Pressure Testing):
Once those cards have ~3 matchups, I move them into tuning:
1 matchup vs a higher card in the same tier
1 matchup vs someone ±1 tier
1 matchup vs their tier floor
This gets them to 5–6 total matchups, which is the sweet spot for Elo to settle. At that point, I trust where they are and can shift focus to others.
Why This Matters (To Me)
This isn’t just busywork. It’s how I feel the collection. I want to see if the cards I love really hold up against the cards I think I love. That requires structure—but not rigidity.
ID | Card | Rating | Matches | Rank | Tier |
---|---|---|---|---|---|
13 | 2024 Aaron Judge - T210 | 1152 | 10 | 1 | S-Tier |
21 | 2022 Francisco Lindor - Topps Chrome - Blue Atomic - #95/100 | 1131 | 11 | 2 | S-Tier |
16 | 2019 Brandon Nimmo - Archives Snapshots - Auto , #03/50 | 1126 | 15 | 3 | S-Tier |
14 | 1978 Lenny Randle - Boxtop - Auto | 1125 | 15 | 4 | S-Tier |
6 | 2013 Carson Kelly - Bowman - Auto | 1105 | 17 | 5 | S-Tier |
25 | 2024 R.A. Dickey - T206 - Auto | 1095 | 6 | 6 | A-Tier |
20 | 2023 Pedro Martinez - 1988 Chrome Silver Pack - #67/99 | 1073 | 11 | 7 | A-Tier |
32 | 2011 Francisco Lindor - Bowman Chrome | 1061 | 6 | 8 | A-Tier |
17 | 2016 Aroldis Chapman - Gold Label | 1030 | 18 | 9 | B-Tier |