My main task of the offseason is now completed.  I have downloaded the Retrosheet database: play-by-play (to the extent known) of every MLB play for the last 100 years, with a smattering of games going back to 1901.  Retrosheet made a big change this year, producing a parallel set of files that should make a lot of queries easier.  I made the switch, and there’s going to be a learning curve here, but I am generally excited about the new file structure, not that anybody cares.

One of the things that I love about the Retrosheet database is the telegraphic “event” field.  This is a brief code that describes the play but that gives lots of information.  It always reminds me of the young Ronald Reagan (pictured above), who had a job doing play-by-play for Cubs baseball for WHO in Des Moines, Iowa. He didn’t attend these games.  He would get a description of each play and then had to create the excitement of the play itself as if he was there.

Those play descriptions came in Morse code, but you can learn pretty quickly to decipher Retrosheet code.  For example, the most famous play in Atlanta Braves history is coded:

 S7/L7S.3-H(UR);2-H(UR);1-2.

Translated, that’s a line single to short left field in which the man on third came home on an unearned run, as did the guy on second.  Finally, the guy on first advanced to second.  Would you rather have seen the play live or on television?  Of course.  But with the right enthusiasm, you can imagine that Skip Caray’s call would have been almost identical.  And Chip could read ahead and never be fooled by the force of the swing or anticipation of a moment that never happened.

So the event database now consists of 15,219,683 separate events.  Lots of them are very short: “K” for example.  As I downloaded the database, I got the idea of finding the longest play description.  Here it is:

E8/F89XD.3-H(NR)(UR);2-H(NR)(UR);1-H(NR)(UR);B-H(E4/TH)(NR)(UR)

This play is an error on the center fielder on a fly ball to deep right center in which all three base runners scored (no rbi or earned run for any of them) and the batter also came home on a throwing error by the second baseman.  I think that description is actually more interesting than watching the play.  It was in September 2021 and the victimized pitcher was none other than Chris Sale.

I don’t use this field all that often in my “research,” because there are usually easier ways to get at the same information in other parts of the data, but it is my go-to resource for unusual scoring plays, and I just wanted to let everyone know that I’m ready for your questions, as long as they don’t involve 2025 play.  That database won’t be out until December.