Discussion about this post

User's avatar
Niels's avatar

Thank you for this detailed and interesting article. As a sceptic, I have to chip in a possible, much less exciting alternative: To my untrained eye (maybe I imagine this now after the explanation though) it looks like there is slightly more kerning between 4 and 5 than between 1 and 4. I would expect the next generation of Gemini models to improve on visual accuracy, so it might be among the few models to be able to pick this subtlety up. However, if it picks it up, it would directly read "14 5" and require much less reasoning to decide for the 14lb 5oz interpretation. Either way, this is remarkable!

Expand full comment
Mark Humphries's avatar

I totally get the concern but I don't think that is happening here and would say it is not the same type of problem either. This is a bit different than claiming that the model is solving a novel math problem for which a solution exists. Any historian or person with knowledge of 18th century accounting would quickly reach the same conclusion, it's not difficult to figure out...but it does require a significant amount of contextual knowledge and the ability to reason across several layers of abstraction. Easy for humans with some training, hard for LLMs. But not the same as solving a novel math problem.

That said, all the requisite knowledge and similar examples would be in the training data from other sources, but I am certain that this particular ledger is not online because I took the photographs in the archives myself from the paper originals precisely because it had not been digitized. Nor have I ever found a reference to it in print or online. It is a very obscure document. If you are interested it is from the Albany Institute for Art and History in Albany, New York. It is officially catalogued as the Harmen van Heusen Daybook but (to add a layer of complexity to this), it was actually mis-catalogued and is really the Shipboy & Henry Daybook. These are all extremely obscure merchants from Albany in the 18th century and almost nothing is written about any of them. I am interested in them for some esoteric historical reasons and simply uploaded this image at random as it was at hand.

Expand full comment
33 more comments...

No posts