The Numismatic Bibliomania Society

PREV ARTICLE       NEXT ARTICLE       FULL ISSUE       PREV FULL ISSUE      

V26 2023 INDEX       E-SYLUM ARCHIVE

The E-Sylum: Volume 26, Number 52, December 24, 2023, Article 12

UPDATE: AI-POWERED APP NUMI

Justin Hinh submitted this update on his AI-powered coin grading and identification app. Thank you! -Editor

Numi logo I wanted to share an update on Numi with you and E-Sylum readers.

I revamped my AI grading and identification app Numi to v1.33 to use the official American Numismatic Association's grading standards. My goal was to transform Numi to focus solely on technical grading, as technical grading is much more objective as compared to market grading.

Attached are some results for coins ranging from G-4 to PR-70.

As you can see, the results were quite off. With Numi getting increasingly accurate as the coin moved to a better condition.

Numi struggles quite a bit with recognizing wear on a coin. Interestingly enough, this was the exact issue Compugrade experienced back in the 90's. Even though their algorithms were completely different than today's AI Large Language Models.

  Numi testing results G-4 Numi testing results VG-6
  Numi testing results F-12 Numi testing results F-15
  Numi testing results VF-20 Numi testing results XF-40
  Numi testing results AU-55 Numi testing results MS-63
  Numi testing results MS-65 Numi testing results MS-66
  Numi testing results PR-69 Numi testing results PR-70

Given Numi's failure to achieve consistent results, I am putting the project on ice until the next OpenAI GPT model is released. I'm also interested in testing Numi using Google's Gemini Ultra model once that releases sometime next year. Once they do, I will revisit these tests and see how much Numi has improved.

Observations & Lessons Learned

  • Like what many online are reporting, GPT-4 has substantially degraded in performance in the past few months. I found myself constantly arguing with the AI to get it to follow most of my custom instructions.
  • The model can follow custom instructions well for the first 8-10k tokens, but perception and logic fall off a cliff after that. Even with highly tailored instructions with clear steps to follow, Numi would veer off course when it had to process too much info. This was incredibly frustrating as GPT-4 Turbo claimed to have a token limit of 128k [i.e. ability to remember up to nearly 25k words].
  • The vision model needs a major boost. The Optical Character Reader was great at picking up texts on coins. I rarely had it misread the text. But often it would fail to pick up the mint mark on more worn coins. The vision model could generally tell when a coin was more worn, but it failed to apply a correct grade.
  • Numi was excellent at identifying non-ancient coins. And was quite often great at identifying tokens. But it struggled immensely with Ancient coins. The only ancient coins it could identify were coins in very high condition.

Overall Numi was a fun and exciting project. I got to apply my AI knowledge to my hobby and I'm excited to see how future models perform. AI is a force to be reckoned with and it's going to be an invaluable resource for researchers and those trying to learn more about their coins. I left this project feeling that AI technical grading would eventually work. But I now have my doubts that it can overtake market grading in a manner that would be widely adopted.

Thank you for the thorough evaluation report. I, too, remain optimistic for AI's future contributions to numismatics. We usually stay out of grading and pricing discussions at The E-Sylum, but appreciate the progress shown here with identifying tokens and non-ancient coins. I think text-rich paper money will also lend itself well to machine identification. Machines will continue to augment and improve human efforts, but many (if not most) tasks will remain best performed by humans. -Editor

To read the earlier E-Sylum articles, see:
NUMI: AI-POWERED COIN IDENTIFICATION APP (https://www.coinbooks.org/v26/esylum_v26n47a10.html)
NUMI: HOW THE AI SAUSAGE IS MADE (https://www.coinbooks.org/v26/esylum_v26n47a11.html)

To read about the Brazilian Councilman writing laws with ChatGPT, see:
The City That's Trying to Replace Politicians With Computers (It's Working) (https://www.wsj.com/tech/politican-ramiro-rosario-artificial-intelligence-brazil-82ca338d)

  CNG E-Sylum Ad 2023-12-10 Triton XXVII



Wayne Homren, Editor

Google
 
NBS (coinbooks.org) Web

The Numismatic Bibliomania Society is a non-profit organization promoting numismatic literature. See our web site at coinbooks.org.

To submit items for publication in The E-Sylum, write to the Editor at this address: whomren@gmail.com

To subscribe go to: https://my.binhost.com/lists/listinfo/esylum

PREV ARTICLE       NEXT ARTICLE       FULL ISSUE       PREV FULL ISSUE      

V26 2023 INDEX       E-SYLUM ARCHIVE

Copyright © 1998 - 2023 The Numismatic Bibliomania Society (NBS)
All Rights Reserved.

NBS Home Page
Contact the NBS webmaster
coin