The Numismatic Bibliomania Society

PREV ARTICLE       NEXT ARTICLE       FULL ISSUE       PREV FULL ISSUE      

V27 2024 INDEX       E-SYLUM ARCHIVE

The E-Sylum: Volume 27, Number 16, April 21, 2024, Article 9

REVISITING NUMI

Justin Hinh created an app using OpenAI's ChatGPT4 platform to provide coin identification and grading estimates. He submitted this update, which examines improvements in its grading performance. Thanks - very interesting. -Editor

  Revisiting Numi: Testing The Latest GPT-4 Update

Numi logo As a coin collector and technology nerd, in late 2023 I developed Numi, an AI-powered chatbot that leverages the advanced capabilities of OpenAI's GPT-4 vision model to assist coin collectors in identifying and grading their coins. It's been fascinating seeing the exponential growth of Artificial Intelligence, so I created Numi to test AI's abilities to tackle one of the biggest barriers to new collectors in coin collecting. Throughout Numi's development, I became more and more convinced that AI is going to fundamentally change the future of the hobby.

Testing Numi With OpenAI's Latest GPT-4 Update

I had a hypothesis that more data given to the AI would yield more accurate grading results. In December 2023 I ran a series of tests on each grade using 2 to 10 photos per coin. Following OpenAI's recent April 2024 update to their GPT-4 model, which powers Numi's AI capabilities, I conducted another series of tests on Numi's grading accuracy.

I then ran statistical analyses to assess the impact on Numi's performance and compared its grading accuracy between the December 2023 and April 2024 test results.

  Numi Full Test Graph

Determining the Optimal # of Photos for Accurate Grading

A key aspect of my analysis focused on identifying the optimal number of coin photos users should upload to achieve the most accurate grading results. In December 2023, my tests indicated that uploading 10 photos yielded the best accuracy across all coin grades. This aligned with my hypothesis that more data = better. However, after the GPT-4 update in April 2024, that number had changed, with just 4 photos now providing the most precise grading outcomes.

  Numi 4 & 10 Test Results

Just How Much Did Numi Improve?

To measure Numi's accuracy and any improvements, I calculated the Mean Absolute Deviation (MAD) – a metric that represents the average deviation between Numi's predicted grades and the actual, expert-assigned grades. In December 2023, Numi's MAD was 5.39, indicating that, on average, its predictions deviated by approximately 5 points from the actual coin's grade (for example, coin grade is AU-55 but Numi predicted MS-60). By April 2024, following the GPT-4 update, Numi's MAD score decreased to 3.64, representing a substantial 32.47% increase in overall accuracy.

I suspected that Numi would be more accurate given the updates, but I was not expecting this much of a change. While the GPT-4 vision model still struggles immensely with medium-graded coins [Around XF-40], Numi performed exceptionally well for very low and very high-graded coins, with the biggest improvements seen for very low-graded coins.

The Future of AI in Numismatics

After seeing these results, I am even more convinced that Artificial Intelligence will revolutionize the field of coin collecting. As models like GPT-4 continue to improve, AI tools will become increasingly valuable for collectors seeking to expand their knowledge and make informed decisions about their collections. While Numi itself will most likely not end up being the go-to tool for collectors in the future, it serves as powerful evidence of where the hobby is heading.

The progress Numi has made in a short time is encouraging, and I look forward to testing its capabilities as AI models advance. By making coin grading more accessible and user-friendly, AI has the potential to attract new enthusiasts to the hobby and help experienced collectors deepen their understanding and appreciation for numismatics.

To read the earlier E-Sylum articles, see:
NUMI: AI-POWERED COIN IDENTIFICATION APP (https://www.coinbooks.org/v26/esylum_v26n47a10.html)
https://www.coinbooks.org/v26/esylum_v26n47a11.html (https://www.coinbooks.org/v26/esylum_v26n47a11.html)
UPDATE: AI-POWERED APP NUMI (https://www.coinbooks.org/v26/esylum_v26n52a12.html)
NOTES FROM E-SYLUM READERS: NOVEMBER 26, 2023 : Numi on the Coin World Podcast (https://www.coinbooks.org/v26/esylum_v26n48a15.html)

THE BOOK BAZARRE

OVER 500 NUMISMATIC TITLES: Wizard Coin Supply has over 500 numismatic titles in stock, competitively discounted, and available for immediate shipment. See our selection at www.WizardCoinSupply.com.

Charles Davis ad02



Wayne Homren, Editor

Google
 
NBS (coinbooks.org) Web

The Numismatic Bibliomania Society is a non-profit organization promoting numismatic literature. See our web site at coinbooks.org.

To submit items for publication in The E-Sylum, write to the Editor at this address: whomren@gmail.com

To subscribe go to: https://my.binhost.com/lists/listinfo/esylum

PREV ARTICLE       NEXT ARTICLE       FULL ISSUE       PREV FULL ISSUE      

V27 2024 INDEX       E-SYLUM ARCHIVE

Copyright © 1998 - 2023 The Numismatic Bibliomania Society (NBS)
All Rights Reserved.

NBS Home Page
Contact the NBS webmaster
coin