Performance of a Small Language Model Versus a Large Language Model in Answering Glaucoma Frequently Asked Patient Questions: Development and Usability Study

Adriano Cypriano Faneli; Rafael Scherer; Rohit Muralidhar; Marcus Guerreiro-Filho; Luiz Beniz; Verônica Vilasboas-Campos; Douglas Costa; Alessandro A Jammal; Felipe A Medeiros

doi:10.2196/72101

Back

Performance of a Small Language Model Versus a Large Language Model in Answering Glaucoma Frequently Asked Patient Questions: Development and Usability Study

Journal article

Open access

Peer reviewed

Performance of a Small Language Model Versus a Large Language Model in Answering Glaucoma Frequently Asked Patient Questions: Development and Usability Study

Adriano Cypriano Faneli, Rafael Scherer, Rohit Muralidhar, Marcus Guerreiro-Filho, Luiz Beniz, Verônica Vilasboas-Campos, Douglas Costa, Alessandro A Jammal and Felipe A Medeiros

JMIR AI, Vol.5, pp.e72101-e72101

2026-01-06

DOI: https://doi.org/10.2196/72101

PMID: 41493946

Appears in Miller School of Medicine - Latest Publications

Abstract

online health information

ChatGPT4.0

small language model

large language model

glaucoma

Large language models (LLMs) have been shown to answer patient questions in ophthalmology similar to human experts. However, concerns remain regarding their use, particularly related to patient privacy and potential inaccuracies that could compromise patient safety. This study aimed to compare the performance of an LLM in answering frequently asked patient questions about glaucoma with that of a small language model (SLM) trained locally on ophthalmology-specific literature. We compiled 35 frequently asked questions on glaucoma, categorized into 6 domains, including pathogenesis, risk factors, clinical manifestations, diagnosis, treatment and prevention, and prognosis. Each question was posed to both a SLM using a retrieval-augmented generation framework, trained on ophthalmology-specific literature, and to a LLM (ChatGPT 4.0, OpenAI). Three glaucoma specialists from a single institution independently assessed the answers using a 3-tier accuracy rating scale: poor (score=1), borderline (score=2), and good (score=3). Each answer received a quality score ranging from 3 to 9 points based on the sum of ratings from the 3 graders. Readability grade level was assessed using 4 formulas, such as the Flesch-Kincaid Level, the Gunning Fog Index, the Coleman-Liau Index, and the Simple Measure of Gobbledygook Index. The answers from the SLM demonstrated comparable quality with ChatGPT 4.0, scoring mean 7.9 (SD 1.2) and mean 7.4 (SD 1.5), respectively, out of a total of 9 points (P=.13). The accuracy rating was consistent overall and across all 6 glaucoma care domains. Both models provided answers considered unsuitable for health care-related information, as they were difficult for the average layperson to read. Both models generated accurate content, but the answers were considered challenging for the average layperson to understand, making them unsuitable for health care-related information. Given the specialized SLM's comparable performance to the LLM, its high customization potential, lower cost, and ability to operate locally, it presents a viable option for deploying natural language processing in real-world ophthalmology clinical settings.

Files and links (1)

url

https://doi.org/10.2196/72101View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Performance of a Small Language Model Versus a Large Language Model in Answering Glaucoma Frequently Asked Patient Questions: Development and Usability Study
Creators: Adriano Cypriano Faneli - Universidade Federal de São Paulo
Rafael Scherer - University of Miami
Rohit Muralidhar - University of Miami
Marcus Guerreiro-Filho - Bascom Palmer Eye Institute, University of Miami, 900 NW 17th St, Miami, FL, 33136, United States, 1 305-326-6000
Luiz Beniz - Bascom Palmer Eye Institute, University of Miami, 900 NW 17th St, Miami, FL, 33136, United States, 1 305-326-6000
Verônica Vilasboas-Campos - Bascom Palmer Eye Institute, University of Miami, 900 NW 17th St, Miami, FL, 33136, United States, 1 305-326-6000
Douglas Costa - Bascom Palmer Eye Institute, University of Miami, 900 NW 17th St, Miami, FL, 33136, United States, 1 305-326-6000
Alessandro A Jammal - University of Miami
Felipe A Medeiros - University of Miami
Publication Details: JMIR AI, Vol.5, pp.e72101-e72101
Publisher: JMIR PUBLICATIONS, INC; TORONTO
Number of pages: 10
Academic Unit: UMMG Department of Ophthalmology; Miller School of Medicine
Language: English
Resource Type: Journal article
PMID: 41493946
Record Identifier: 991032937902902976