Artificial intelligence in reproductive endocrinology: an in-depth longitudinal analysis of ChatGPTv4’s month-by-month interpretation and adherence to clinical guidelines for diminished ovarian reserve

dc.authorscopusid: Asena Ayar Madenli / 57761671000
dc.authorwosid: Asena Ayar Madenli / IXW-7013-2023
dc.contributor.author: Gürbüz, Tuğba
dc.contributor.author: Gökmen, Oya
dc.contributor.author: Devranoğlu, Belgin
dc.contributor.author: Madenli, Asena Ayar
dc.date.accessioned: 2025-06-03T13:30:42Z
dc.date.available: 2025-06-03T13:30:42Z
dc.date.issued: 2024
dc.department: İstinye Üniversitesi, Tıp Fakültesi, Cerrahi Tıp Bilimleri Bölümü
dc.description.abstract: Objective: To quantitatively assess the performance of ChatGPTv4, an artificial intelligence language model, in adhering to clinical guidelines for Diminished Ovarian Reserve (DOR) over two months, and to evaluate the model’s consistency in providing guideline-based responses. Design: A longitudinal study design was employed to evaluate ChatGPTv4’s response accuracy and completeness using a structured questionnaire at baseline and at a two-month follow-up. Setting: ChatGPTv4 was tasked with interpreting DOR questionnaires based on standardized clinical guidelines. Participants: The study did not involve human participants; the questionnaire was administered exclusively to the ChatGPT model to generate responses about DOR. Methods: A guideline-based questionnaire with 176 open-ended, 166 multiple-choice, and 153 true/false questions was deployed to rigorously assess ChatGPTv4’s ability to provide accurate medical advice aligned with current DOR clinical guidelines. AI-generated responses were rated on a 6-point Likert scale for accuracy and a 3-point scale for completeness. The two-phase design assessed the stability and consistency of AI-generated answers over two months. Results: ChatGPTv4 achieved near-perfect scores across all question types, with true/false questions consistently answered with 100% accuracy. In multiple-choice queries, accuracy improved from 98.2% to 100% at the two-month follow-up. Open-ended responses improved significantly, with mean accuracy scores increasing from 5.38 ± 0.71 to 5.74 ± 0.51 (max: 6.0) and mean completeness scores from 2.57 ± 0.52 to 2.85 ± 0.36 (max: 3.0). These improvements were statistically significant (p < 0.001), with positive correlations between initial and follow-up accuracy (r = 0.597) and completeness (r = 0.381) scores. Limitations: The study relied on a controlled, simulated setting that may not fully mirror real-world clinical interactions. Conclusion: ChatGPTv4 demonstrated exceptional and improving accuracy and completeness in handling DOR-related guideline queries over the studied period. These findings highlight ChatGPTv4’s potential as a reliable, adaptable AI tool in reproductive endocrinology, capable of augmenting clinical decision-making and guideline development. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
dc.identifier.citation: Gurbuz, T., Gokmen, O., Devranoglu, B., Yurci, A., & Madenli, A. A. (2024). Artificial intelligence in reproductive endocrinology: an in-depth longitudinal analysis of ChatGPTv4’s month-by-month interpretation and adherence to clinical guidelines for diminished ovarian reserve. Endocrine, 86(3), 1171-1177.
dc.identifier.doi: 10.1007/s12020-024-04031-8
dc.identifier.endpage: 1177
dc.identifier.issn: 1355008X
dc.identifier.issue: 3
dc.identifier.pmid: 39341951
dc.identifier.scopusquality: Q2
dc.identifier.startpage: 1171
dc.identifier.uri: http://dx.doi.org/10.1007/s12020-024-04031-8
dc.identifier.uri: https://hdl.handle.net/20.500.12713/7283
dc.identifier.volume: 86
dc.identifier.wos: WOS:001324503300001
dc.identifier.wosquality: Q2
dc.indekslendigikaynak: Scopus
dc.indekslendigikaynak: Web of Science
dc.indekslendigikaynak: PubMed
dc.institutionauthor: Ayar Madenli, Asena
dc.institutionauthorid: Asena Ayar Madenli / 0000-0003-0129-8710
dc.language.iso: en
dc.publisher: Springer
dc.relation.ispartof: Endocrine
dc.relation.publicationcategory: Article - International Refereed Journal - Institutional Faculty Member
dc.rights: info:eu-repo/semantics/closedAccess
dc.subject: Artificial Intelligence
dc.subject: ChatGPTv4
dc.subject: Diminished Ovarian Reserve
dc.subject: Reproductive Endocrinology
dc.title: Artificial intelligence in reproductive endocrinology: an in-depth longitudinal analysis of ChatGPTv4’s month-by-month interpretation and adherence to clinical guidelines for diminished ovarian reserve
dc.type: Article

Files

Original bundle
Name: Artificial-intelligence-in-reproductive-endocrinology-an-indepth-longitudinal-analysis-of-ChatGPTv4s-monthbymonth-interpretation-and-adherence-to-clinical-guidelines-for-diminished-ovarian-reserveEndocrine.pdf
Size: 993.07 KB
Format: Adobe Portable Document Format