ChatGPT-o1 and the Pitfalls of Familiar Reasoning in Medical Ethics

Soffer, S.; Sorin, V.; Nadkarni, G.; Klang, E.

2024-09-27 medical ethics

10.1101/2024.09.25.24314342 medRxiv

Show abstract

Large language models (LLMs) like ChatGPT often exhibit Type 1 thinking--fast, intuitive reasoning that relies on familiar patterns--which can be dangerously simplistic in complex medical or ethical scenarios requiring more deliberate analysis. In our recent explorations, we observed that LLMs frequently default to well-known answers, failing to recognize nuances or twists in presented situations. For instance, when faced with modified versions of the classic "Surgeons Dilemma" or medical ethics cases where typical dilemmas were resolved, LLMs still reverted to standard responses, overlooking critical details. Even models designed for enhanced analytical reasoning, such as ChatGPT-o1, did not consistently overcome these limitations. This suggests that despite advancements toward fostering Type 2 thinking, LLMs remain heavily influenced by familiar patterns ingrained during training. As LLMs are increasingly integrated into clinical practice, it is crucial to acknowledge and address these shortcomings to ensure reliable and contextually appropriate AI assistance in medical decision-making.

ChatGPT-o1 and the Pitfalls of Familiar Reasoning in Medical Ethics

Matching journals