{"id":1160,"date":"2026-02-09T16:01:31","date_gmt":"2026-02-09T16:01:31","guid":{"rendered":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/"},"modified":"2026-02-09T16:01:31","modified_gmt":"2026-02-09T16:01:31","slug":"chatbots-make-terrible-doctors-new-study-finds","status":"publish","type":"post","link":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/","title":{"rendered":"Chatbots Make Terrible Doctors, New Study Finds"},"content":{"rendered":"<p>Chatbots may be able to pass medical exams, but that doesn\u2019t mean they make good doctors, according to a new, large-scale study of how people get medical advice from large language models.\u00a0The controlled study of 1,298 UK-based participants, published today in Nature from the Oxford Internet Institute and the Nuffield Department of Primary Care Health Sciences at the University of Oxford, tested whether LLMs could help people identify underlying conditions and suggest useful courses of action, like going to the hospital or seeking treatment. Participants were randomly assigned an LLM \u2014 GPT-4o, Llama 3, and Cohere\u2019s Command R+ \u2014 or were told to use a source of their choice to \u201cmake decisions about a medical scenario as though they had encountered it at home,\u201d according to the study. The scenarios included ailments like \u201ca young man developing a severe headache after a night out with friends for example, to a new mother feeling constantly out of breath and exhausted,\u201d the researchers said.\u201cOne user was told to lie down in a dark room, and the other user was given the correct recommendation to seek emergency care.\u201d\u00a0When the researchers tested the LLMs without involving users by providing the models with the full text of each clinical scenario, the models correctly identified conditions in 94.9 percent of cases. But when talking to the participants about those same conditions, the LLMs identified relevant conditions in fewer than 34.5 percent of cases. People didn\u2019t know what information the chatbots needed, and in some scenarios, the chatbots provided multiple diagnoses and courses of action. Knowing what questions to ask a patient and what information might be withheld or missing during an examination are nuanced skills that make great human physicians; based on this study, chatbots can\u2019t reliably replicate that kind of care.In some cases, the chatbots also generated information that was just wrong or incomplete, including focusing on elements of the participants\u2019 inputs that were irrelevant, giving a partial US phone number to call, or suggesting they call the Australian emergency number.\u00a0\u00a0\u201cIn an extreme case, two users sent very similar messages describing symptoms of a subarachnoid hemorrhage but were given opposite advice,\u201d the study\u2019s authors wrote. \u201cOne user was told to lie down in a dark room, and the other user was given the correct recommendation to seek emergency care.\u201d\u00a0\u201cThese findings highlight the difficulty of building AI systems that can genuinely support people in sensitive, high-stakes areas like health,\u201d Dr. Rebecca Payne, lead medical practitioner on the study, said in a press release. \u201cDespite all the hype, AI just isn&#8217;t ready to take on the role of the physician. Patients need to be aware that asking a large language model about their symptoms can be dangerous, giving wrong diagnoses and failing to recognise when urgent help is needed.\u201dInstagram\u2019s AI Chatbots Lie About Being Licensed TherapistsWhen pushed for credentials, Instagram\u2019s user-made AI Studio bots will make up license numbers, practices, and education to try to convince you it\u2019s qualified to help with your mental health.404 MediaSamantha ColeLast year, 404 Media reported on AI chatbots hosted by Meta that posed as therapists, providing users fake credentials like license numbers and educational backgrounds. Following that reporting, almost two dozen digital rights and consumer protection organizations sent a complaint to the Federal Trade Commission urging regulators to investigate Character.AI and Meta\u2019s \u201cunlicensed practice of medicine facilitated by their product,\u201d through therapy-themed bots that claim to have credentials and confidentiality \u201cwith inadequate controls and disclosures.\u201d A group of Democratic senators also urged Meta to investigate and limit the \u201cblatant deception\u201d of Meta\u2019s chatbots that lie about being licensed therapists, and 44 attorneys general signed an open letter to 11 chatbot and social media companies, urging them to see their products \u201cthrough the eyes of a parent, not a predator.\u201d\u00a0In January, OpenAI announced ChatGPT Health, \u201ca dedicated experience that securely brings your health information and ChatGPT\u2019s intelligence together, to help you feel more informed, prepared, and confident navigating your health,\u201d the company said in a blog post. \u201cOver two years, we\u2019ve worked with more than 260 physicians who have practiced in 60 countries and dozens of specialties to understand what makes an answer to a health question helpful or potentially harmful\u2014this group has now provided feedback on model outputs over 600,000 times across 30 areas of focus,\u201d the company wrote. \u201cThis collaboration has shaped not just what Health can do, but how it responds: how urgently to encourage follow-ups with a clinician, how to communicate clearly without oversimplifying, and how to prioritize safety in moments that matter\u2060.\u201d\u00a0\u201cIn our work, we found that none of the tested language models were ready for deployment in direct patient care. Despite strong performance from the LLMs alone, both on existing benchmarks and on our scenarios, medical expertise was insufficient for effective patient care,\u201d the researchers wrote in their paper. \u201cOur work can only provide a lower bound on performance: newer models, models that make use of advanced techniques from chain of thought to reasoning tokens, or fine-tuned specialized models, are likely to provide higher performance on medical benchmarks.\u201d The researchers recommend developers, policymakers, and regulators consider testing LLMs with real human users before deploying in the future.\u00a0<\/p>\n","protected":false},"excerpt":{"rendered":"<div>Chatbots provided incorrect, conflicting medical advice, researchers found: \u201cDespite all the hype, AI just isn&#8217;t ready to take on the role of the physician.\u201d<\/div>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"site-container-style":"default","site-container-layout":"default","site-sidebar-layout":"default","disable-article-header":"default","disable-site-header":"default","disable-site-footer":"default","disable-content-area-spacing":"default","footnotes":""},"categories":[4,1,489,603],"tags":[3],"class_list":["post-1160","post","type-post","status-publish","format-standard","hentry","category-ai","category-ai-and-ml","category-chatbots","category-medicine","tag-ai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Chatbots Make Terrible Doctors, New Study Finds - Imperative Business Ventures Limited<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Chatbots Make Terrible Doctors, New Study Finds - Imperative Business Ventures Limited\" \/>\n<meta property=\"og:description\" content=\"Chatbots provided incorrect, conflicting medical advice, researchers found: \u201cDespite all the hype, AI just isn&#039;t ready to take on the role of the physician.\u201d\" \/>\n<meta property=\"og:url\" content=\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\" \/>\n<meta property=\"og:site_name\" content=\"Imperative Business Ventures Limited\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-09T16:01:31+00:00\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\"},\"author\":{\"name\":\"admin\",\"@id\":\"https:\/\/blog.ibvl.in\/#\/schema\/person\/55b87b72a56b1bbe9295fe5ef7a20b02\"},\"headline\":\"Chatbots Make Terrible Doctors, New Study Finds\",\"datePublished\":\"2026-02-09T16:01:31+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\"},\"wordCount\":880,\"keywords\":[\"AI\"],\"articleSection\":[\"AI\",\"AI and ML\",\"chatbots\",\"medicine\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\",\"url\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\",\"name\":\"Chatbots Make Terrible Doctors, New Study Finds - Imperative Business Ventures Limited\",\"isPartOf\":{\"@id\":\"https:\/\/blog.ibvl.in\/#website\"},\"datePublished\":\"2026-02-09T16:01:31+00:00\",\"author\":{\"@id\":\"https:\/\/blog.ibvl.in\/#\/schema\/person\/55b87b72a56b1bbe9295fe5ef7a20b02\"},\"breadcrumb\":{\"@id\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/blog.ibvl.in\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Chatbots Make Terrible Doctors, New Study Finds\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/blog.ibvl.in\/#website\",\"url\":\"https:\/\/blog.ibvl.in\/\",\"name\":\"Imperative Business Ventures Limited\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/blog.ibvl.in\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/blog.ibvl.in\/#\/schema\/person\/55b87b72a56b1bbe9295fe5ef7a20b02\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/blog.ibvl.in\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/4d20b2cd313e4417a599678e950e6fb7d4dfa178a72f2b769335a08aaa615aa9?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/4d20b2cd313e4417a599678e950e6fb7d4dfa178a72f2b769335a08aaa615aa9?s=96&d=mm&r=g\",\"caption\":\"admin\"},\"sameAs\":[\"https:\/\/blog.ibvl.in\"],\"url\":\"https:\/\/blog.ibvl.in\/index.php\/author\/admin_hcbs9yw6\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Chatbots Make Terrible Doctors, New Study Finds - Imperative Business Ventures Limited","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/","og_locale":"en_US","og_type":"article","og_title":"Chatbots Make Terrible Doctors, New Study Finds - Imperative Business Ventures Limited","og_description":"Chatbots provided incorrect, conflicting medical advice, researchers found: \u201cDespite all the hype, AI just isn't ready to take on the role of the physician.\u201d","og_url":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/","og_site_name":"Imperative Business Ventures Limited","article_published_time":"2026-02-09T16:01:31+00:00","author":"admin","twitter_card":"summary_large_image","twitter_misc":{"Written by":"admin","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/#article","isPartOf":{"@id":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/"},"author":{"name":"admin","@id":"https:\/\/blog.ibvl.in\/#\/schema\/person\/55b87b72a56b1bbe9295fe5ef7a20b02"},"headline":"Chatbots Make Terrible Doctors, New Study Finds","datePublished":"2026-02-09T16:01:31+00:00","mainEntityOfPage":{"@id":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/"},"wordCount":880,"keywords":["AI"],"articleSection":["AI","AI and ML","chatbots","medicine"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/","url":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/","name":"Chatbots Make Terrible Doctors, New Study Finds - Imperative Business Ventures Limited","isPartOf":{"@id":"https:\/\/blog.ibvl.in\/#website"},"datePublished":"2026-02-09T16:01:31+00:00","author":{"@id":"https:\/\/blog.ibvl.in\/#\/schema\/person\/55b87b72a56b1bbe9295fe5ef7a20b02"},"breadcrumb":{"@id":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/blog.ibvl.in\/index.php\/2026\/02\/09\/chatbots-make-terrible-doctors-new-study-finds\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/blog.ibvl.in\/"},{"@type":"ListItem","position":2,"name":"Chatbots Make Terrible Doctors, New Study Finds"}]},{"@type":"WebSite","@id":"https:\/\/blog.ibvl.in\/#website","url":"https:\/\/blog.ibvl.in\/","name":"Imperative Business Ventures Limited","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/blog.ibvl.in\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/blog.ibvl.in\/#\/schema\/person\/55b87b72a56b1bbe9295fe5ef7a20b02","name":"admin","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/blog.ibvl.in\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/4d20b2cd313e4417a599678e950e6fb7d4dfa178a72f2b769335a08aaa615aa9?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/4d20b2cd313e4417a599678e950e6fb7d4dfa178a72f2b769335a08aaa615aa9?s=96&d=mm&r=g","caption":"admin"},"sameAs":["https:\/\/blog.ibvl.in"],"url":"https:\/\/blog.ibvl.in\/index.php\/author\/admin_hcbs9yw6\/"}]}},"_links":{"self":[{"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/posts\/1160","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/comments?post=1160"}],"version-history":[{"count":0,"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/posts\/1160\/revisions"}],"wp:attachment":[{"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/media?parent=1160"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/categories?post=1160"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.ibvl.in\/index.php\/wp-json\/wp\/v2\/tags?post=1160"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}