AI scribes and LLMs’ potential impact on cardiology just became a lot clearer after a Stanford RCT published in Nature Medicine demonstrated that Google’s AMIE system helped general cardiologists better diagnose and plan treatment for rare heart conditions.
- The WHO predicts an 18M healthcare provider deficit by 2030, with the shortage hitting hardest in subspecialties like those across cardiology.
- Large language models might help alleviate this shortage, but rigorous LLM RCTs are rare across medical specialties with zero RCTs evaluating LLMs in cardiology until now.
- Among the healthcare LLMs is Google’s AMIE, which focuses on gathering conversational information and interpreting medical scans like lab tests and ECGs.
Researchers enrolled nine general cardiologists to evaluate 107 patients with suspected genetic cardiovascular disease. Each case was then reviewed by two cardiologists (one with access to AMIE and one without) and three subspecialists later confirmed the diagnoses.
- AMIE-assisted assessments were more often supported by the subspecialists (46.7% vs 32.7%).
- As were the resulting management plans (45.8% vs 29.9%),
- And diagnostic recommendations (43.9% vs 30.8%).
When it came to catching mistakes…
- Clinical errors occurred in 13% of AI-assisted versus 24% of unassisted assessments.
- Meanwhile general cardiologists reported AMIE improved their assessments in 57% of cases, increased confidence in 52%, and saved time in 50%.
But AMIE also made some mistakes of its own, with hallucinations occurring in 6.5% of cases.
- The hallucinations were mild (such as assuming patient gender or fabricating minor imaging findings), and they often self-correcting when challenged by cardiologists.
What is perhaps most remarkable about the study is that AMIE adapted to this level of subspecialist cardiology with considerable data efficiency, requiring feedback from experts on just nine cases.
Taking it all into consideration in their discussion, the study’s authors suggest that AMIE is still better at improving treatment planning than initial diagnostic reasoning, with that part still best left to expert physicians.
The Takeaway
This study is notable in the cardiology community as the first RCT to truly evaluate how an LLM can support general cardiologists with handling complex CV conditions that usually belong to subspecialists. The results suggest it’s a strong starting point that could help improve treatment planning and management, while the verdict is still out on diagnostics.





