Was this helpful?
Thumbs UP Thumbs Down

Microsoft claims new AI diagnoses 4X more accurately than doctors

medical workers in laboratory
Close up view of the hand holding smartphone with Microsoft AI logo.

What is MAI‑DxO?

MAI‑DxO stands for Microsoft AI Diagnostic Orchestrator. It’s a new AI system designed to solve complex medical cases by mimicking how teams of expert doctors collaborate. Instead of relying on one AI model, it combines the strengths of several top models to simulate group decision-making.

The system walks through symptoms, recommends tests, and debates possible diagnoses step by step. It’s built to function like a virtual panel of doctors working together to deliver more accurate medical conclusions.

Microsoft used real case studies

Microsoft used 304 diagnostic case studies from the New England Journal of Medicine to test the AI. These cases are known for their difficulty and complexity. Each requires thinking like a real doctor, using available symptoms, test results, and patient history.

The AI had to act like it was treating a real person, requesting additional tests when needed, and narrowing down possibilities over time. This method simulated a realistic diagnostic workflow rather than just simple question-answering.

Man interacted with artificial intelligence.

AI outperformed experienced doctors

Microsoft’s system achieved 85.5 percent accuracy across those 304 complex cases. In contrast, a panel of 21 experienced doctors scored closer to 20 percent. That’s where the claim comes from: AI is four times more accurate than physicians.

The comparison wasn’t about speed or efficiency but diagnostic accuracy under identical conditions. All participants had access to the duplicate case files, and the AI showed stronger consistency when narrowing down the correct diagnosis.

friendly colleagues smiling while looking at the dna model

AI requested fewer and smarter tests

Aside from accuracy, MAI‑DxO also showed more intelligent decision-making when ordering medical tests. Instead of requesting every possible scan or lab result, the AI picked fewer but more relevant tests.

In doing so, it reduced the average cost of diagnosis by around 20 percent compared to the panel of doctors. This is a big deal in healthcare, where overtesting can lead to financial and medical complications. The AI showed strong judgment in balancing thoroughness with efficiency.

AI agent

It uses debate-style reasoning

MAI‑DxO is built on a unique process called chain-of-debate reasoning. It doesn’t just produce one answer. Multiple AI agents are assigned different roles, like proposing a diagnosis or challenging a decision. They interact, question each other, and refine their thinking.

This approach simulates how specialists might work through a case, each bringing a different perspective. It helps the AI avoid rushed answers and produce a more thoughtful diagnosis, which is key for handling complex medical issues.

Open AI logo displayed on a phone

It works with multiple AI models

The orchestrator isn’t tied to a single model. Instead, it can coordinate and guide large language models like OpenAI, Google Gemini, Claude, etc. This flexibility lets it play to each model’s strengths.

For example, one model might be good at gathering patient history, while another might shine in proposing likely diagnoses. OpenAI’s model performed the strongest in testing, but even weaker models improved significantly when used under MAI‑DxO’s guidance and reasoning structure.

Doctor using smartphone

Doctors had limited tools during testing

Doctors were prohibited from using outside resources like textbooks, internet searches, or team consultations in the testing process. This setup made the comparison more focused but also somewhat unrealistic.

In real-world settings, doctors rely on references and peer input. So, while the AI’s accuracy is impressive, the study does not reflect how doctors perform with full access to tools. Still, the results show that the AI can compete without real-time help or prompts.

Portrait of a female doctor holding a patient clipboard to

Not tested in live clinical settings

The system has not yet been used with patients in hospitals or clinics. All testing was done using case files, which limits how much we can conclude about its real-world safety. Diagnosing people involves more than data; it includes physical exams, emotional awareness, and patient feedback.

Microsoft acknowledged this and said the next step is full clinical trials. Until that happens, the technology remains a promising research tool rather than a licensed medical assistant.

A man is with a female doctor having a professional medical consultation

AI still lacks human empathy

While AI is excellent at reasoning and test selection, it does not replace the human side of medicine. Doctors build relationships, explain diagnoses, and comfort patients, none of which the AI can currently do.

There’s also the question of ethical decision-making in cases involving quality of life or personal values. Experts stress that AI should serve as a second opinion or support tool, not a standalone replacement for physicians. Emotional intelligence is still out of reach for AI systems.

medical workers in laboratory

Doctors could still lead treatment decisions

Even with advanced AI tools like MAI‑DxO, treatment planning involves much more than identifying a disease. Doctors often weigh risks, patient preferences, and insurance realities.

AI may handle the diagnosis, but only a human can decide whether a patient is ready for a specific treatment, wants to try an alternative, or needs additional counseling. If used responsibly, AI could save time and reduce errors, but doctors will still make judgment calls that go beyond raw facts.

A man's hand holding a smartphone with Bing AI

Microsoft sees future integration in tools

Microsoft hinted that MAI‑DxO might eventually be embedded into platforms like Bing and Copilot. These services already handle millions of health-related queries. Adding a diagnostic reasoning tool could offer more accurate responses to users trying to understand symptoms.

This could be especially helpful in areas with limited access to medical care. However, MAI‑DxO remains a research project, not a consumer product. Any future launch would need proper testing and regulatory approval first.

Could AI help address healthcare shortages

In parts of the world where there are not enough doctors or specialists, AI tools like MAI‑DxO could help fill the gap. This technology might improve early detection and reduce misdiagnosis by offering expert-level diagnostics, even in basic clinics or remote areas.

This only works if the tools are accessible, affordable, and adapted to local medical systems. Without infrastructure and follow-up care, accurate diagnoses alone won’t lead to better health outcomes.

Medicine and health concept on the table is a stethoscope

Clinical trials are the next step

Microsoft has clarified that before MAI‑DxO can be used in real patient settings, it must undergo formal clinical trials. These studies will test how well it works with live patients, how doctors interact, and whether it improves outcomes or safety.

Clinical trials are the gold standard in medicine for validating new tools. Without them, no AI system can be trusted or approved for regular use in hospitals or clinics.

AI ethics and law in artificial intelligence governance icons related.

Ethical review is still pending

Introducing AI into healthcare comes with ethical challenges. What happens if the AI gets it wrong? Who is legally responsible? How do you explain a diagnosis to a patient when it comes from a machine?

Microsoft’s system offers transparency in reaching conclusions, but that alone isn’t enough. Before deploying the system, regulators and medical boards must weigh privacy, accountability, and fairness. Trust must be earned through careful oversight and testing.

Google AI logo on the screen of mobile

Google’s earlier system was less accurate

Google previously tested its diagnostic AI on medical case files, achieving about 59 percent accuracy. That was considered a significant success at the time. Microsoft’s new result of 85 percent shows how quickly the field is advancing.

The difference may come from Microsoft’s use of multiple models and debate-style reasoning, which creates more robust answers. This also shows that competition among tech companies could produce even better diagnostic tools shortly.

With Google’s earlier system falling short on accuracy, Microsoft is stepping up. See how Microsoft is changing healthcare with AI.

AI technology supports doctors in diagnosing complex conditions enabling more

Doctors remain central to AI future

Most experts agree that doctors will remain central to healthcare even with high-performing systems like MAI-DxO. AI can help reduce diagnostic errors, manage workloads, and suggest possibilities that a busy human might miss.

But people still want to be treated by someone they trust. The future likely involves collaboration between doctors and AI rather than replacing each other. Used wisely, this technology could support better care without removing the personal touch that medicine depends on.

As AI tools grow smarter, doctors stay crucial, but could AI diagnose you before your doctor does? Find out if AI will be able to diagnose you before your doctor does or not.

Do you think AI will complement doctors or compete with them? Share your thoughts in the comments.
Read More From This Brand:

Don’t forget to follow us for more exclusive content on MSN.

If you liked this story, you’ll LOVE our FREE emails. Join today and be the first to get stories like this one.

This slideshow was made with AI assistance and human editing.

This content is exclusive for our subscribers.

Get instant FREE access to ALL of our articles.

Was this helpful?
Thumbs UP Thumbs Down
Prev Next
Share this post

Lucky you! This thread is empty,
which means you've got dibs on the first comment.
Go for it!

Send feedback to ComputerUser



    We appreciate you taking the time to share your feedback about this page with us.

    Whether it's praise for something good, or ideas to improve something that isn't quite right, we're excited to hear from you.