Weike Zhao

Weike Zhao (赵唯珂)

I'm a PhD candidate at Shanghai Jiao Tong University (SJTU), advised by Prof. Weidi Xie and Prof. Ya Zhang.

My current research interests include, but are not limited to, Artificial Intelligence for Medicine (AI4Med) and Multimodal Perception.

zwk0629[at]sjtu.edu.cn

zhaoweike2000

Research

* denotes equal contribution, and denotes corresponding author.
MedRBench
Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
Technical Report, 2025

In this study, we quantitatively evaluate the free-text reasoning abilities of various state-of-the-art LLMs, such as DeepSeek-R1 and OpenAI-o3-mini, in assessment recommendation, diagnostic decision, and treatment planning.

RaTEScore
RaTEScore: A Metric for Radiology Report Generation
EMNLP 2024 main paper

RaTEScore is a novel, entity-aware metric to assess the quality of medical reports generated by AI models. It emphasizes crucial medical entities such as diagnostic outcomes and anatomical details, and is robust against complex medical synonyms and sensitive to negation expressions. The evaluations demonstrate that RaTEScore aligns more closely with human preference than existing metrics.

RP3D-Diag
Large-scale Long-tailed Disease Diagnosis on Radiology Images
Nature Communications, 2024

In this paper, we build an academically accessible, large-scale diagnostic dataset encompassing 5,568 disorders linked with 930 unique ICD-10-CM codes, containing 39,026 cases (192,675 scans). We also present a novel architecture that can process an arbitrary number of input scans from various imaging modalities, and we establish a new benchmark for multi-modal, multi-anatomy long-tailed diagnosis.

GPT4V Evaluation
Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis
Technical Report, 2023

In this report, we evaluate GPT-4V for multimodal medical diagnosis through case studies covering 17 human body systems across 8 clinical imaging modalities. As the cases show, GPT-4V is still far from ready for clinical use.

Hobbies

🏂
🎬
🎯
🎵
🎻
🏃
🏊
🏓
🏸
📸
🤸
🏔
🎱
🎾