About Me
I am a Postdoctoral Researcher in the MARVL lab at Stanford University, advised by Prof. Serena Yeung-Levy. My research interests include computer vision and machine learning. I am currently exploring vision-language models for the automated interpretation of biological images, with the goal of developing new methods tailored to biomedical research.
I received my Ph.D. from the Australian National University in 2024, under the supervision of Prof. Liang Zheng, and was fortunate to also receive guidance from Prof. Hongdong Li. During the final six months of my Ph.D., I worked as a Research Associate at Nanyang Technological University, advised by Prof. Ying Wei. Before that, I was a Research Assistant at the Singapore University of Technology and Design from 2018 to 2019. I received my M.Sc. from Nankai University in 2018, supervised by Prof. Jufeng Yang.
Recent News
- Co-organizing the 5th DataCV Workshop & Challenge at CVPR 2026. Submissions welcome! [Workshop Website] The challenge includes two tracks: (1) Classic Visual Illusions Understanding; (2) Real-World Visual Illusions and Anomalies Understanding.
- One paper was accepted to the Machine Learning for Health Symposium 2025: "No tokens wasted: Leveraging long context in biomedical vision-language models" [Paper]. Congrats to Min Woo Sun!
- Co-organizing the 4th DataCV Workshop & Challenge at ICCV 2025. Submissions welcome! [Workshop Website]
- One paper was accepted to ICLR 2025: "CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models" [Paper]
- Two papers were accepted to ICLR 2024: "Alice Benchmarks: Connecting Real World Object Re-Identification with the Synthetic" [Paper]; "CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis" [Paper]
- One paper was accepted to NeurIPS 2023 as a spotlight presentation: "Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception?" [Paper] [Project]

Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions
arXiv:2601.22150, 2026

Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies
arXiv:2602.01816, 2026

From Panel to Pixel: Zoom-In Vision-Language Pretraining from Biomedical Scientific Literature
arXiv:2512.02566, 2025

No tokens wasted: Leveraging long context in biomedical vision-language models
Machine Learning for Health Symposium, 2025

Rendering-Refined Stable Diffusion for Privacy Compliant Synthetic Data
arXiv:2412.06248, 2024

Clinical Skin Lesion Diagnosis using Representations Inspired by Dermatologist Criteria
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018

Historical Context-based Style Classification of Painting Images via Label Distribution Learning
ACM International Conference on Multimedia (ACM MM), 2018
