Rongsheng Wang   |   王荣胜🧑‍🚀

I am a first-year Ph.D. student at The Chinese University of Hong Kong, Shenzhen (CUHK-SZ), supervised by Prof. Benyou Wang. My research focuses on trustworthy medical large language models (LLMs) and multimodal large language models (MLLMs), as well as multimodal generative models for healthcare applications. I am a perfectionist: I want to solve the problems at hand perfectly, although that is often not achievable.

Email  /  Google Scholar  /  GitHub  /  CV


News

  • [Jan. 2026] One paper accepted by ICLR 2026, congrats to all co-authors! See you in Brazil!
  • [May 2025] Two papers accepted by ACL 2025, congrats to all co-authors!
  • [Apr. 2025] Our team won the gold medal in the AIMO-2 competition, ranking 14th out of 2213!
Publications

    † Equal contribution. * Corresponding author. The leading papers are highlighted.




    MicroVerse: A Preliminary Exploration Toward a Micro-World Simulation
    Rongsheng Wang†, Minghao Wu†, Hongru Zhou, Zhihan Yu, Zhenyang Cai, Junying Chen, Benyou Wang*
    Code / Paper / Dataset / Benchmark
    ICLR 2026 (Poster)

    MicroVerse is a world model designed to faithfully simulate biological processes at the microscale.



    MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos
    Rongsheng Wang†, Junying Chen†, Ke Ji, Zhenyang Cai, Shunian Chen, Benyou Wang*
    Code / Paper / Dataset
    Under Review

    MedGen is a medical video generation model, built on the large-scale, caption-rich MedVideoCap-55K dataset.



    Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging
    Zhenyang Cai†, Junying Chen†, Rongsheng Wang†, Weilong Wang, Yonglin Deng, Dingjie Song, Yize Chen, Zixu Zhang, Benyou Wang*
    Code / Paper / Dataset
    ACL 2025 (Main)

    Multimodal LLMs achieve strong generalization in medical imaging largely through compositional generalization (CG).

    My other publications are listed on my Google Scholar profile.


    Experience
    The Chinese University of Hong Kong, Shenzhen (CUHK-SZ)
    2024.09 - 2025.08
    Research Assistant
    Research Advisor: Prof. Benyou Wang

    Projects

    🚀 Build in Open. Grow Together.



    Service and Teaching

  • Journal Reviewer: JBHI, TMI

    Awards and Honors

  • 2025: Gold Medal in the Kaggle AI Mathematical Olympiad - Progress Prize 2, AIMO-2 (Link, Rank 14/2212 globally)
  • 2023: Silver Medal in the Kaggle RSNA Screening Mammography Cancer Detection competition (Link, Rank 56/1687 globally)
  • 2021: Baidu PaddlePaddle Developers Experts, PPDE (Link)

