In this talk, I will introduce GaVA-CLIP, a new knowledge-augmented framework for gait video analysis, aimed at assessing diagnostic groups and gait impairment. Built on the powerful CLIP vision–language model, GaVA-CLIP learns from three complementary sources: gait videos, medical descriptions of classes, and numerical gait parameters.
Our contributions are twofold. First, we use a knowledge-aware prompt tuning strategy that leverages class-specific medical descriptions to guide text learning. Second, we incorporate paired gait parameters as “numerical text,” enhancing the model’s ability to reason quantitatively.
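To make the idea of "numerical text" concrete, here is a minimal sketch of how a class-level medical description and paired gait parameters might be rendered into a single text prompt for a CLIP-style text encoder. All names, the class label, the description, and the prompt template are illustrative assumptions, not GaVA-CLIP's actual implementation.

```python
# Hypothetical sketch: turning a medical class description and numerical
# gait parameters into a text prompt for a CLIP-style model.
# The template and parameter names are assumptions for illustration only.

def build_prompt(class_name: str, description: str, gait_params: dict) -> str:
    """Combine a medical description with gait parameters rendered as text."""
    # Render each paired numerical gait parameter as a readable fragment.
    numeric_text = ", ".join(f"{k} of {v:.2f}" for k, v in gait_params.items())
    return f"a gait video of a {class_name} patient, {description}, with {numeric_text}"

prompt = build_prompt(
    "parkinsonian",  # hypothetical diagnostic class
    "characterized by short shuffling steps and reduced arm swing",
    {"stride length (m)": 0.82, "cadence (steps/min)": 96.0},
)
print(prompt)
```

The intuition is that once quantitative measures are serialized into natural language, the same text encoder that handles the medical descriptions can also attend to the numbers, letting one model reason over both sources jointly.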
I will show how this approach not only surpasses state-of-the-art methods in classifying gait videos, but also generates human-readable explanations that combine medical terminology with quantitative gait measures. I’ll conclude by sharing how this opens the door to more interpretable, clinically relevant video analysis, and point to our public release of code and models.
Hyewon Seo is a research director at CNRS (Centre National de la Recherche Scientifique), affiliated with the Université de Strasbourg. She earned her B.Sc. and M.Sc. degrees in Computer Science from KAIST and completed her Ph.D. at MIRALab. Prior to joining CNRS, she served as an assistant professor at Chungnam National University in South Korea. Dr. Seo’s research expertise centers on 3D and 4D shape analysis and modeling, with a strong focus on human data. Over her career, she has authored around 70 peer-reviewed publications. Additionally, she has contributed significantly to the scientific community by serving on editorial boards, most notably as Associate Editor-in-Chief for The Visual Computer (2016–2020), and by co-organizing key international conferences such as CGI 2015, SPM/SMI 2020, and CASA 2025.