Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2
In submission to IEEE TMI, 2025
Surgical workflow analysis is essential in robot-assisted surgeries, yet the long duration of such procedures poses significant challenges for comprehensive video analysis. Recent approaches have predominantly relied on transformer models; however, their quadratic attention mechanism restricts efficient processing of lengthy surgical videos. In this paper, we propose a novel hierarchical input-dependent state space model that leverages the linear scaling property of state space models to enable decision making on full-length videos while capturing both local and global dynamics. Our framework incorporates a temporally consistent visual feature extractor, which appends a state space model head to a visual feature extractor to propagate temporal information. The proposed model consists of two key modules: a local-aggregation state space model block that effectively captures intricate local dynamics, and a global-relation state space model block that models temporal dependencies across the entire video. The model is trained using a hybrid discrete-continuous supervision strategy, where both signals of discrete phase labels and continuous phase progresses are propagated through the network. Experiments have shown that our method outperforms the current state-of-the-art methods by a large margin (+2.8% on Cholec80, +4.3% on MICCAI2016, and +12.9% on Heichole datasets). Code will be publicly available after paper acceptance.
Haoyang Wu, Tsun-Hsuan Wang, Mathias Lechner, Ramin Hasani, Jennifer A. Eckhoff, Paul Pak, Ozanan R. Meireles, Guy Rosman, Yutong Ban, Daniela Rus
Download Paper | Download Slides
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Advisor at UMJI Advising Center, Shanghai Jiao Tong University, 2024
Academic Advisor for undergraduate students: Provided guidance on academic planning, course selection, and career development.
ENGL1000J Academic Writing I, Shanghai Jiao Tong University, 2024
Teaching Assistant for ENGL1000J: Held office hours to assist students in revising and improving their course papers.
ECE2800J Programming and Elem. Data Structure, Shanghai Jiao Tong University, 2025
Teaching Assistant for ECE2800J: Held office hours and recitation sessions, and assisted with grading homework and exams.
ECE2810J Advanced Data Structures and Algorithms, Shanghai Jiao Tong University, 2025
Teaching Assistant for ECE2810J: Held office hours and recitation sessions, and assisted with grading homework and exams.