Paper 13407-117
A video classification method for diagnosing middle ear infections using otoscope imaging
19 February 2025 • 5:30 PM - 7:00 PM PST | Golden State Ballroom
Abstract
Otoscopy, visual examination of the ear, is essential for diagnosing middle ear infections, but accurate diagnosis requires a degree of clinical expertise. Automated deep learning methods have been explored to aid in diagnosis, typically relying on selected otoscope images. However, in real-world scenarios, clinical expert selection is not always feasible, and variations in brightness and color pose challenges. We propose a video classification approach using VideoMAE, leveraging all frames in a video to improve accuracy and eliminate the need for human selection. Our study developed a deep learning model to classify otoscope videos for seven middle ear conditions. Data augmentation techniques enhanced model generalization, and we resampled videos for balanced representation. The model, trained with 224x224 pixel videos with 16 frames each, optimized parameters to minimize classification errors. Results showed improved accuracy with data augmentation and balanced resampling, achieving 92.1%±4.0% accuracy.
Presenter
Hao Lu
Wake Forest Univ. School of Medicine (United States)
Hao Lu is a posdoc, working in center of artificial intelligent research in Wake Forest University school of Medicine. My research is focusing on using AI technic to assist doctors in diagnosis.