Program now available
Registration open
>
16 - 20 February 2025
San Diego, California, US
Conference 13407 > Paper 13407-117
Paper 13407-117

A video classification method for diagnosing middle ear infections using otoscope imaging

19 February 2025 • 5:30 PM - 7:00 PM PST | Golden State Ballroom

Abstract

Otoscopy, visual examination of the ear, is essential for diagnosing middle ear infections, but accurate diagnosis requires a degree of clinical expertise. Automated deep learning methods have been explored to aid in diagnosis, typically relying on selected otoscope images. However, in real-world scenarios, clinical expert selection is not always feasible, and variations in brightness and color pose challenges. We propose a video classification approach using VideoMAE, leveraging all frames in a video to improve accuracy and eliminate the need for human selection. Our study developed a deep learning model to classify otoscope videos for seven middle ear conditions. Data augmentation techniques enhanced model generalization, and we resampled videos for balanced representation. The model, trained with 224x224 pixel videos with 16 frames each, optimized parameters to minimize classification errors. Results showed improved accuracy with data augmentation and balanced resampling, achieving 92.1%±4.0% accuracy.

Presenter

Hao Lu
Wake Forest Univ. School of Medicine (United States)
Hao Lu is a posdoc, working in center of artificial intelligent research in Wake Forest University school of Medicine. My research is focusing on using AI technic to assist doctors in diagnosis.
Application tracks: AI/ML
Presenter/Author
Hao Lu
Wake Forest Univ. School of Medicine (United States)
Author
Seda Camalan
Wake Forest Univ. School of Medicine (United States)
Author
The Ohio State Univ. (United States)
Author
Vanderbilt Univ. Medical Ctr. (United States)
Author
Wake Forest School of Medicine (United States)