May 13, 2026
2026
New paper on improving alignment between video and audio. Check out When Vision Speaks for Sound!