MuAViC: The First Audio-Video Speech Translation Benchmark
In countless everyday situations, background noise — the sound of traffic, music, other people speaking – makes it more difficult to understand what other people are saying. Humans often use…
Share