Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models
Published in arXiv, 2025
Recommended citation: Keunwoo Yu, Joyce Chai, "Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models." arXiv, 2025.
Download Paper