Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

Published in arXiv, 2025

Access paper here

Recommended citation: Keunwoo Yu, Joyce Chai, "Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models." arXiv, 2025.
Download Paper