Publications

You can also find my articles on my Google Scholar profile.

Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

Published in arXiv, 2025

Recommended citation: Keunwoo Yu, Joyce Chai, "Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models." arXiv, 2025.
Download Paper

Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model

Published in arXiv, 2024

Access paper here

Recommended citation: Keunwoo Yu, Achal Dave, Rares Ambrus, Jean Mercat, "Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model." arXiv, 2024.
Download Paper

Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

Published in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Access paper here

Recommended citation: Keunwoo Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai, "Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties." Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024.
Download Paper

Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?

Published in Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Access paper here

Recommended citation: Yuwei Bao, Keunwoo Yu, Yichi Zhang, Shane Storks, Itamar Bar-Yossef, Alex Iglesia, Megan Su, Xiao Zheng, Joyce Chai, "Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?." Findings of the Association for Computational Linguistics: EMNLP 2023, 2023.
Download Paper

NLP Reproducibility For All: Understanding Experiences of Beginners

Published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Access paper here

Recommended citation: Shane Storks, Keunwoo Yu, Ziqiao Ma, Joyce Chai, "NLP Reproducibility For All: Understanding Experiences of Beginners." Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023.
Download Paper

DANLI: Deliberative Agent for Following Natural Language Instructions

Published in Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Access paper here

Recommended citation: Yichi Zhang, Jianing Yang, Jiayi Pan, Shane Storks, Nikhil Devraj, Ziqiao Ma, Keunwoo Yu, Yuwei Bao, Joyce Chai, "DANLI: Deliberative Agent for Following Natural Language Instructions." Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022.
Download Paper

Keunwoo Peter Yu

Publications

Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models

Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model

Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?

NLP Reproducibility For All: Understanding Experiences of Beginners

DANLI: Deliberative Agent for Following Natural Language Instructions