Demo

PEEK: frame selection for video captioning

Upload a video and let PEEK pick its most informative frames. PEEK is distilled from vision-language teacher models and selects frames without watching every one.
