Abstract: Book-flipping videos pose a distinctive challenge for information extraction: frames in which the text is clearly visible must be identified amid dynamic page turns. This paper ...