

TVRecap Dataset

The TVRecap dataset was collected for text recap extraction from TV series. It contains processed scripts, subtitles, and synopses gathered from websites. Ground truth is provided to support future research on this challenging topic. TVRecap covers all seasons of the widely known show “Lost”, for a total of 106 episodes.

To get access to the dataset, please fill out the EULA form.