How Does Video Captioning Improve Listening Comprehension?