PROBLEM TO BE SOLVED: To provide a device and a method for efficiently producing a subtitle from a voice.SOLUTION: A voice recognition part 30 performs voice recognition of an object voice 10 or a voice obtained by repeating the object voice 10 and converts the voice into a text. A text division/connection processing part 40 performs division processing of the text after the voice is recognized, for creating a subtitle text. A keyboard correction part 60 corrects the subtitle text. A delay part 82 outputs plural delay voices obtained by delaying the object voice 10 by prescribed different times. A delay switching switch 84 switches the plural delay voices output by the delay part 82 and provides the delay voice to the keyboard correction part 60, based on an instruction of a person who corrects the subtitle text.SELECTED DRAWING: Figure 2