A system may be configured to translate content obtained and/or presented by a media cast device into different languages. The translation may be performed by translating the text of closed captioning information provided with the content and generating audio based on the translated text. The translation may be performed independently of music or sound effects, such that only speech is replaced, without affecting other portions of the audio.
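
The flow described above may be illustrated by a minimal sketch in Python. It is not the disclosed implementation: every name in it (`Caption`, `SpeechClip`, `translate_captions`, `synthesize_speech`, `mix_speech_over_background`) is hypothetical, the translation and text-to-speech engines are passed in as plain callables, and the sketch assumes that the music and sound effects are already available as a track separate from the original speech. Audio is modeled as a list of float samples to keep the example self-contained.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Caption:
    """One closed-caption cue (hypothetical structure)."""
    start: float  # cue start time, in seconds
    end: float    # cue end time, in seconds
    text: str


@dataclass
class SpeechClip:
    """Synthesized speech for one cue (hypothetical structure)."""
    start: float
    samples: List[float]


def translate_captions(captions: List[Caption],
                       translate: Callable[[str], str]) -> List[Caption]:
    """Translate the text of each cue while keeping its timing."""
    return [Caption(c.start, c.end, translate(c.text)) for c in captions]


def synthesize_speech(captions: List[Caption],
                      tts: Callable[[str], List[float]]) -> List[SpeechClip]:
    """Generate audio samples from the (translated) text of each cue."""
    return [SpeechClip(c.start, tts(c.text)) for c in captions]


def mix_speech_over_background(background: List[float],
                               clips: List[SpeechClip],
                               sample_rate: int) -> List[float]:
    """Add each speech clip to the non-speech track at its cue start time.

    Assumption: `background` holds only music and sound effects, so the
    original speech is replaced rather than layered under the new speech.
    """
    mixed = list(background)  # copy so the input track is not modified
    for clip in clips:
        offset = round(clip.start * sample_rate)
        for i, sample in enumerate(clip.samples):
            position = offset + i
            if position >= len(mixed):
                break  # drop speech that would run past the end of the track
            mixed[position] += sample
    return mixed


def translate_audio(captions: List[Caption],
                    background: List[float],
                    sample_rate: int,
                    translate: Callable[[str], str],
                    tts: Callable[[str], List[float]]) -> List[float]:
    """Run the full caption -> translated text -> speech -> mix pipeline."""
    translated = translate_captions(captions, translate)
    clips = synthesize_speech(translated, tts)
    return mix_speech_over_background(background, clips, sample_rate)
```

Passing the translation and speech engines as callables keeps the sketch independent of any particular service. A real system would also need to fit the synthesized speech to each cue's duration, which is omitted here.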