Provided is an information processing apparatus including a sound output control unit configured to generate localization information of a sound marker based on a virtual position, and a sound output unit configured to output a sound associated with the sound marker, based on the localization information, wherein the virtual position is determined based on a position of a real object present in a space.