In an exemplary embodiment, an image display system 1 includes an input unit 50 and an information processing unit 10; wherein an input unit has a massaging apparatus 54 for massaging a human body part, and a microphone 52 placed inside or outside the massaging apparatus to detect sounds near the massaging apparatus; and the information processing unit has a sound volume determination means 310 for determining whether or not the sound volume of the sounds detected by the microphone exceeds a pre-determined threshold, and a display means 90 for displaying content constituted by a combination of image data and sound data, by modifying them according to the determination result as determined by the sound volume determination means. The image display system can augment massaging.