An image display apparatus obtains a plurality of moving images obtained by capturing a plurality of imaging areas of a fundus, and a wide field of view image obtained by capturing an area including the plurality of imaging areas of the fundus. Each of the plurality of moving images is associated with pulse data based on a biomedical signal obtained in capturing the moving image. The image display apparatus superimposes and displays at least one frame of each of the plurality of moving images at a position on the wide field of view image, which is determined based on information about the positions of the plurality of imaging areas. In the superimposing/display operation, the image display apparatus displays the plurality of moving images at a display timing synchronized based on the pulse data.