A fundus oculi observation device 1 can form a near-infrared motion image and a color image (still image) of a fundus oculi Ef. The device 1 specifies an image region within the near-infrared motion image corresponding to a region of interest within the color image while the near-infrared motion image is being formed. The device 1 scans with a signal light LS based on the specified image region, thereby forming a tomographic image along the scanning line. According to the device 1, it is possible to determine a region of interest within a still image having a comparatively high image quality, specify the image region within the motion image corresponding to this region of interest, set a measurement site for the tomographic image.