A single-plate image pickup element picks up an eye fundus image. A tricolor separation color filter includes R, G, and B filter elements arranged in a mosaic so as to correspond to the pixels of the image pickup element. Each virtual pixel value of color image data is calculated from light detection data of adjacent pixels. Thus, image data of a color still image is generated. The R filter elements transmit near-infrared light. Each virtual pixel value is calculated from light detection data of pixels corresponding to B or G filter elements that are adjacent to the R filters and have sensitivity to near-infrared light. Thus, image data of a near-infrared light monochrome moving image is generated.