An image processing apparatus (10) that generates one image by using at least one frame from each of a plurality of moving images, the moving images being obtained by imaging a plurality of different regions of an eye at different times, includes: deciding means configured to decide the at least one frame in each of the plurality of moving images so that regions which have actually been shot in the plurality of moving images, among the plurality of regions, are included; and image generating means configured to generate the one image by using the at least one frame decided from each of the plurality of moving images.
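The division of work between the deciding means and the image generating means can be illustrated with a minimal sketch. The abstract does not state a selection criterion or a compositing rule, so the following are assumptions made only for illustration: frames are already registered to a shared mosaic grid, each frame is a dict mapping a (row, col) position to an intensity, a position missing from the dict was not actually captured, the chosen frames are the combination covering the most positions, and overlapping positions are averaged. The names `decide_frames` and `generate_image` are hypothetical.

```python
from itertools import product


def decide_frames(videos):
    """Pick one frame index per moving image so the covered area is largest.

    Assumption: `videos` is a list of moving images, each a list of frames,
    and each frame is a dict {(row, col): intensity} on a shared grid.
    The search is exhaustive, so it only suits small inputs.
    """
    best_choice, best_cover = None, -1
    for choice in product(*(range(len(video)) for video in videos)):
        covered = set()
        for video, idx in zip(videos, choice):
            covered.update(video[idx].keys())
        # Strict comparison keeps the first combination found on a tie.
        if len(covered) > best_cover:
            best_choice, best_cover = choice, len(covered)
    return best_choice


def generate_image(videos, choice):
    """Composite the decided frames, averaging positions seen more than once."""
    sums, counts = {}, {}
    for video, idx in zip(videos, choice):
        for pixel, value in video[idx].items():
            sums[pixel] = sums.get(pixel, 0.0) + value
            counts[pixel] = counts.get(pixel, 0) + 1
    return {pixel: sums[pixel] / counts[pixel] for pixel in sums}


# Two moving images of neighbouring regions, taken at different times.
# Frame 0 of video_a and frame 1 of video_b lost part of their region.
video_a = [
    {(0, 0): 12.0, (0, 1): 22.0},
    {(0, 0): 10.0, (0, 1): 20.0, (0, 2): 30.0, (0, 3): 40.0},
]
video_b = [
    {(0, 3): 60.0, (0, 4): 70.0, (0, 5): 80.0},
    {(0, 5): 82.0},
]

choice = decide_frames([video_a, video_b])
image = generate_image([video_a, video_b], choice)
print(choice)
print(sorted(image.items()))
```

In this toy input, the second frame of `video_a` and the first frame of `video_b` together cover all six positions, so that pair is chosen, and the shared position (0, 3) is averaged. A real apparatus would likely use a cheaper search and an image quality measure as well; neither is specified in the abstract.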