An image processing apparatus processes an image generated using at least one item of three-dimensional tomographic data out of a plurality of items of three-dimensional tomographic data. The plurality of items are obtained by conducting a plurality of optical coherence tomography scans of a subject at different times, using measuring light controlled to scan the same position of the subject. The apparatus includes: a front image generating unit configured to generate a front image of the subject using the at least one item of three-dimensional tomographic data; a motion contrast image generating unit configured to generate a motion contrast image of the subject using the plurality of items of three-dimensional tomographic data; and a display control unit configured to display the front image on a display unit before displaying the motion contrast image.
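
The arrangement of the three units may be illustrated by the following minimal sketch. It is not the disclosed implementation: the abstract does not state how either image is computed, so the sketch assumes that the front image is a mean-intensity projection along the depth axis and that motion contrast is the inter-scan intensity variance, also projected along depth. The function names `front_image`, `motion_contrast_image`, and `display_sequence`, the `(depth, y, x)` array layout, and the `show` callback standing in for the display unit are all hypothetical.

```python
import numpy as np


def front_image(volume):
    """Assumed front image: mean intensity along the depth axis (axis 0)."""
    return volume.mean(axis=0)


def motion_contrast_image(volumes):
    """Assumed motion contrast: variance across repeated scans, averaged over depth.

    Static tissue yields values near zero; voxels whose signal changes
    between scans (for example, flowing blood) yield larger values.
    """
    stack = np.stack(volumes, axis=0)   # shape: (n_scans, depth, y, x)
    variance = stack.var(axis=0)        # shape: (depth, y, x)
    return variance.mean(axis=0)        # shape: (y, x)


def display_sequence(volumes, show):
    """Show the front image first, then the motion contrast image.

    `show` is a hypothetical callback taking (name, image); it stands in
    for the display unit driven by the display control unit.
    """
    # The front image needs only one volume, so it can be shown at once.
    show("front", front_image(volumes[0]))
    # Motion contrast needs every repeated scan, so it is shown afterwards.
    show("motion_contrast", motion_contrast_image(volumes))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    scans = [rng.random((8, 4, 4)) for _ in range(3)]  # synthetic data only
    display_sequence(scans, lambda name, img: print(name, img.shape))
```

Because the front image in this sketch depends on a single volume, it can be presented as soon as one scan is available, whereas the motion contrast image requires all repeated scans; this is one plausible reason for the stated display order.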