An image processing apparatus includes an information obtaining unit configured to obtain three-dimensional polarization sensitive tomographic information and three-dimensional motion contrast information about a subject based on tomographic signals of lights having different polarizations, the lights being obtained by splitting a combined light obtained by combining a returned light from the subject illuminated with a measurement light with a reference light corresponding to the measurement light, an obtaining unit configured to obtain a lesion region of the subject using the three-dimensional polarization sensitive tomographic information, and an image generation unit configured to generate an image in which the lesion region is superimposed on a motion contrast image generated using the three-dimensional motion contrast information.