An image processing apparatus includes first alignment means configured to perform, according to a first method, an alignment in a horizontal direction on a plurality of two-dimensional tomographic images obtained based on measurement light controlled to scan an identical position of an eye, and second alignment means configured to perform an alignment in a depth direction on the plurality of two-dimensional tomographic images according to a second method that is different from the first method.
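
As an illustration only, the two-stage alignment can be sketched in Python. The abstract does not specify either method, so both are assumptions chosen for the example: the horizontal ("first") method is taken to be a search over integer shifts that minimises the mean absolute difference between column-intensity profiles, and the depth ("second") method is taken to be a comparison of intensity-weighted centroid depths. All function names are hypothetical, and the sketch aligns a single target image to a reference rather than a full plurality of images.

```python
from typing import List

Image = List[List[float]]  # image[z][x]: z = depth row, x = horizontal column


def column_profile(img: Image) -> List[float]:
    """Sum intensities along depth: one value per horizontal position."""
    return [sum(row[x] for row in img) for x in range(len(img[0]))]


def row_profile(img: Image) -> List[float]:
    """Sum intensities along the horizontal axis: one value per depth."""
    return [sum(row) for row in img]


def estimate_horizontal_shift(ref: Image, tgt: Image, max_shift: int = 3) -> int:
    """Assumed first method: find the integer shift s for which tgt's column
    profile at x + s best matches ref's profile at x (mean absolute difference)."""
    p_ref, p_tgt = column_profile(ref), column_profile(tgt)
    n = len(p_ref)
    best_shift, best_cost = 0, float("inf")
    for s in range(-max_shift, max_shift + 1):
        overlap = [(p_ref[x], p_tgt[x + s]) for x in range(n) if 0 <= x + s < n]
        if not overlap:
            continue
        cost = sum(abs(a - b) for a, b in overlap) / len(overlap)
        if cost < best_cost:
            best_shift, best_cost = s, cost
    return best_shift


def centroid_depth(img: Image) -> float:
    """Intensity-weighted mean depth; 0.0 for an all-zero image."""
    profile = row_profile(img)
    total = sum(profile)
    if total == 0:
        return 0.0
    return sum(z * v for z, v in enumerate(profile)) / total


def estimate_depth_shift(ref: Image, tgt: Image) -> int:
    """Assumed second method (different from the first): difference between
    intensity-weighted centroid depths, rounded to whole pixels."""
    return round(centroid_depth(tgt) - centroid_depth(ref))


def shift_image(img: Image, dx: int, dz: int) -> Image:
    """Resample so that out[z][x] = img[z + dz][x + dx], zero-filling gaps."""
    h, w = len(img), len(img[0])
    return [
        [img[z + dz][x + dx] if 0 <= z + dz < h and 0 <= x + dx < w else 0.0
         for x in range(w)]
        for z in range(h)
    ]


def align(ref: Image, tgt: Image) -> Image:
    """Align horizontally first, then in depth, each with its own method."""
    dx = estimate_horizontal_shift(ref, tgt)
    horizontally_aligned = shift_image(tgt, dx, 0)
    dz = estimate_depth_shift(ref, horizontally_aligned)
    return shift_image(horizontally_aligned, 0, dz)
```

Separating the two estimators mirrors the structure of the abstract: each direction gets its own method, so either one could be replaced without affecting the other.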