An image processing apparatus includes an SLO image acquisition unit configured to acquire a plurality of SLO images captured by an SLO apparatus that scans an imaging target with signal light at different focus positions along an optical axis direction of the signal light. The image processing apparatus further includes a structure acquisition unit configured to acquire a specific structure of the imaging target, and an object image acquisition unit configured to acquire, in accordance with the specific structure, an image of the specific structure from each of the plurality of SLO images captured at the different focus positions.
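
The object image acquisition step can be pictured as a per-pixel selection from a stack of SLO images. The following is a minimal sketch under stated assumptions, not the method disclosed for the apparatus: it assumes the specific structure is available as a per-pixel depth map, that the SLO images are co-registered grayscale images of equal size, and that the focus positions are sorted in ascending order. All names (`nearest_focus_index`, `compose_structure_image`, `slo_stack`, `focus_positions`, `structure_depth`) are hypothetical.

```python
from bisect import bisect_left
from typing import List, Sequence

Image = List[List[float]]  # row-major grayscale image (assumed representation)


def nearest_focus_index(focus_positions: Sequence[float], depth: float) -> int:
    """Return the index of the focus position closest to `depth`.

    `focus_positions` must be sorted in ascending order.
    Ties are resolved in favor of the smaller focus position.
    """
    i = bisect_left(focus_positions, depth)
    if i == 0:
        return 0
    if i == len(focus_positions):
        return len(focus_positions) - 1
    before, after = focus_positions[i - 1], focus_positions[i]
    return i - 1 if depth - before <= after - depth else i


def compose_structure_image(
    slo_stack: Sequence[Image],
    focus_positions: Sequence[float],
    structure_depth: Image,
) -> Image:
    """Build one image of the structure from SLO images at several focus positions.

    For each pixel, the value is taken from the SLO image whose focus
    position lies nearest to the structure's depth at that pixel
    (an assumed selection rule, used here only for illustration).
    """
    height = len(structure_depth)
    width = len(structure_depth[0])
    result: Image = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            k = nearest_focus_index(focus_positions, structure_depth[y][x])
            result[y][x] = slo_stack[k][y][x]
    return result


if __name__ == "__main__":
    # Three 2x2 SLO images, each with a constant value for easy inspection.
    focus = [0.0, 10.0, 20.0]
    stack = [
        [[1.0, 1.0], [1.0, 1.0]],
        [[2.0, 2.0], [2.0, 2.0]],
        [[3.0, 3.0], [3.0, 3.0]],
    ]
    depth = [[0.0, 9.0], [16.0, 25.0]]
    print(compose_structure_image(stack, focus, depth))
```

Nearest-neighbor selection is only one possible rule; interpolating between the two SLO images that bracket the structure depth would be an equally plausible variation of the same idea.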