[Problem to be solved] To accurately register two images obtained by imaging the same subject with different types of modalities to make the spatial positions of the subject in the two images conform to each other. [Solution] A first image (V1) and a second image (V2) obtained by imaging the same subject with different types of modalities are obtained. The first image (V1) is deformed, and similarity between the deformed first image (V1a) and the second image (V2) is evaluated by an evaluation function that evaluates correlation between distributions of corresponding pixel values of the two images to estimate an image deformation amount of the first image (V1). Based on the estimated image deformation amount, a deformed image of the first image (V1) is generated. The evaluation function includes a term representing a measure of correlation between a pixel value of the deformed first image (V1a) and a corresponding pixel value of the second image (V2), wherein the term evaluates the correlation based on probability information that indicates a probability of each combination of corresponding pixel values of the first image (V1) and the second image (V2).