An image processing device includes an image sequence acquisition section (200) that acquires an image sequence that includes a plurality of images, and a processing section (100) that performs an image summarization process that deletes some of the plurality of images included in the acquired image sequence to obtain a summary image sequence. The processing section (100) selects a reference image and a determination target image from the plurality of images, and determines whether or not the determination target image can be deleted based on the results of two processes: a process that utilizes deformation information between the reference image and the determination target image, and a process that utilizes a structural element that corresponds to an attention area.
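
The deletion determination described above can be illustrated with a minimal sketch. The abstract does not state how the two processes are combined, so the following is only one plausible reading: the deformation information is used to find which part of the determination target image is covered by the reference image, and the structural element is used to test whether an attention area of that size could lie entirely inside the part that is not covered. If it could, the determination target image is kept. The sketch assumes that the deformation is a pure integer translation, that both images have the same size, and that the structural element is a square. All function and parameter names (`coverage_mask`, `element_fits`, `can_delete`, `shift`, `element_size`) are hypothetical.

```python
from typing import List, Tuple

Mask = List[List[bool]]


def coverage_mask(height: int, width: int, shift: Tuple[int, int]) -> Mask:
    """Mark the target-image pixels covered by the reference image.

    Assumption: the deformation information is a translation (dy, dx)
    that maps reference-image coordinates to target-image coordinates,
    and both images are height x width.
    """
    dy, dx = shift
    return [
        [0 <= y - dy < height and 0 <= x - dx < width for x in range(width)]
        for y in range(height)
    ]


def element_fits(mask: Mask, size: int) -> bool:
    """Return True if a size x size square fits entirely inside the True region.

    This is equivalent to asking whether a binary erosion of the mask by
    a square structural element leaves any pixel.
    """
    h = len(mask)
    w = len(mask[0]) if h else 0
    for y in range(h - size + 1):
        for x in range(w - size + 1):
            if all(mask[y + j][x + i] for j in range(size) for i in range(size)):
                return True
    return False


def can_delete(height: int, width: int, shift: Tuple[int, int],
               element_size: int) -> bool:
    """Assumed criterion: the determination target image can be deleted when
    no attention-area-sized square fits in the region the reference image
    does not cover."""
    covered = coverage_mask(height, width, shift)
    uncovered = [[not c for c in row] for row in covered]
    return not element_fits(uncovered, element_size)


if __name__ == "__main__":
    # A 2-pixel shift leaves a 2-pixel-wide uncovered strip; a 3x3 element
    # cannot fit there, so under this assumed criterion the image is deletable.
    print(can_delete(8, 8, shift=(0, 2), element_size=3))
    # A 4-pixel shift leaves a 4-pixel-wide strip, where a 3x3 element fits.
    print(can_delete(8, 8, shift=(0, 4), element_size=3))
```

In this sketch, a larger structural element (a larger assumed attention area) makes deletion more permissive, because a larger uncovered region is needed before the attention area could be missed entirely.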