An image processing device includes: a first feature amount extraction unit configured to extract a first feature amount from an image; a position detection unit configured to detect observation positions from the image based on a position detection dictionary, and the first feature amount extracted from the image; a second feature amount extraction unit configured to extract a second feature amount from the observation position; an observation-order determining unit configured to determine the order of observing the observation positions based on an order generation dictionary, and respective second feature amounts of the observation positions; and an image generation unit configured to generate observation images for displaying the observation positions in the observation order based on the image, the detected observation positions and the determined observation order.