An image processing apparatus includes: a display controller configured to cause a display to display an image including a first pointer having a predetermined shape superimposed thereon and move a position of the first pointer within the image in accordance with an input position that is input from an input device; a region-of-interest setting circuit configured to set a region corresponding to the position of the first pointer within the image as a region of interest when a confirmation operation for confirming the input position is input to the input device; and a distance determining circuit configured to determine a distance between a first representative position of the first pointer and a second representative position of the region of interest within the image.
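The interaction among the three components can be illustrated with a minimal sketch. It is not the claimed implementation: the class name `PointerTracker`, its method names, the choice of the pointer centre as the "representative position" of both the pointer and the region of interest, the clamping of the pointer to the image bounds, and the use of Euclidean distance are all assumptions made for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple

Point = Tuple[float, float]


@dataclass
class PointerTracker:
    """Hypothetical stand-in for the display controller, region-of-interest
    setting circuit, and distance determining circuit."""
    width: int                          # image width in pixels
    height: int                         # image height in pixels
    pointer: Point = (0.0, 0.0)         # assumed representative position of the pointer
    roi_center: Optional[Point] = None  # assumed representative position of the ROI

    def move(self, x: float, y: float) -> None:
        # Display-controller role: follow the input position,
        # clamped to the image (the clamping is an assumption).
        cx = min(max(x, 0.0), self.width - 1)
        cy = min(max(y, 0.0), self.height - 1)
        self.pointer = (cx, cy)

    def confirm(self) -> None:
        # ROI-setting role: on a confirmation operation, the region at
        # the current pointer position becomes the region of interest.
        self.roi_center = self.pointer

    def distance(self) -> Optional[float]:
        # Distance-determining role: distance between the two
        # representative positions (Euclidean, by assumption).
        if self.roi_center is None:
            return None
        dx = self.pointer[0] - self.roi_center[0]
        dy = self.pointer[1] - self.roi_center[1]
        return math.hypot(dx, dy)


tracker = PointerTracker(width=640, height=480)
tracker.move(100, 100)
tracker.confirm()          # region of interest set at (100, 100)
tracker.move(103, 104)     # pointer continues to follow the input
print(tracker.distance())  # 5.0
```

In this sketch the distance is undefined (`None`) until a confirmation operation has set a region of interest, after which it is recomputed from the current pointer position on each call.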