A method of processing a sequence of images of a retina acquired by an ophthalmic device to generate retinal position tracking information indicative of retina movement during acquisition. The method includes (i) receiving one or more images of the retina; (ii) calculating a cross-correlation between a reference image and an image based on the received image(s) to acquire an offset between the image and reference image; and repeating processes (i) and (ii) to acquire, as the tracking information, respective offsets for images that are based on the respective received image(s). Another step includes modifying the reference image during the repeating, by determining a measure of similarity between correspondingly located regions of pixels in two or more received images and accentuating features in the reference image representing structures of the imaged retina in relation to other features in the reference image based on the determined measure of similarity.