A 3D ultrasound image from a memory (20) is compared with a 3D diagnostic image from a memory (12) by a localizer and registration unit (30) which determines a baseline transform (Tbase) which registers the 3D diagnostic and ultrasound volume images. The target region continues to be examined by an ultrasound scanner (22) which generates a series of real-time 2D or 3D ultrasound or other lower resolution images. The localizer and registration unit (30) compares one or a group of the 2D ultrasound images with the 3D ultrasound image to determine a motion correction transform (Tmotion). An image adjustment processor or program (32) operates on the 3D diagnostic volume image with the baseline transform (Tbase) and the motion correction transform (Tmotion), to generate a motion corrected image that is displayed on an appropriate display (74).