In a medical image display device, setting means sets first reference information for starting extraction of a desired region on a medical image displayed on display means, and second reference information for terminating the region extraction. Control means generates mark information indicating execution information on a region extraction process in the direction from the first reference information to the second reference information set by the setting means, and controls the display so that the generated mark information is correlated with the medical image. For example, when the generated mark information is shifted and set again through the setting means, the control means modifies the size and the shape of target region information according to the shifted mark information.
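
The relationship between the two pieces of reference information, the mark information, and the extracted region can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes a 2D grayscale image stored as nested lists, treats the mark as an arrow from a start point to an end point, and substitutes simple intensity-based region growing for the unspecified extraction process. The names `Mark`, `extract_region`, and `tolerance` are hypothetical and do not appear in the source.

```python
from collections import deque
from dataclasses import dataclass
from typing import List, Set, Tuple

Point = Tuple[int, int]  # (row, col)


@dataclass(frozen=True)
class Mark:
    """Hypothetical mark information: an arrow from start to end."""
    start: Point  # first reference information (extraction starts here)
    end: Point    # second reference information (extraction stops here)


def extract_region(image: List[List[int]], mark: Mark,
                   tolerance: int = 10) -> Set[Point]:
    """Grow a region from mark.start toward mark.end.

    A pixel is accepted if it is 4-connected to the start point, its
    intensity is within `tolerance` of the start pixel, and its projection
    onto the arrow lies between the start and the end of the arrow.
    """
    (r0, c0), (r1, c1) = mark.start, mark.end
    dr, dc = r1 - r0, c1 - c0
    length_sq = dr * dr + dc * dc
    seed_value = image[r0][c0]
    rows, cols = len(image), len(image[0])

    region: Set[Point] = set()
    queue = deque([mark.start])
    while queue:
        r, c = queue.popleft()
        if (r, c) in region or not (0 <= r < rows and 0 <= c < cols):
            continue
        # Unnormalized projection: 0 at the start point, length_sq at the end.
        proj = (r - r0) * dr + (c - c0) * dc
        if proj < 0 or proj > length_sq:
            continue
        if abs(image[r][c] - seed_value) > tolerance:
            continue
        region.add((r, c))
        queue.extend([(r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)])
    return region


# A bright horizontal structure (value 100) on a dark background (value 0).
image = [
    [0] * 8,
    [100] * 8,
    [0] * 8,
]

mark = Mark(start=(1, 0), end=(1, 3))
print(len(extract_region(image, mark)))     # 4

# Shifting the end of the mark changes the size of the extracted region.
shifted = Mark(start=(1, 0), end=(1, 6))
print(len(extract_region(image, shifted)))  # 7
```

In this sketch, moving the end of the arrow and re-running the extraction is what changes the size and shape of the target region. A real device would additionally draw the arrow over the displayed image.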