A medical image processing device includes a processor configured to: acquire a first captured image, captured based on light from an observation target irradiated with light in a first wavelength band, the observation target emitting fluorescence upon being irradiated with excitation light in a second wavelength band; acquire a second captured image, captured based on the excitation light and the fluorescence from the observation target irradiated with the excitation light; specify, in the first captured image, a target area in which a pixel level is not lower than a predetermined threshold; and generate a superimposed image in which the area at the same position as the target area includes only the pixels in the target area in the first captured image, out of those pixels and the pixels in the area at the same position as the target area in the second captured image.
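
The thresholding and superimposition steps can be illustrated with a minimal sketch. It is not the claimed implementation: the function names, the grayscale list-of-rows image representation, and the `alpha` parameter are hypothetical. The abstract does not say how pixels outside the target area are combined, so the alpha blend used there is purely an assumption for illustration.

```python
from typing import List

# Hypothetical representation: a grayscale image as rows of pixel levels (0-255).
Image = List[List[int]]


def specify_target_area(first: Image, threshold: int) -> List[List[bool]]:
    """Mark pixels of the first image whose level is not lower than the threshold."""
    return [[level >= threshold for level in row] for row in first]


def generate_superimposed(
    first: Image, second: Image, threshold: int, alpha: float = 0.5
) -> Image:
    """Combine the two captured images into one superimposed image.

    Inside the target area only the first image's pixel is used.
    Outside it, an alpha blend is ASSUMED; the abstract does not specify this.
    """
    if len(first) != len(second) or any(
        len(rf) != len(rs) for rf, rs in zip(first, second)
    ):
        raise ValueError("both captured images must have the same dimensions")

    mask = specify_target_area(first, threshold)
    result: Image = []
    for row_first, row_second, row_mask in zip(first, second, mask):
        out_row = []
        for f, s, in_target in zip(row_first, row_second, row_mask):
            if in_target:
                # Target area: keep only the first captured image's pixel.
                out_row.append(f)
            else:
                # Assumed behaviour elsewhere: blend first and second images.
                out_row.append(round((1 - alpha) * f + alpha * s))
        result.append(out_row)
    return result
```

For example, with `first = [[250, 100], [40, 255]]`, `second = [[200, 60], [80, 0]]`, and a threshold of 240, the two pixels at levels 250 and 255 form the target area and are copied unchanged from the first image, while the remaining two pixels are blended.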