An image processing device captures a normal light image of an observed region via a plurality of color filters having different spectral characteristics, respectively, generates a uniform image having low contrast by extracting each picture signal corresponding to a picture signal in a wavelength band for which light absorption characteristic of a contrast region in the observed region becomes low, and corrects each picture signal of a fluorescence image of the observed region by using the uniform image.