A plurality of sets is held, each set including part information representing a human body part and a schema image that is a schematic view of that body part. A captured image representing the interior of a patient's body is acquired. Hierarchical structure information is then acquired, which contains part information globally representing the part corresponding to a designated portion on the captured image and part information locally representing that same part. A set including part information contained in the hierarchical structure information is acquired, and the schema image included in the acquired set is output.
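
The lookup described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the names `SchemaSet`, `HELD_SETS`, and `find_schema`, the part labels, and the image file names are all hypothetical. The sketch also assumes that the hierarchical structure information is an ordered list running from the global part to the local part, and that the most local part with a held set is preferred; the abstract does not state which matching set is chosen.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class SchemaSet:
    """One held set: part information plus its schema image (hypothetical layout)."""
    part: str          # part information, e.g. "lung"
    schema_image: str  # identifier or path of the schematic view (illustrative)


# Held sets; the values are illustrative only.
HELD_SETS = [
    SchemaSet("chest", "schema_chest.png"),
    SchemaSet("lung", "schema_lung.png"),
]


def find_schema(hierarchy: Sequence[str],
                held_sets: Sequence[SchemaSet]) -> Optional[str]:
    """Return the schema image of a held set whose part appears in the hierarchy.

    `hierarchy` is assumed to be ordered from global to local part information.
    Assumption: the most local part that has a held set wins.
    """
    by_part = {s.part: s for s in held_sets}
    for part in reversed(hierarchy):  # try the local part first, then more global ones
        match = by_part.get(part)
        if match is not None:
            return match.schema_image
    return None  # no held set matches any part in the hierarchy


# Hierarchical structure information for a designated portion (assumed values).
hierarchy = ["chest", "lung", "right upper lobe"]
selected = find_schema(hierarchy, HELD_SETS)
```

In this sketch no set is held for "right upper lobe", so the lookup falls back to the next more global part, "lung". Preferring the most global match instead would only require iterating over `hierarchy` in its original order.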