With the object of enabling a subject to be imaged at an appropriate timing of shooting, an image diagnosing apparatus for shooting an image of the subject in an imaging space has a voice guidance unit which reproduces and outputs to the subject a prescribed voice guidance and a voice output control unit which causes the output timing of the voice guidance outputted from the voice guidance unit to correspond with the timing of shooting the image of the subject.