This device (2) for generating ultrasonic waves in a target region of a soft solid includes at least two ultrasound sources (32), light sources (40) distributed around a central axis (X2) of the device (2) for illuminating a zone of the soft solid via subsurface scattering, and a video camera (50) for capturing images of the zone illuminated by the light sources (40). The ultrasound sources (32), the light sources (40) and the video camera (50) are mounted on a body (20) of the device (2) and oriented toward a common target zone which includes a focal point of the ultrasound sources (32). A boresight of the video camera (50) is aligned with the central axis (X2).