The present disclosure provides for a system that is adapted to simultaneously display photoacoustic and ultrasound images of the same object. An image combiner can perform spatial and temporal interpolation of the two images before generating a combined image. The combined image is then displayed on a display such as an LCD or CRT. The system is able to use motion estimates obtained from the ultrasound data to enhance the photoacoustic image thereby increasing its apparent frame rate, registering consecutive frames in order to reduce artifacts. The system is capable of generating combined ultrasound and photoacoustic images which are registered spatially and temporally.