Apparatus, systems, and methods to capture and combine patient vitals and image data are disclosed. An example apparatus includes a video capturing device or an audio receiving device, a vitals data manager, and a vitals aggregator. The video capturing device captures visual vital information of a patient from a vital monitor during an imaging procedure. The audio receiving device captures audible vital information of the patient during the imaging procedure. The vitals data manager receives the captured visual vital information or the captured audible vital information, which is tagged with an identifier of the patient to form tagged vitals information. The vitals aggregator receives the tagged vitals information and an image associated with the patient, and organizes the tagged vitals information with the image to form a composite image.
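
As an illustration only, the data flow described above (capture, tagging with a patient identifier, aggregation with an image) might be sketched as follows. Every name here (`TaggedVitals`, `PatientImage`, `CompositeImage`, `tag_vitals`, `aggregate`) is hypothetical and not taken from the disclosure, and matching vitals to the image by patient identifier is an assumption about how the organizing step could work.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class TaggedVitals:
    """Captured vitals tagged with a patient identifier (hypothetical type)."""
    patient_id: str
    source: str                 # "visual" (vital monitor video) or "audible"
    readings: Dict[str, float]  # e.g. {"heart_rate": 72.0}


@dataclass(frozen=True)
class PatientImage:
    """An image associated with a patient (hypothetical type)."""
    patient_id: str
    pixels: bytes


@dataclass
class CompositeImage:
    """The image organized together with its tagged vitals (hypothetical type)."""
    patient_id: str
    pixels: bytes
    vitals: List[TaggedVitals] = field(default_factory=list)


def tag_vitals(patient_id: str, source: str,
               readings: Dict[str, float]) -> TaggedVitals:
    """Stand-in for the vitals data manager: form tagged vitals information."""
    if source not in ("visual", "audible"):
        raise ValueError(f"unknown vitals source: {source!r}")
    # Copy the readings so later changes by the caller do not leak in.
    return TaggedVitals(patient_id, source, dict(readings))


def aggregate(image: PatientImage,
              tagged: List[TaggedVitals]) -> CompositeImage:
    """Stand-in for the vitals aggregator: form a composite image.

    Assumption: only vitals whose patient identifier matches the image
    are attached; the source does not specify the matching rule.
    """
    matching = [v for v in tagged if v.patient_id == image.patient_id]
    return CompositeImage(image.patient_id, image.pixels, matching)
```

In this sketch, calling `aggregate` with an image for one patient and a mixed list of tagged vitals yields a composite that carries only that patient's vitals alongside the unchanged image data.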