A system for tracking biomarkers in subjects. In one embodiment, the biomarker tracking system has a sensory array including an RGB-D camera or RGB camera, a memory, and an electronic processor. The microphone captures voice data, including but not limited to tremor detection data, speech volume and pronunciation data, speech strength data, changes in tonality, hesitance in voice, and changes in speed or verbiage. A stored baseline biomarker model may comprise a voice data profile which may be pre-stored in the memory of a server and include a plurality of benchmarks. This electronic processor is configured to use this pre-stored voice data and compare it to the voice data captured with the microphone. The electronic processor is further configured to determine a set of attributes for the voice data, and generates a speech data deviation model based, at least in part, on the comparison of the speech data to the stored baseline biomarker model.