What is disclosed is a system and method for estimating cardiac pulse rate from a video of a subject being monitored for cardiac function. In one embodiment, batches of overlapping image frames are continuously received and processed by isolating regions of exposed skin. Pixels of the isolated regions are processed to obtain a time-series signal per region and a physiological signal is extracted from each region's time-series signals. The physiological signal is processed to obtain a cardiac pulse rate for each region. The cardiac pulse rate for each region is compared to a last good cardiac pulse rate from a previous batch to obtain a difference. If the difference exceeds a threshold, the cardiac pulse rate is discarded. Otherwise, it is retained. Once all the regions have been processed, the retained cardiac pulse rate with a minimum difference becomes the good cardiac pulse rate for comparison on a next iteration.