Systems and methods for assessing the visual acuity of person using a computerized consumer device are described. The approach involves determining a separation distance between a human user and the consumer device based on an image size of a physical feature of the user, instructing the user to adjust the separation between the user and the consumer device until a predetermined separation distance range is achieved, presenting a visual acuity test to the user including displaying predetermined optotypes for identification by the user, recording the user's spoken identifications of the predetermined optotypes and providing real-time feedback to the user of detection of the spoken indications by the consumer device, carrying out voice recognition on the spoken identifications to generate corresponding converted text, comparing recognized words of the converted text to permissible words corresponding to the predetermined optotypes, determining a score based on the comparison, and determining whether the person passed the visual acuity test.