Generally, a method performed by one or more processing devices includes generating a graphical user interface that when rendered on a display of the one or more processing devices renders a visual representation of an environment and a visual representation of an object in the environment; retrieving an auditory stimulus with one or more auditory attributes indicative of a location of a virtual target in the environment; receiving information specifying movement of the object in the environment; determining, based on the movement of the object, a proximity of the object to the virtual target; adjusting, based on the proximity, one or more values of the one or more auditory attributes of the auditory stimulus; and causing the one or more processing devices to play the auditory stimulus using the adjusted one or more values.