Systems, devices, and methods are provided for remote interaction with a subject in an environment. The device has audio-visual recording and transmitting functionality to provide an operator at a remote location with an audio-visual feed of the environment near the device. The device also has a light emission component which the operator controls and which projects light onto a surface in the environment in the vicinity of the device. The systems, devices, and methods provide operators with the ability to control the positions of the light emission by tracking movement at a remote device at the remote location.