A server receives a request from a client device, the request being initiated by a user operating the client device. The server determines a user action identifier (ID) based on the request, the user action ID identifying a physical action of the user that was captured by one or more sensors. One or more image processing commands are determined based on the user action ID in view of a first medical image currently displayed at the client device. An image processing operation is performed on the first medical image by executing the one or more image processing commands, thereby generating a second medical image. The second medical image is transmitted to the client device to be presented to the user.
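
The flow described above can be illustrated with a minimal sketch. It is not the claimed implementation: the request fields (`client_id`, `action_id`), the action IDs (`FIST`, `SWIPE_LEFT`, `SPREAD_HANDS`), the command names, and the use of the image modality as the "in view of the displayed image" context are all hypothetical, and a small grid of grayscale integers stands in for a medical image.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Pixels = List[List[int]]  # rows of grayscale values; a stand-in for real image data


@dataclass(frozen=True)
class DisplayedImage:
    image_id: str
    modality: str  # hypothetical context attribute, e.g. "XRAY" or "CT"
    pixels: Pixels


def invert(pixels: Pixels) -> Pixels:
    """Invert 8-bit grayscale values."""
    return [[255 - p for p in row] for row in pixels]


def flip_horizontal(pixels: Pixels) -> Pixels:
    """Mirror each row left to right."""
    return [row[::-1] for row in pixels]


def zoom_in(pixels: Pixels) -> Pixels:
    """Scale up 2x by repeating every pixel and every row."""
    out: Pixels = []
    for row in pixels:
        wide = [p for p in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


# Hypothetical registry of image processing commands.
COMMANDS: Dict[str, Callable[[Pixels], Pixels]] = {
    "invert": invert,
    "flip_horizontal": flip_horizontal,
    "zoom_in": zoom_in,
}

# Hypothetical mapping: (user action ID, modality) -> command names.
# "*" matches any modality not listed explicitly.
ACTION_TABLE: Dict[Tuple[str, str], List[str]] = {
    ("SWIPE_LEFT", "*"): ["flip_horizontal"],
    ("SPREAD_HANDS", "*"): ["zoom_in"],
    ("FIST", "XRAY"): ["invert"],
}


def determine_commands(action_id: str, first: DisplayedImage) -> List[str]:
    """Pick commands for the action, taking the displayed image into account."""
    specific = ACTION_TABLE.get((action_id, first.modality))
    if specific is not None:
        return specific
    return ACTION_TABLE.get((action_id, "*"), [])


def handle_request(request: Dict[str, str],
                   displayed: Dict[str, DisplayedImage]) -> DisplayedImage:
    """Server-side handling: request -> action ID -> commands -> second image."""
    action_id = request["action_id"]          # assumed to arrive in the request
    first = displayed[request["client_id"]]   # image currently shown at the client
    pixels = first.pixels
    for name in determine_commands(action_id, first):
        pixels = COMMANDS[name](pixels)
    second = DisplayedImage(first.image_id + "-derived", first.modality, pixels)
    displayed[request["client_id"]] = second  # the client now displays the result
    return second                             # would be transmitted to the client


if __name__ == "__main__":
    shown = {"c1": DisplayedImage("img-1", "XRAY", [[0, 100], [200, 255]])}
    result = handle_request({"client_id": "c1", "action_id": "FIST"}, shown)
    print(result.pixels)
```

Keeping the action-to-command mapping in a table separate from the command implementations is one possible design; it lets the same gesture resolve to different commands depending on what is currently displayed, and an unrecognized action leaves the image unchanged.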