In one embodiment, a processor accesses sensor input data received from one or more sensors. The sensor input data represents one or more gestures. The processor determines, based on the sensor input data representing the one or more gestures, action data representing an action to be performed by a robot. The action includes physical movements of the robot. The processor evaluates the action data representing the action to be performed by the robot in light of evaluation data.
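The three steps of this embodiment (accessing sensor input data, determining action data, and evaluating the action data) can be sketched as follows. This is a minimal illustration only, not the claimed implementation. All names (`SensorInput`, `Action`, `EvaluationData`, `GESTURE_TO_ACTION`, `determine_action`, `evaluate_action`) are hypothetical. The gesture labels, the lookup-table mapping, and the assumption that the evaluation data is a set of disallowed movements are invented for the example, since the embodiment does not specify the form of either the determination or the evaluation.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class SensorInput:
    """Sensor input data representing one gesture (hypothetical structure)."""
    sensor_id: str
    gesture: str  # assumed: the gesture has already been labeled upstream


@dataclass(frozen=True)
class Action:
    """Action data: a named action composed of physical movements of the robot."""
    name: str
    movements: Tuple[str, ...]


@dataclass(frozen=True)
class EvaluationData:
    """Assumed form of evaluation data: movements the robot must not perform."""
    forbidden_movements: frozenset


# Hypothetical gesture-to-action table; a real system might use a learned model.
GESTURE_TO_ACTION = {
    "wave": Action("greet", ("raise_arm", "rotate_wrist")),
    "point_left": Action("move_left", ("turn_left", "drive_forward")),
}


def determine_action(inputs: Sequence[SensorInput]) -> Optional[Action]:
    """Return the action for the first recognized gesture, or None if none match."""
    for item in inputs:
        action = GESTURE_TO_ACTION.get(item.gesture)
        if action is not None:
            return action
    return None


def evaluate_action(action: Action, evaluation: EvaluationData) -> bool:
    """Return True if no movement of the action is forbidden by the evaluation data."""
    return not any(m in evaluation.forbidden_movements for m in action.movements)


if __name__ == "__main__":
    inputs = [SensorInput(sensor_id="cam0", gesture="wave")]
    action = determine_action(inputs)
    if action is not None:
        allowed = evaluate_action(action, EvaluationData(frozenset({"drive_forward"})))
        print(action.name, "allowed" if allowed else "rejected")
```

Keeping the determination and the evaluation as separate functions mirrors the structure of the embodiment, in which the action data is first derived from the gestures and only then evaluated in light of the evaluation data.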