In an exemplary embodiment, a system includes a first camera, a second camera, and a processor, wherein the second camera has a higher resolution than the first camera. The processor is communicatively coupled to the first camera and the second camera and is operable to determine a center coordinate of an udder of a dairy livestock based at least in part upon visual data captured by the first camera. The processor is further operable to determine a position of a teat of the dairy livestock based at least in part upon the center coordinate and upon visual data captured by the second camera.
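
By way of illustration only, the two-stage determination described above may be sketched as follows. The sketch assumes that the visual data from the first camera has already been reduced to a binary udder mask, that the second camera yields a list of candidate teat coordinates, and that the two cameras differ only by a uniform scale factor. The function names `udder_center` and `teat_position`, the `scale` and `max_dist` parameters, and the centroid and nearest-candidate techniques are hypothetical choices made for this example and are not taken from the embodiment.

```python
import math
from typing import List, Optional, Tuple

Point = Tuple[float, float]


def udder_center(mask: List[List[int]]) -> Optional[Point]:
    """Return the centroid (x, y) of the nonzero pixels in a low-resolution
    mask derived from the first camera, or None if the mask is empty.
    (Assumption: a centroid is used as the center coordinate.)"""
    sum_x = sum_y = count = 0
    for y, row in enumerate(mask):
        for x, value in enumerate(row):
            if value:
                sum_x += x
                sum_y += y
                count += 1
    if count == 0:
        return None
    return (sum_x / count, sum_y / count)


def teat_position(center: Point,
                  candidates: List[Point],
                  scale: float,
                  max_dist: float) -> Optional[Point]:
    """Map the center coordinate into the second camera's pixel grid and
    return the candidate closest to it, ignoring candidates farther than
    max_dist. (Assumption: the cameras are related by a uniform scale.)"""
    cx, cy = center[0] * scale, center[1] * scale
    best: Optional[Point] = None
    best_dist = max_dist
    for x, y in candidates:
        dist = math.hypot(x - cx, y - cy)
        if dist <= best_dist:
            best, best_dist = (x, y), dist
    return best


# Hypothetical data: a 4x4 mask from the first camera and three candidate
# detections from the second camera, which has 10x the linear resolution.
mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
center = udder_center(mask)  # (1.5, 1.5)
teat = teat_position(center, [(100.0, 100.0), (18.0, 19.0), (40.0, 15.0)],
                     scale=10.0, max_dist=10.0)  # (18.0, 19.0)
```

In this sketch the lower-resolution data serves only to narrow the search region, so that the higher-resolution data need be examined only near the center coordinate.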