A method and system for detecting anatomic landmarks in medical images are disclosed. In order to detect multiple related anatomic landmarks, a plurality of landmark candidates are first detected individually using trained landmark detectors. A joint context is then generated for each combination of the landmark candidates. The best combination of landmarks is then determined based on the joint context using a trained joint context detector.
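
The selection step described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the function `best_landmark_combination`, the 2D point representation, and the scorer `toy_joint_score` are all hypothetical names introduced for illustration. The sketch assumes the individual landmark detectors have already produced a candidate list per landmark, and it replaces the trained joint context detector with a simple hand-written score that prefers two landmarks about 10 units apart.

```python
import math
from itertools import product
from typing import Callable, Sequence, Tuple

Point = Tuple[float, float]  # assumption: candidates are 2D image positions


def best_landmark_combination(
    candidates_per_landmark: Sequence[Sequence[Point]],
    joint_score: Callable[[Tuple[Point, ...]], float],
) -> Tuple[Tuple[Point, ...], float]:
    """Return the candidate combination with the highest joint score."""
    if not candidates_per_landmark or any(len(c) == 0 for c in candidates_per_landmark):
        raise ValueError("every landmark needs at least one candidate")

    best_combo: Tuple[Point, ...] = ()
    best_score = float("-inf")
    # Enumerate one candidate per landmark (Cartesian product of the lists).
    for combo in product(*candidates_per_landmark):
        # joint_score stands in for the trained joint context detector.
        score = joint_score(combo)
        if score > best_score:
            best_combo, best_score = combo, score
    return best_combo, best_score


def toy_joint_score(combo: Tuple[Point, ...]) -> float:
    """Hypothetical scorer: prefers two landmarks roughly 10 units apart."""
    (x1, y1), (x2, y2) = combo
    return -abs(math.hypot(x2 - x1, y2 - y1) - 10.0)


# Two landmarks, each with two individually detected candidates.
candidates = [
    [(0.0, 0.0), (5.0, 5.0)],
    [(0.0, 10.0), (30.0, 30.0)],
]
combo, score = best_landmark_combination(candidates, toy_joint_score)
print(combo, score)
```

Exhaustive enumeration is used here only for clarity; the number of combinations grows multiplicatively with the number of candidates per landmark, so a practical system would likely keep only a few top candidates per landmark before scoring combinations.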