Disclosed is a method and system for three-dimensional tracking of a target located within a body, the method performed using at least one processing system. A two-dimensional scanned image of the body including the target is processed to obtain a two-dimensional image of the target. A first present dataset of the target is predicted using a previous dataset of the target and a state transition model, the first present dataset includes a three- dimensional present position value of the target. A second present dataset of the target is measured by template-matching of the two-dimensional image of the target with a model of the target. A third present dataset of the target is estimated by statistical inference using the first present dataset and the second present dataset. The previous dataset of the target is updated to match the third present dataset.