An ultrasound diagnosis apparatus according to an embodiment includes processing circuitry. The processing circuitry acquires, in time series, a plurality of frames representing ultrasound images, starting with an initial frame. The processing circuitry compares a current frame with a previous frame preceding the current frame to determine the similarity between them, and generates a reference frame by applying weighting processing to the initial frame and the previous frame using a result of the comparison. The processing circuitry performs tracking processing between the reference frame and the current frame.
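
As a non-limiting illustration, the processing described above may be sketched as follows. The abstract does not specify the similarity measure, the weighting rule, or the tracking method, so every choice here is an assumption made only for this sketch: similarity is taken as a zero-mean normalized cross-correlation clipped to [0, 1], the reference frame is taken as a linear blend in which higher similarity gives more weight to the previous frame, and tracking is taken as exhaustive block matching by sum of squared differences. The names `similarity`, `make_reference`, and `track`, and the parameters `half` and `search`, are hypothetical.

```python
import numpy as np


def similarity(a, b):
    """Assumed measure: zero-mean normalized cross-correlation, clipped to [0, 1]."""
    a = a.astype(np.float64) - a.mean()
    b = b.astype(np.float64) - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    if denom == 0.0:
        return 0.0  # a constant frame carries no structure to compare
    return float(np.clip((a * b).sum() / denom, 0.0, 1.0))


def make_reference(initial, previous, current):
    """Blend the initial and previous frames using the comparison result.

    Assumed rule: w = similarity(current, previous);
    reference = w * previous + (1 - w) * initial.
    """
    w = similarity(current, previous)
    return w * previous.astype(np.float64) + (1.0 - w) * initial.astype(np.float64)


def track(reference, current, center, half=4, search=3):
    """Assumed tracker: exhaustive block matching by sum of squared differences.

    Returns the displacement (dy, dx) of the patch centered at `center` in
    `reference` that best matches `current`. `center` is assumed to lie at
    least `half` pixels from every border.
    """
    cy, cx = center
    cur = current.astype(np.float64)
    tmpl = reference[cy - half:cy + half + 1, cx - half:cx + half + 1]
    best_ssd, best_d = None, (0, 0)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            y0, x0 = cy + dy - half, cx + dx - half
            if y0 < 0 or x0 < 0:
                continue  # candidate window would leave the frame
            cand = cur[y0:y0 + 2 * half + 1, x0:x0 + 2 * half + 1]
            if cand.shape != tmpl.shape:
                continue  # candidate window is cut off at the far border
            ssd = ((cand - tmpl) ** 2).sum()
            if best_ssd is None or ssd < best_ssd:
                best_ssd, best_d = ssd, (dy, dx)
    return best_d
```

In this sketch, a point would be tracked by calling `make_reference(initial, previous, current)` once per incoming frame and passing the result to `track` together with the current frame. Under the assumed blend, a current frame that closely resembles the previous frame yields a reference dominated by the previous frame, while a dissimilar current frame pulls the reference back toward the initial frame.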