The present invention relates to a method of estimating the motion of a target object from a plurality of images. The method includes: a) selecting consecutive first and second images from the plurality of images; b) decomposing each of the first and second images by n levels into a plurality of sub-images based on the frequency components of the respective image, wherein n is a positive integer; c) selecting first and second sub-images of low frequency components from the plurality of sub-images; d) setting a feature pixel in the second sub-image; e) selecting an image block containing the feature pixel and a predetermined number of neighborhood pixels of the feature pixel; f) selecting a reference region from the first sub-image by comparing the image block with the first sub-image; g) calculating displacements between pixels of the reference region and pixels of the image block; h) storing the calculated displacements; i) performing 1-level composition for the decomposed images; j) repeatedly performing the steps c) to i) until the decomposed images become 1-level decomposed images; and k) estimating the motion of the target object based on the stored displacements.
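Steps a) to k) can be illustrated with a minimal coarse-to-fine sketch. It is not the claimed implementation and rests on several assumptions the abstract does not state: the low-frequency sub-image is approximated by 2x2 block averaging (a Haar-like low-pass), the block comparison uses the sum of absolute differences, the feature pixel is supplied by the caller, moving to the next finer low-pass image stands in for the 1-level composition, and the final estimate is the 1-level displacement rescaled to full resolution. All function and parameter names (`lowpass_pyramid`, `match_block`, `estimate_motion`, `half`, `search`) are hypothetical.

```python
import numpy as np

def lowpass_pyramid(image, n):
    """Return low-frequency sub-images for levels 1..n (assumed: 2x2 block means)."""
    levels = []
    current = np.asarray(image, dtype=float)
    for _ in range(n):
        h, w = current.shape
        current = current[:h - h % 2, :w - w % 2]  # crop to even size
        current = current.reshape(current.shape[0] // 2, 2,
                                  current.shape[1] // 2, 2).mean(axis=(1, 3))
        levels.append(current)
    return levels

def match_block(first, second, feature, half, search, guess):
    """Steps e) to g): compare the block around `feature` in `second` with
    candidate regions of `first`; return the displacement (dy, dx) with the
    lowest sum of absolute differences (assumed similarity measure)."""
    fy, fx = feature
    size = 2 * half + 1
    block = second[fy - half:fy + half + 1, fx - half:fx + half + 1]
    best_cost, best_d = np.inf, guess
    for dy in range(guess[0] - search, guess[0] + search + 1):
        for dx in range(guess[1] - search, guess[1] + search + 1):
            y0, x0 = fy - dy - half, fx - dx - half  # top-left of candidate region
            if (y0 < 0 or x0 < 0 or
                    y0 + size > first.shape[0] or x0 + size > first.shape[1]):
                continue  # candidate region falls outside the first sub-image
            cost = np.abs(first[y0:y0 + size, x0:x0 + size] - block).sum()
            if cost < best_cost:
                best_cost, best_d = cost, (dy, dx)
    return best_d

def estimate_motion(first, second, feature, n=3, half=2, search=2):
    """Coarse-to-fine estimate of the motion from `first` to `second` at `feature`.
    The caller must pick `feature` far enough from the border that the block
    fits inside the coarsest sub-image."""
    pyr1 = lowpass_pyramid(first, n)    # step b) and c) for the first image
    pyr2 = lowpass_pyramid(second, n)   # step b) and c) for the second image
    stored = []                         # step h): displacements per level
    d = (0, 0)
    for level in range(n, 0, -1):       # step j): coarsest level down to level 1
        scale = 2 ** level
        f = (feature[0] // scale, feature[1] // scale)  # step d) at this level
        d = match_block(pyr1[level - 1], pyr2[level - 1], f, half, search, d)
        stored.append((level, d))
        if level > 1:
            d = (2 * d[0], 2 * d[1])    # initial guess for the next finer level
    # step k): rescale the level-1 displacement to full-resolution pixels
    return (2 * d[0], 2 * d[1]), stored

# Usage on a synthetic pair: the second image is the first shifted cyclically.
rng = np.random.default_rng(0)
img1 = rng.random((64, 64))
img2 = np.roll(img1, (8, -8), axis=(0, 1))
motion, per_level = estimate_motion(img1, img2, feature=(32, 32))
```

Searching only a small window around the doubled coarse displacement is the design choice that makes the multi-level scheme cheaper than a full-resolution search: large motions are found at the coarsest level, where they span only a few pixels, and each finer level only corrects the estimate.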