Provided is an image acquisition apparatus for acquiring a 3D retinal image with high resolution, which is capable of reducing a time period required for data transmission. In the image acquisition apparatus: a blink of a subject is detected acquisition of image taking data is suspended thereafter until a line of sight becomes stable and the data transmission to a computer is started at a timing at which a blink has been detected, thereby avoiding acquiring unnecessary data, allowing a capacity of a data buffer to be smaller, and making the data transmission efficient.