An image forming apparatus has acoustic transducers and an image processing unit that calculates the intensity of the acoustic waves emitted from respective regions inside a subject by processing the received signals output from the acoustic transducers using a Fourier-domain method. The image processing unit includes: a coefficient memory that stores coefficients computed in advance, each coefficient being a value determined only by the position of the acoustic transducer, the position of the region, and the time of receipt of the acoustic wave; a multiplier unit that multiplies the received signal of each acoustic transducer by the corresponding coefficient; and a voxel memory that accumulates the multiplication results of the multiplier unit for each region.
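The multiply-and-accumulate data path described above can be illustrated with a minimal software sketch. It is not the apparatus's actual implementation: the function name `reconstruct`, the dense table layout `coefficients[i][t][v]` (transducer index, receive-time index, region index), and the toy values are all assumptions made for illustration. How the coefficients are derived from the Fourier-domain method is not shown; the table is simply taken as precomputed.

```python
def reconstruct(signals, coefficients):
    """Accumulate coefficient-weighted samples into a voxel memory.

    signals[i][t]         -- sample from transducer i at receive-time index t
    coefficients[i][t][v] -- precomputed coefficient (hypothetical layout) that
                             depends only on transducer i, time t and region v
    Returns a list holding one accumulated value per region (voxel).
    """
    num_voxels = len(coefficients[0][0])
    voxel_memory = [0.0] * num_voxels  # stands in for the voxel memory

    for i, channel in enumerate(signals):
        for t, sample in enumerate(channel):
            row = coefficients[i][t]  # lookup in the coefficient memory
            for v in range(num_voxels):
                # multiplier unit followed by per-region accumulation
                voxel_memory[v] += row[v] * sample
    return voxel_memory


# Toy data (made up): 2 transducers, 2 time samples, 3 regions.
signals = [[1.0, 2.0], [0.5, -1.0]]
coefficients = [
    [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]],  # transducer 0, t = 0 and t = 1
    [[2.0, 0.0, 0.0], [0.0, 0.0, 1.0]],  # transducer 1, t = 0 and t = 1
]
image = reconstruct(signals, coefficients)
print(image)  # [2.0, 2.0, 0.5]
```

Because each coefficient depends only on the transducer position, the region position, and the receive time, the table can be computed once and reused for every acquisition, leaving only a multiplication and an addition per sample and region at run time.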