Provided is an optical tomographic imaging apparatus that is capable of shortening a period of time of focusing at multiple focus positions when images split in a depth direction are obtained by zone focusing. The optical tomographic imaging apparatus includes: a focus position setting device for splitting a zone within a predetermined imaging depth range into multiple focus zones so as to set multiple focus positions; a reference position setting device for setting at least two reference positions in an imaging depth direction within the predetermined imaging depth range; and a focus controlling device for performing control so as to perform focusing at the multiple focus positions sequentially based on focus position information generated by the focus position setting device and a focus condition of in-focus at the at least two reference positions set in advance by the reference position setting device.