Provided is a technology for extracting an image of a target plane from 2D or 3D image data acquired by a medical imaging apparatus with a small amount of computation and at high speed. A plane of a target plane including a predetermined structure is extracted from image data of a subject. A region of the predetermined structure included in the plane is detected by applying a learning model learned using learning data including a target plane for learning including an image of the structure and a region-of-interest plane for learning obtained by cutting out and enlarging a partial region including the structure in the target plane for learning to a plurality of planes obtained from the image data, and the plane of the target plane is extracted based on the detected region of the predetermined structure.