An endoscope system includes: at least one image sensor configured to generate multiple pieces of image data in which acquisition areas of an object image are at least partially different from each other, or multiple pieces of image data having a disparity with regard to an identical object; a first processor configured to combine the pieces of image data to generate a single piece of combined image data; a second processor configured to execute image processing on the combined image data, and generate display image data to be presented on a display based on the combined image data on which the image processing has been executed. The second processor is disposed inside a predetermined casing, the first processor is disposed outside the predetermined casing, and the combined image data generated by the first processor is transmitted to the predetermined casing.