A device for capturing a stereoscopic image includes a first image sensor including a first light-sensitive layer for capturing a first image, a second image sensor including a second light-sensitive layer for capturing a second image, a first lens including a first image-side principal plane, which first lens images points lying in a first area sharply onto the first light-sensitive layer of the first image sensor, and a second lens including a second image-side principal plane, which second lens images points lying in a second area sharply onto the second light-sensitive layer of the second image sensor. The distance between the first image-side principal plane of the first lens and the first light-sensitive layer of the first image sensor and the distance between the second image-side principal plane of the second lens and the second light-sensitive layer of the second image sensor are different.