A portable system that allows blind or visually impaired persons to interpret the surrounding environment by sound or touch, said system comprising: two cameras separate from one another and configured to capture an image of the environment simultaneously, and means for generating sound and/or touch output signals. Advantageously, the system also comprises processing means connected to the cameras and to the means for generating sound and/or touch signals. The processing means are configured to combine the images captured in real time and to process the information associated with at least one vertical band with information relating to the depth of the elements in the combined image.