A system and a computer-implemented method for detecting retinopathy are provided. The system comprises an image input module configured to receive one or more fundus images. The system further comprises a pre-processing module configured to apply one or more transformations to the one or more received fundus images, and a feature extraction module configured to extract one or more features from the one or more transformed images using one or more Convolutional Neural Networks (CNNs). The system also comprises a prediction module configured to determine a stage of retinopathy by classifying the one or more extracted features using pre-stored features, wherein the pre-stored features are extracted from one or more training fundus images by the one or more CNNs, and wherein each pre-stored feature corresponds to a class associated with a predetermined stage of retinopathy.
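
By way of illustration only, the flow through the four modules may be sketched as follows. This is a minimal sketch under stated assumptions, not the disclosed implementation: the names `preprocess`, `toy_extractor`, `build_store`, `predict_stage` and `detect` are hypothetical, the transformation shown (intensity rescaling) is only one possible transformation, the feature extractor is a simple stand-in for the one or more CNNs, and nearest-neighbour matching by Euclidean distance is assumed as the classification rule because the abstract does not name one.

```python
import math
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]   # grayscale image as rows of pixel intensities
Features = List[float]
Extractor = Callable[[Image], Features]


def preprocess(image: Image) -> Image:
    """Pre-processing module (assumed transformation): rescale to [0, 1]."""
    flat = [p for row in image for p in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid division by zero on uniform images
    return [[(p - lo) / span for p in row] for row in image]


def toy_extractor(image: Image) -> Features:
    """Stand-in for a CNN: mean intensity and mean horizontal gradient."""
    flat = [p for row in image for p in row]
    grads = [abs(row[i + 1] - row[i])
             for row in image for i in range(len(row) - 1)]
    mean_grad = sum(grads) / len(grads) if grads else 0.0
    return [sum(flat) / len(flat), mean_grad]


def build_store(training: Sequence[Tuple[Image, str]],
                extractor: Extractor) -> List[Tuple[Features, str]]:
    """Extract and store one feature vector per labelled training image."""
    return [(extractor(preprocess(img)), stage) for img, stage in training]


def predict_stage(features: Features,
                  store: Sequence[Tuple[Features, str]]) -> str:
    """Prediction module (assumed rule): stage of the nearest stored feature."""
    _, stage = min(store, key=lambda item: math.dist(features, item[0]))
    return stage


def detect(image: Image, extractor: Extractor,
           store: Sequence[Tuple[Features, str]]) -> str:
    """Input -> pre-processing -> feature extraction -> prediction."""
    return predict_stage(extractor(preprocess(image)), store)
```

In this sketch the same extractor is applied to the training images and to the received image, mirroring the requirement that the pre-stored features are extracted by the same one or more CNNs. In practice `toy_extractor` would be replaced by a trained network, and the stage labels would be the predetermined stages of retinopathy.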