A system for image segmentation is provided. The system may obtain a target image including an ROI, and segment a preliminary region representative of the ROI from the target image using a first ROI segmentation model corresponding to a first image resolution. The system may segment a target region representative of the ROI from the preliminary region using a second ROI segmentation model corresponding to a second image resolution. At least one model of the first and second ROI segmentation models may at least include a first convolutional layer and a second convolutional layer downstream to the first convolutional layer. A count of input channels of the first convolutional layer may be greater than a count of output channels of the first convolutional layer, and a count of input channels of the second convolutional layer may be smaller than a count of output channels of the second convolutional layer.