This disclosure relates generally to physiological monitoring, and more particularly to feature set optimization for classification of physiological signal. In one embodiment, a method for physiological monitoring includes identifying clean physiological signal training set from an input physiological signal based on a Dynamic Time Warping (DTW) of segments associated with the physiological signal. An optimal features set is extracted from a clean physiological signal training set based on a Maximum Consistency and Maximum Dominance (MCMD) property associated with the optimal feature set that strictly optimizes on the objective function, the conditional likelihood maximization over different selection criteria such that diverse properties of different selection parameters are captured and achieves Pareto-optimality. The input physiological signal is classified into normal signal components and abnormal signal components using the optimal features set.