Processes for identifying binding sites, such as epitopes, on protein molecules are provided. In some embodiments, the disclosed methods combine protein sequence data, structural information, experimental binding affinity data, and computational modeling to identify binding sites on protein molecules. Systems and computer readable media for implementing the disclosed methods are also provided. Further provided are compositions comprising, as an active ingredient, a binding site or an antibody that interacts with the binding site, and methods of using such compositions.