Exemplary methods for identifying progenies for use in plant breeding are disclosed. One exemplary computer-implemented method includes accessing a data structure including data representative of a pool of progenies and determining a prediction score for each progeny in at least a portion of the pool of progenies based on the data included in the data structure. The prediction score indicates, based on historical data, a probability that the progeny will be selected. The method further includes selecting a group of progenies from the pool of progenies based on the prediction score; identifying a set of progenies, from the group of progenies, based on at least one of an expected performance of the group of progenies and at least one factor associated with the set of progenies, the pool of progenies and/or the group of progenies; and directing the set of progenies into a validation phase of a breeding pipeline.
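
The two-stage narrowing described above (pool to group by prediction score, then group to set by expected performance and an associated factor) may be illustrated with a minimal Python sketch. It is not the disclosed implementation. Everything named here is a hypothetical assumption made for illustration: the `Progeny` record and its fields, the fixed score threshold, and the choice of a per-family cap as the "factor". The abstract does not specify how the prediction score is computed, so the scores are taken as given inputs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Progeny:
    # All fields are hypothetical; the abstract does not define a schema.
    progeny_id: str
    prediction_score: float      # assumed: probability of selection, derived from historical data
    expected_performance: float  # assumed: any predicted performance index
    family: str                  # assumed factor: the origin (cross) of the progeny


def select_group(pool, score_threshold):
    """Stage 1: keep progenies whose prediction score meets an assumed threshold."""
    return [p for p in pool if p.prediction_score >= score_threshold]


def identify_set(group, set_size, max_per_family):
    """Stage 2: greedily take the best expected performers, capping each family."""
    chosen = []
    per_family = {}
    ranked = sorted(group, key=lambda p: p.expected_performance, reverse=True)
    for p in ranked:
        if len(chosen) == set_size:
            break
        if per_family.get(p.family, 0) < max_per_family:
            chosen.append(p)
            per_family[p.family] = per_family.get(p.family, 0) + 1
    return chosen


# Illustrative data only; these values are invented for the example.
pool = [
    Progeny("P1", 0.91, 7.2, "A"),
    Progeny("P2", 0.85, 7.0, "A"),
    Progeny("P3", 0.40, 9.9, "B"),
    Progeny("P4", 0.77, 6.5, "B"),
    Progeny("P5", 0.80, 6.8, "A"),
]

group = select_group(pool, score_threshold=0.75)
validation_set = identify_set(group, set_size=3, max_per_family=2)

# The resulting set would then be directed into the validation phase.
print([p.progeny_id for p in validation_set])
```

In this sketch P3 is dropped in the first stage despite its high expected performance, because its prediction score is below the assumed threshold, and P5 is passed over in the second stage because family A has already reached the assumed cap.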