A method, computer system, and a computer program product for estimating error in predictions from a data model is provided. The present invention may include providing at least one first metric quantifying similarity of entities belonging to a first data type. The present invention may also include providing a second metric quantifying correlation of entities belonging to the first data type and entities belonging to a second data type. The present invention may then include developing a first model for predicting the second metric based on the at least one first metric. The present invention may further include developing a second model to estimate error in the first model.