An emotion estimation device includes: an image obtaining unit that obtains plural images in which an object person is photographed in time series; an expression recognizer that recognizes an expression of the object person from each of the plural images obtained by the image obtaining unit; a storage in which expression recognition results of the plural images are stored as time-series data; and an emotion estimator that detects a feature associated with a time change of the expression of the object person from the time-series data stored in the storage in an estimation target period, and estimates the emotion of the object person in the estimation target period based on the detected feature.