Systems and methods for capturing media content in accordance with viewer expression are disclosed. In some implementations, a method is performed at a computer system having one or more processors and memory storing one or more programs for execution by the one or more processors. The method includes: (1) while a media content item is being presented to a user, capturing a momentary reaction of the user (2) comparing the captured user reaction with one or more previously captured reactions of the user (3) identifying the user reaction as one of a plurality of reaction types based on the comparison (4) identifying the portion of the media content item corresponding to the momentary reaction and (5) storing an association between the identified user reaction and the portion of the media content item.