A system and method for early media buffering using prediction of user behavior. In accordance with an embodiment, a user interface displays a plurality of media options from which particular options can be selected. A click determination logic is configured so that a first event associated with a particular option, such as a click event, is passed singly to a media application without trapping for the possibility of a double-click. The media application interprets the first event as a likely selection by a user of the particular option, and uses information associated with the likely selection to begin buffering a corresponding media content. If a second event associated with the particular option is received within a subsequent time interval, then the second event is treated, like a double-click, as confirmation of the user's selection, and the corresponding media content is streamed from its media content buffer.