A voice quality preference learning device according to an embodiment includes a storage, a user interface system, and a learning processor. The storage stores a plurality of acoustic models. The user interface system receives an operation input indicating a voice quality preference of a user. The learning processor learns a preference model corresponding to the voice quality preference of the user based at least in part on the operation input. The operation input is associated with a voice quality space, and the voice quality space is obtained by dimensionally reducing the plurality of acoustic models.
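
As an illustration only, the following Python sketch shows one way a voice quality space and a preference model of this kind could be formed. It makes several assumptions that go beyond the description above: each acoustic model is flattened into a fixed-length parameter vector, the dimensional reduction is principal component analysis, the operation input reduces to a list of indices of voices the user marked as preferred, and the preference model is a single Gaussian over the reduced coordinates. All function names are hypothetical.

```python
import numpy as np


def build_voice_quality_space(models, n_dims):
    """Reduce acoustic-model vectors (one per row) to n_dims coordinates.

    Assumption: PCA via SVD stands in for the dimensional reduction.
    """
    mean = models.mean(axis=0)
    centered = models - mean
    # Rows of vt are the principal directions, ordered by variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_dims]
    coords = centered @ basis.T
    return mean, basis, coords


def learn_preference_model(coords, liked_indices, ridge=1e-3):
    """Fit a Gaussian to the points the user marked as preferred.

    Assumption: the operation input is a list of preferred voice indices.
    The ridge term keeps the covariance invertible with few samples.
    """
    liked = coords[liked_indices]
    mu = liked.mean(axis=0)
    diff = liked - mu
    cov = diff.T @ diff / max(len(liked) - 1, 1)
    cov = cov + ridge * np.eye(coords.shape[1])
    return mu, cov


def preference_score(point, mu, cov):
    """Unnormalized Gaussian score in (0, 1]; 1 at the preference center."""
    diff = point - mu
    return float(np.exp(-0.5 * diff @ np.linalg.solve(cov, diff)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    models = rng.normal(size=(20, 50))  # 20 synthetic acoustic models
    mean, basis, coords = build_voice_quality_space(models, n_dims=3)
    mu, cov = learn_preference_model(coords, liked_indices=[0, 1, 2])
    for i in (0, 10):
        print(i, preference_score(coords[i], mu, cov))
```

In this sketch a new point in the voice quality space can be mapped back to an approximate acoustic-model vector with `mean + point @ basis`, which is one way a learned preference could be used to suggest candidate voices.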