The present invention provides an articulation evaluation method and system combining acoustic features and articulation motion features. According to the articulation evaluation method and system, audio data and articulation motion data are acquired, acoustic features are extracted from the audio data, articulation motion features are extracted from the articulation motion features, and feature fusion and policy fusion are performed on the acoustic features and the articulation motion features according to a time correspondence, which effectively utilizes the complementarity of the two types of features to ensure the objectivity and comprehensiveness of evaluation, so that a more accurate and reliable feature fusion evaluation result and decision fusion evaluation result are obtained, making the articulation evaluation more objective and accurate.