Disclosed are a method and device for processing language data items and a method and device for analyzing language data items. The method comprises: acquiring a part or all of language data items to form a collection of language data items; determining an intent corresponding to each language data item in the collection of language data items; analyzing each language data item in the collection of language data items to determine terms in each language data item; determining an appearance frequency of each term in the collection of language data items; determining an appearance frequency of each term for each intent; and determining, according to the appearance frequency of each term in the collection of language data items and the appearance frequency of each term for each intent, a weight of each term for each intent.