Embodiments of the present disclosure relate generally to index trimming to improve search results over a large corpus. In some embodiments, before a search query of one or more keywords is received from a user device to search for a relevant set of publications in a publication corpus, candidate publications are trimmed from a plurality of candidate publications to generate a trimmed plurality of candidate publications.
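
By way of non-limiting illustration, the ordering described above (trimming before any query is received, then searching only the trimmed set) may be sketched as follows. The disclosure does not specify the trimming criterion at this point; the `score` field, the `min_score` threshold, and the function names `trim_candidates`, `build_index`, and `search` are hypothetical and chosen only for this sketch.

```python
from collections import defaultdict


def trim_candidates(candidates, min_score):
    """Offline step: drop candidates below a threshold.

    The per-publication 'score' is an assumed stand-in for whatever
    trimming criterion a given embodiment actually uses.
    """
    return [pub for pub in candidates if pub["score"] >= min_score]


def build_index(publications):
    """Build a keyword -> publication-id inverted index over the trimmed set."""
    index = defaultdict(set)
    for pub in publications:
        for word in pub["title"].lower().split():
            index[word].add(pub["id"])
    return index


def search(index, query):
    """Query-time step: return ids of publications matching every keyword."""
    keywords = query.lower().split()
    if not keywords:
        return set()
    result = set(index.get(keywords[0], set()))
    for keyword in keywords[1:]:
        result &= index.get(keyword, set())
    return result


# Hypothetical corpus; titles and scores are illustrative only.
corpus = [
    {"id": 1, "title": "red running shoes", "score": 0.9},
    {"id": 2, "title": "red shoes", "score": 0.1},
    {"id": 3, "title": "blue running shoes", "score": 0.7},
]

# Trimming and indexing happen before any query arrives.
trimmed = trim_candidates(corpus, min_score=0.5)
index = build_index(trimmed)

# A later query is served from the trimmed index only.
results = search(index, "red shoes")
```

In this sketch, publication 2 matches the keywords but is never returned, because it was removed from the candidate set before the query was received.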