# lq-text common word stop-list # Keep this list short -- at most 50 words -- or you will pay a penalty # in performance when you add documents to the index -- the list is # searched linearly (but is kept sorted internally, so it's OK to have # duplicated in here). # First index some text with everything commented out, and then use # FindCommon to determine which are very common words. You don't gain # all that much space by deleting them, so I don't usually bother. # the # 27880 <-- number of times this word appeared in a sample run # and # 23857 <-- on part (or all? I forget) of the King James Bible.. # that # 4705 # for # 3011