As a user, I want to generate a list of words that could be used for a document index, so that I can refine the index more quickly.
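
The story does not say how the word list would be produced. As a rough sketch only, one possible approach is to tokenise the document text, drop common stop words, and rank the remaining words by frequency. The function name `candidate_index_words`, the `STOP_WORDS` set, and the `min_length` and `min_count` thresholds are all hypothetical and not part of the original requirement.

```python
import re
from collections import Counter

# Hypothetical, deliberately short stop-word list (an assumption for this sketch).
STOP_WORDS = {
    "a", "an", "and", "the", "of", "to", "in", "is", "it", "that",
    "for", "on", "with", "as", "be", "are", "this", "by", "or",
}


def candidate_index_words(text, min_length=3, min_count=2):
    """Return candidate index words, most frequent first (ties alphabetical).

    min_length and min_count are assumed thresholds, not given in the story.
    """
    # Lowercase the text and extract alphabetic words, keeping internal hyphens.
    words = re.findall(r"[a-z]+(?:-[a-z]+)*", text.lower())

    # Count words that are long enough and are not stop words.
    counts = Counter(
        w for w in words if len(w) >= min_length and w not in STOP_WORDS
    )

    # Sort by descending count, then alphabetically for a stable order.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))

    # Keep only words that occur at least min_count times.
    return [word for word, count in ranked if count >= min_count]


if __name__ == "__main__":
    sample = (
        "The index lists terms. An index helps readers find terms. "
        "Readers like a good index."
    )
    print(candidate_index_words(sample))
```

The output is only a starting list. The user would still review it and add, remove, or merge entries, which is the refinement step the story refers to.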