dariah_topics.preprocessing.dariah_topics.preprocessing.tokenize (from pytest)
DEBUG: Tokenizing document ... DEBUG: Lowering all characters ...