dariah_topics.preprocessing.dariah_topics.preprocessing.tokenize (from pytest)
178047 DEBUG: Tokenizing document ...
178048 DEBUG: Lowering all characters ...