Jenkins ChangeSet for http://dev.digital-humanities.de/ci/view/All/job/DARIAH-Topics/78/
find_hapax now removes hapax over the whole corpus and not only overby cyberpillip
#IntegrationTest_txt_gensim.ipynb
#dariah_topics/preprocessing.py