Jenkins ChangeSet for http://dev.digital-humanities.de/ci/job/DARIAH-Topics/38/
Fix tokenizer issue by github
#dariah_topics/preprocessing.py