Text this: Augsburg data set and Berlin data set for multimodal classification