Text this: A heuristics for HTTP traffic identification in measuring user dissimilarity