Building semantic trees from XML documents