Start a new topic

Portia: Issue scraping from pages with incorrectly formatted html

I'm scraping a page from a website and none of the selectors are working to extract the data from a few specific fields namely, the living and property square feet fields.  I've tried Xpath, Auto, and a bunch of CSS selectors in different arrangements. Nothing seems to work in scraping the data. The only thing that I notice when inspecting the page source is that there is a closing list tag with a missing opening tag and I feel that this could be the problem.  Is there anyone who has come across a similar thing?  Is this possible a bug or an enhancement?


Example: http://www.thecaribbeanrealtor.com/property/property-detail/43476

Login to post a comment