Start a new topic
Answered

Line Breaks

Im am scraping ownership and valuation info from the tax assessor website. How can i preserve line breaks that appear in the mailing address (and other fields) in the following URL: http://qpublic9.qpublic.net/la_orleans_display.php?KEY=208-AUDUBONST


I want the following:


ET AL
210 AUDUBON ST 
NEW ORLEANS, LA 70118


but im getting: ET AL 210 AUDUBON ST NEW ORLEANS, LA 70118


Using excel's text to columns is a way to parse, combine, rearrange to get what i want in batches, but preserving line breaks would get to the root of the problem


Thanks, Paul



Best Answer

Hi Paul, I'm not sure this can be done, from Portia,


but once you have the data if all the second lines are in the format [Number] "name of the street" [ST] maybe you can split using regex.


Or... maybe you can parse in Portia using regex and store each line as different items.


Check this article could be helpful.


Best,


Pablo

1 Comment

Answer

Hi Paul, I'm not sure this can be done, from Portia,


but once you have the data if all the second lines are in the format [Number] "name of the street" [ST] maybe you can split using regex.


Or... maybe you can parse in Portia using regex and store each line as different items.


Check this article could be helpful.


Best,


Pablo

Login to post a comment