0
Answered
mark80 3 weeks ago in Portia • updated by Pablo Vaz (Support Engineer) 3 weeks ago 1

http://www.ancebergamo.it/elencoimprese.asp?lettera=C


i need extract company name and mail info from every company paginated in alphabetic order..

i've set url generation, specifing every alphabetic letter but only 3 item after thousand of request..strangely slow..

can you take a look?


Pablo i've tried to add you to project? can you see my invitation?

Answer

Answer
Answered

Hi Mark,


As you can see, every page is like:

http://www.ancebergamo.it/elencoimprese.asp?lettera=1
http://www.ancebergamo.it/elencoimprese.asp?lettera=A
http://www.ancebergamo.it/elencoimprese.asp?lettera=B

and so on...


Please try to use pagination or URL list generation as seen on:

1. Extract data from a List of URLs

2. Handle pagination in Portia


I've noticed you has Follow all domain links on the pagination settings, at least for link_name_ink_extraction.


I hope you find this helpful.


Best regards!


Pablo

Answer
Answered

Hi Mark,


As you can see, every page is like:

http://www.ancebergamo.it/elencoimprese.asp?lettera=1
http://www.ancebergamo.it/elencoimprese.asp?lettera=A
http://www.ancebergamo.it/elencoimprese.asp?lettera=B

and so on...


Please try to use pagination or URL list generation as seen on:

1. Extract data from a List of URLs

2. Handle pagination in Portia


I've noticed you has Follow all domain links on the pagination settings, at least for link_name_ink_extraction.


I hope you find this helpful.


Best regards!


Pablo