I've been using ScrapeBox to find contact us pages. The operator I've been using is: inurl:contact. I've scraped roughly 50 million URLs in the past couple of months using that operator with private proxies. I've been using 25 proxies per server (I have 5 servers running ScrapeBox). I run the servers pretty slow: 1 Google thread, with RND set at 4 seconds on the low end and 40 seconds on the high end.
I've been using private proxies from myprivateproxy.net, and they worked great until about 10 days ago. I've had the proxies replaced twice since then and it hasn't made a bit of difference.
I had one server using 25 proxies from Squid Proxies and it did really well, so about a week ago, when MyPrivateProxy's proxies weren't working, I ordered 100 more from Squid Proxies. Those didn't work any better.
Before this, I really hadn't had a lot of trouble scraping with the inurl:contact operator. When I did have trouble in the past, it would be on 1 or 2 servers out of 5, but now it's on all 5.
I'm at a loss now and could really use some help from those of you who really know what you're doing.