07-01-2016, 11:34 PM
johnnyloadproductions
Account Shutdown
 
Industry Role:
Join Date: Oct 2008
Location: Gone
Posts: 3,611
Quote:
Originally Posted by deonbell
It gives a Cloudflare error for every name.
I typed random names into the username.txt file, which you didn't include.

I got Cloudflare errors as well.

Quote:
<h2 class="cf-subheadline" data-translate="error_desc">Direct IP access not allowed</h2>
I don't know much about sockets, so my advice is to use a headless browser instead.

A headless browser acts like a normal browser, except that it is controlled programmatically.

You can use PhantomJS, but because you are using Python I would use a library called dryscrape.
Installation — dryscrape 0.8 documentation

Quote:
dryscrape is a lightweight web scraping library for Python. It uses a headless Webkit instance to evaluate Javascript on the visited pages. This enables painless scraping of plain web pages as well as Javascript-heavy “Web 2.0” applications like Facebook.
Struggle with it, learn it. Sounds like you have Python up and running ok.

I'm actually using dryscrape to build an archive site right now.
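
To give you a starting point, here is a rough sketch of fetching a page through dryscrape and checking whether you still got the Cloudflare error page. The function names are made up for illustration, and the check only looks for the cf-subheadline class from the error quoted above, which is an assumption about how to spot that page. It also assumes dryscrape and its WebKit dependencies are already installed, so treat it as something to adapt rather than a finished script.

```python
import sys


def fetch_rendered_html(url):
    """Load url in a headless WebKit session and return the rendered HTML.

    Hypothetical helper; dryscrape is imported here so the check below
    can be used even on a machine where dryscrape is not installed.
    """
    import dryscrape

    if 'linux' in sys.platform:
        # Start xvfb in case no X server is running (xvfb must be installed).
        dryscrape.start_xvfb()

    session = dryscrape.Session()
    # Skip images; only the markup matters for scraping.
    session.set_attribute('auto_load_images', False)
    session.visit(url)
    return session.body()


def looks_like_cloudflare_error(html):
    """Return True if the HTML contains Cloudflare's error sub-headline.

    Assumption: the cf-subheadline class seen in the quoted error is
    enough to recognise the error page.
    """
    return 'cf-subheadline' in html
```

You would call fetch_rendered_html() with the site's normal hostname URL (not the raw IP), then pass the result to looks_like_cloudflare_error() before trying to parse anything out of it.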