![]() |
Need an email scraper
I need to contact a shitload of people - (read: CONTACT, not spam) - who all happen to be members of a site that lists their email addresses and phone numbers. Is there a program that will search the site and collect the email addresses for me?
|
so you are going to write personal messages to each and every single one of them? If not, its spam :)
|
if its your site you have access to the database and thats easy, if its not your site, you are going to spam no matter how you call it
|
I'm not here for your opinion. I'm here to find a solution to my need.
If anyone knows of a program, feel free to email me - wolfyman at gmail. |
^[a-z0-9._%+-]+@[a-z0-9.-]+\.[a-z]{2,4}$
|
use httrack to download the site (grab html only) then run an email extractor on the html files, I'm sure that there are better ways, google email extractor and see what you come up with
|
Quote:
Google: "site:example.com ^[a-z0-9._%+-]+@[a-z0-9.-]+\.[a-z]{2,4}$" returned nothing, what am I missing? Quote:
|
LOL. That's a regular expression. Why don't you just hire some cheap coder on Rent-a-Coder and have him write you a script which does it? Probably cost like $50 max.
|
do you have access to the sql db? if so it would be only a few lines of code
|
Quote:
|
Killswitch, it doesn't appear to work using it in google - how would I use that "regex"?
Quote:
|
Quote:
|
Quote:
|
I've retrieved all the email addresses on the site, thanks to a cool mofo that I'll happily give credit to if he wants it.
Quote:
For the sake of knowledge only, since my mission has been accomplished - how do I apply that regex expression to a task like this? I don't have control of the site, so any php pages I would build would be outside of the domain and would not have access to any databases. What am I missing? |
i am sure you can buy it from internet classifieds.
|
if what your doing is not spam then I don't know what is :1orglaugh
|
I still prefer this regex.
|
Quote:
i would have to see the site , but if you put your email on a public site , arent you basiclly asking for unsolicited mail ? Kinda like putting your # on a bathroom stall then saying every pervert who contacts you is "spamming" the only exception to the rule i would think would be if its implied on the site the mail is to be used for a specific purpose, or its a whois info that isn't posted by the user |
I need a poop scraper.
|
were you able to sort the first site out?
|
Quote:
|
Quote:
Personally, I'd go for another language than php for this, but really, it can be done in pretty much any programming language. Set a bot like that loose on a big directory, and you'll eventually build up a list of millions of email addresses. Of course, others do the same thing as well, so the email addresses won't exactly be fresh. Keep in mind that site owners might have email harvester traps, which generate a list of random invalid email addresses and generate dynamic links to themselves as well, ensuring that if your harvester bot isn't protected from them, it will keep getting new invalid email addresses from them forever. |
Perl allows you to be one lazy ass coder. Jump on CPAN and install the Net::Scan::Extract module
Code:
use Net::Scan::Extract qw( :all ); |
All times are GMT -7. The time now is 08:11 PM. |
Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc123