GoFuckYourself.com - Adult Webmaster Forum

GoFuckYourself.com - Adult Webmaster Forum (https://gfy.com/index.php)
-   Fucking Around & Business Discussion (https://gfy.com/forumdisplay.php?f=26)
-   -   Tech how to scrape wordpress site ? (https://gfy.com/showthread.php?t=1280101)

freecartoonporn 09-24-2017 05:41 AM

how to scrape wordpress site ?
 
what are you guys using ?


i have list of urls , i need to scrape, images and or videos from.
target site is using wordpress.

i can create crawler myself using php simple html dom., but why reinvent the wheel.,

what is best out there ?

with scheduling and keeping watch on target site. , auto create categories, tags, auto scrape images , include them in our posts and set post as draft.

thanks for your time.

k0nr4d 09-24-2017 05:46 AM

Why not just use the RSS feed?

freecartoonporn 09-24-2017 05:47 AM

Quote:

Originally Posted by k0nr4d (Post 22012195)
Why not just use the RSS feed?

there are old posts, on target site, and i am not sure., they will be in feed., as basically feed has limit to last 10 posts , i guess.

is thre way i can access old posts using feed ?

klinton 09-24-2017 07:20 AM

Cyberseo :)

freecartoonporn 09-24-2017 08:51 AM

i have list of thousands of urls, i need to scrape. and create posts in draft mode., then i can only publish it after checking each post. for valid image links.

freecartoonporn 09-24-2017 08:52 AM

Quote:

Originally Posted by klinton (Post 22012285)
Cyberseo :)

i have list of thousands of urls, i need to scrape. and create posts in draft mode., then i can only publish it after checking each post. for valid image links.


is it possible with cyberseo.,
which version, cheap or expensive one ?

klinton 09-24-2017 09:45 AM

which version of it is cheap ? :winkwink:
im talking about normal version.
all options that you mentioned are default, the last one either is default (if i understand you correctly) or need some small simple parser to be written in php.
speak with Cyberseo, he posts here.
Quote:

Originally Posted by freecartoonporn (Post 22012385)
i have list of thousands of urls, i need to scrape. and create posts in draft mode., then i can only publish it after checking each post. for valid image links.


is it possible with cyberseo.,
which version, cheap or expensive one ?


freecartoonporn 09-24-2017 09:59 AM

Quote:

Originally Posted by klinton (Post 22012443)
which version of it is cheap ? :winkwink:
im talking about normal version.
all options that you mentioned are default, the last one either is default (if i understand you correctly) or need some small simple parser to be written in php.
speak with Cyberseo, he posts here.

i meant, the one which has lower price that other one.

i am still looking for more options, so i can decide .

cordoba 09-24-2017 01:44 PM

Quote:

Originally Posted by klinton (Post 22012285)
Cyberseo :)

Surely Google is clever enough these days to scan (and penalize) sites for using an overtly blackhat plugin?

brassmonkey 09-24-2017 02:41 PM

i can scrape it for a fee :)

Smack dat 09-24-2017 02:43 PM

There are loads of nulled cyberseo around.

freecartoonporn 09-24-2017 06:53 PM

Quote:

Originally Posted by brassmonkey (Post 22012699)
i can scrape it for a fee :)

Quote:

i can create crawler myself using php simple html dom., but why reinvent the wheel.,

Quote:

Originally Posted by Smack dat (Post 22012707)
There are loads of nulled cyberseo around.

nulled = bugged.

brassmonkey 09-24-2017 07:24 PM

Quote:

Originally Posted by freecartoonporn (Post 22012907)
nulled = bugged.

free = go fuck yourself! nulled my software is paid for

Barry-xlovecam 09-24-2017 08:45 PM

So you want help stealing the other guy's shit ... NO

Not only that but you want to hotlink to the stolen shit too?

Even dumber than dumb -- you post a thread to leave a record of your malfeasance.

Congratulations, you get the GFY dumb-fuck of the month award -- a bag of poop from?

Get a new fuckin' plan

freecartoonporn 09-24-2017 09:40 PM

Quote:

Originally Posted by Barry-xlovecam (Post 22012971)
So you want help stealing the other guy's shit ... NO

i am stealing from thief, i know two wrong does not make one right but still , i wanna try before i die.

Quote:

Not only that but you want to hotlink to the stolen shit too?
where i said, that i want to hotlink ?

i want to downlod content to my server.

Quote:

Even dumber than dumb -- you post a thread to leave a record of your malfeasance.
nobody cares.

Quote:

Congratulations, you get the GFY dumb-fuck of the month award -- a bag of poop from?
thanks.

Quote:

Get a new fuckin' plan
not until this plan fails.

just a punk 09-25-2017 12:04 AM

Quote:

Originally Posted by freecartoonporn (Post 22012199)
there are old posts, on target site, and i am not sure., they will be in feed., as basically feed has limit to last 10 posts , i guess.

is thre way i can access old posts using feed ?

Quote:

Originally Posted by freecartoonporn (Post 22012181)
what are you guys using ?

1) Install CyberSEO.
2) Add a link to WordrPress RSS feed.
3) Enable "Parse WordPress archives" option.

The target site will be fully scraped - doesn't matter how many posts it has in the RSS feed (1, 10 or none), the plugin will scrape just EVERYTHING - yes, including all old posts that you can't see in the RSS feed. It will even copy all the media to your host if you want it.

Quote:

Originally Posted by cordoba (Post 22012633)
Surely Google is clever enough these days to scan (and penalize) sites for using an overtly blackhat plugin?

My plugin is undetectable. The script file gives 404 error if you try to open it. The same does the plugin's folder.

Quote:

Originally Posted by freecartoonporn (Post 22012907)
nulled = bugged.

The version of 2012 which is floating on the net is very old, buggy and there is a backdoor added by hackers. No updates and no support too (you can't even read the official forum).

just a punk 09-25-2017 12:21 AM

Quote:

Originally Posted by Barry-xlovecam (Post 22012971)
So you want help stealing the other guy's shit ...

I won't be so radical here. There are many promo blogs that belong to affiliate programs and made to promote their paysites. The problem is that their RSS feeds include the most recent posts only. My plugin allows to scrape the old posts too.

freecartoonporn 09-25-2017 02:04 AM

Quote:

Originally Posted by CyberSEO (Post 22013031)
1) Install CyberSEO.
2) Add a link to WordrPress RSS feed.
3) Enable "Parse WordPress archives" option.

The target site will be fully scraped - doesn't matter how many posts it has in the RSS feed (1, 10 or none), the plugin will scrape just EVERYTHING - yes, including all old posts that you can't see in the RSS feed. It will even copy all the media to your host if you want it.



My plugin is undetectable. The script file gives 404 error if you try to open it. The same does the plugin's folder.



The version of 2012 which is floating on the net is very old, buggy and there is a backdoor added by hackers. No updates and no support too (you can't even read the official forum).

will it auto scrape new posts too ?

looks like i missed the boat, theres no lite version ?

just a punk 09-25-2017 02:06 AM

Quote:

Originally Posted by freecartoonporn (Post 22013105)
will it auto scrape new posts too ?

Yes, of course. It will do it automatically.

Quote:

Originally Posted by freecartoonporn (Post 22013105)
looks like i missed the boat, there no lite version?

There is no lite version anymore. Only the unlimited one.

freecartoonporn 09-25-2017 02:30 AM

Quote:

Originally Posted by CyberSEO (Post 22013107)
Yes, of course. It will do it automatically.



There is no lite version anymore. Only the unlimited one.

can i use delete id feeds to delete the specific id posts from our site ?

i mean like MGP/TGP2 guys do it. big tubes provide deleted id feed.

just a punk 09-25-2017 03:17 AM

Quote:

Originally Posted by freecartoonporn (Post 22013123)
can i use delete id feeds to delete the specific id posts from our site ?

i mean like MGP/TGP2 guys do it. big tubes provide deleted id feed.

You mean to use an RSS feed with id of posts that have to be deleted? With CyberSEO - you can't do it. I can write a special plugin for that.

freecartoonporn 09-25-2017 03:51 AM

Quote:

Originally Posted by CyberSEO (Post 22013147)
You mean to use an RSS feed with id of posts that have to be deleted? With CyberSEO - you can't do it. I can write a special plugin for that.

okie., can it rename file names ?
and save using new file names.

just a punk 09-25-2017 04:50 AM

Quote:

Originally Posted by freecartoonporn (Post 22013165)
okie., can it rename file names ?
and save using new file names.

Sure, why not? By default CyberSEO uses the original post title to rename the downloaded media. E.g.: Playboy Playmates

freecartoonporn 09-25-2017 08:11 AM

Quote:

Originally Posted by CyberSEO (Post 22013203)
Sure, why not? By default CyberSEO uses the original post title to rename the downloaded media. E.g.: Playboy Playmates

is there str replace ., so in case , post content has domain name, i would like to replace it with mine.

something like that ?


All times are GMT -7. The time now is 12:18 PM.

Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc