GoFuckYourself.com - Adult Webmaster Forum

GoFuckYourself.com - Adult Webmaster Forum (https://gfy.com/index.php)
-   Fucking Around & Business Discussion (https://gfy.com/forumdisplay.php?f=26)
-   -   Getting all urls of a website / domain (https://gfy.com/showthread.php?t=1165954)

AdultSites 05-02-2015 02:24 PM

Getting all urls of a website / domain
 
Is it possible to know about a url, if it is not linked to from any other place on the Internet, including search engine sites, or not? I am not talking about owners of the sites obviously, but any person, trying to get max number of urls of any site.

Is mapping the site by internal linking the only way to do it? I know it is also possible to get data from paid tools, scrape search engine results, and other things like that. I also know that, in general, number of pages that are not linked to from anywhere on the Internet, is low / very low.

I've been wondering if it is possible with some kind of server / Linux techniques, but it would probably require breaking in into the system of a site (hacking, if so, I am not talking about that).

EddyTheDog 05-02-2015 02:31 PM

I would have said mapping the site would be the easiest way to go - There are sites that will do it - It's done a lot by people who need a sitemap for the search engines...

xXXtesy10 05-02-2015 02:51 PM


SilentKnight 05-02-2015 04:57 PM

Quote:

Originally Posted by AdultSites (Post 20464948)
Is it possible to know about a url, if it is not linked to from any other place on the Internet, including search engine sites, or not? I am not talking about owners of the sites obviously, but any person, trying to get max number of urls of any site.

Is mapping the site by internal linking the only way to do it? I know it is also possible to get data from paid tools, scrape search engine results, and other things like that. I also know that, in general, number of pages that are not linked to from anywhere on the Internet, is low / very low.

I've been wondering if it is possible with some kind of server / Linux techniques, but it would probably require breaking in into the system of a site (hacking, if so, I am not talking about that).

What's your purpose for this?


All times are GMT -7. The time now is 12:35 PM.

Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc