Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed. |
#1 |
Confirmed User
Industry Role:
Join Date: Jun 2003
Posts: 267
|
Hello,
I am looking for a search engine script, like Google, with a feature that checks linkbacks and adjusts the quality score of keyword results accordingly. For example, if a site has 10 linkbacks, its position in the search results rises. Any ideas? Thank you, Jimba |
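A minimal sketch of the kind of score adjustment described in this post, assuming the text search already returns a keyword score per URL and that linkback counts are available in a separate lookup. The function name `rerank`, the `weight` parameter and the logarithmic boost are illustrative assumptions, not part of any particular search script.

```python
import math

def rerank(results, linkbacks, weight=0.5):
    """Sort search results by keyword score boosted by linkback count.

    results   -- list of (url, keyword_score) pairs from the text search
    linkbacks -- dict mapping url -> number of known linkbacks
    weight    -- hypothetical tuning knob for how much linkbacks matter
    """
    def boosted(item):
        url, score = item
        # log1p damps the boost so a huge linkback count
        # cannot completely drown out keyword relevance.
        return score * (1.0 + weight * math.log1p(linkbacks.get(url, 0)))

    return sorted(results, key=boosted, reverse=True)

# Hypothetical data: b.example matches the keyword slightly worse
# but has 10 linkbacks, so it moves above a.example.
results = [("a.example", 1.0), ("b.example", 0.8)]
linkbacks = {"b.example": 10}
for url, score in rerank(results, linkbacks):
    print(url, score)
```

A multiplicative boost keeps a page with no keyword match at score zero regardless of its linkbacks; an additive boost would be another reasonable choice.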
#2 |
Raise Your Weapon
Industry Role:
Join Date: Jun 2003
Location: Outback Australia
Posts: 15,605
|
Indexing how many sites?
|
#3 |
Confirmed User
Industry Role:
Join Date: Jun 2003
Posts: 267
|
|
#4 |
Registered User
Industry Role:
Join Date: Dec 2011
Posts: 22
|
|
#5 |
Raise Your Weapon
Industry Role:
Join Date: Jun 2003
Location: Outback Australia
Posts: 15,605
|
1000 sites: Nutch + Hadoop
500k sites: get a small server farm, then Nutch + Hadoop |
#6 |
Raise Your Weapon
Industry Role:
Join Date: Jun 2003
Location: Outback Australia
Posts: 15,605
|
It's worth noting that Nutch is not trivial to deploy, so if you relaxed your requirements there are plenty of far simpler scripts for basic search.
|
#7 | |
Confirmed User
Industry Role:
Join Date: Jun 2003
Posts: 267
|
Quote:
Please explain in more detail. I have no idea what that is or what it is used for. Thank you |
|
#8 |
Raise Your Weapon
Industry Role:
Join Date: Jun 2003
Location: Outback Australia
Posts: 15,605
|
Unless you use Sphider (http://www.sphider.eu) or a similar script, implementing a search engine is non-trivial.
What you want can't be achieved with Sphider, because indexing, weighting and ranking sites based on keywords and inbound links is a resource-intensive task. Sphider does not support ranking based on inbound links; Nutch does, as do some other open source search platforms. You could develop your own solution and run its crawler on Amazon EC2 instances to avoid investing in hardware, but you would still need somewhere to house your database. |
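To make the "ranking based on inbound links" part concrete, here is a rough sketch of how inbound links could be counted once a crawler has produced a link graph. It assumes the crawl result is a plain dict of page URL to outgoing URLs, and it counts distinct linking hosts rather than raw links; both are assumptions for illustration and not how Nutch or Sphider work internally.

```python
from collections import defaultdict
from urllib.parse import urlsplit

def count_linkbacks(link_graph):
    """Count distinct external hosts linking to each host.

    link_graph -- dict mapping a crawled page URL to the list of
                  URLs that page links to (hypothetical crawl output)
    """
    linking_hosts = defaultdict(set)
    for source_url, targets in link_graph.items():
        source_host = urlsplit(source_url).hostname
        for target_url in targets:
            target_host = urlsplit(target_url).hostname
            # Skip malformed URLs and links within the same host.
            if source_host and target_host and target_host != source_host:
                linking_hosts[target_host].add(source_host)
    return {host: len(sources) for host, sources in linking_hosts.items()}

# Hypothetical crawl result covering three hosts.
graph = {
    "http://a.example/1": ["http://b.example/", "http://a.example/2"],
    "http://a.example/2": ["http://b.example/x"],
    "http://c.example/": ["http://b.example/", "http://a.example/"],
}
print(count_linkbacks(graph))
```

Counting hosts instead of individual links means a single site cannot inflate its own target by linking to it from every page. At the scale discussed in this thread, the graph would not fit in one in-memory dict, which is where a distributed setup comes in.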
#9 |
Raise Your Weapon
Industry Role:
Join Date: Jun 2003
Location: Outback Australia
Posts: 15,605
|
Here's something I posted on the subject two years ago in response to a similar question:
https://gfy.com/showpost.php?p=18075524&postcount=8
It's a bit out of date now; I'd lean toward Nutch these days. |
#10 |
So Fucking Banned
Industry Role:
Join Date: Dec 2012
Posts: 384
|
Adult king you fuckin' rock!!!
#11 |
Registered User
Industry Role:
Join Date: Oct 2012
Location: Germany
Posts: 28
|
Do you know PHP and/or another language you can develop with on a server (Perl, C++)? It's pretty simple to get a crawler going if you're apt with programming. Once you've got an "indexer" working, the weighting and all that jazz is the fun part.
The search engine isn't really the hard part to program; the crawler is, if you ask me. It also depends on what you're crawling and how you're displaying it. |
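As a rough idea of what the crawler part involves, here is a small breadth-first sketch using only the Python standard library (Python is used here purely for illustration; the post mentions PHP, Perl and C++). The `fetch` argument is a hypothetical callable that returns a page's HTML or `None`, so the download step can be swapped out. Politeness delays, robots.txt handling and error recovery are deliberately left out.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin

class LinkExtractor(HTMLParser):
    """Collect absolute, fragment-free URLs from <a href="..."> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links and drop any #fragment.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                self.links.append(absolute)

def crawl(start_url, fetch, max_pages=100):
    """Breadth-first crawl; returns a dict of page URL -> outgoing links.

    fetch -- hypothetical callable: url -> HTML string, or None on failure
    """
    seen = {start_url}
    queue = deque([start_url])
    graph = {}
    while queue and len(graph) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue  # page could not be downloaded
        parser = LinkExtractor(url)
        parser.feed(html)
        graph[url] = parser.links
        for link in parser.links:
            if link.startswith(("http://", "https://")) and link not in seen:
                seen.add(link)
                queue.append(link)
    return graph
```

With an in-memory dict standing in for the network, `crawl("http://a.example/", pages.get)` walks the pages in `pages` and returns the link graph, which an indexer or a link-counting step could then consume.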