SunQuest
           Linux Help
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsOperating SystemsLinux Help

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
Generate data entry and reporting .NET Web apps in minutes, straight from your database. Read our FREE whitepaper “Build Web 2.0 Applications Without Hand-Coding” Download now!
  #1  
Old July 30th, 2002, 01:17 AM
pentium5 pentium5 is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Dec 2001
Location: entlegen
Posts: 442 pentium5 User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 7
what is searching engine?

I know different searching method while I was at the university.

is a searching engine store in a huge data base and perform mysql search and move the result to apache server and diplay it with the help of php.


HOW come it can walk in ( crawl ) different sites? what make it jump from one site to the other?


HOW big is goggle in the information that she snatched from the internet?


How many searching engine globally?

Is searching engine equivalent to data-mining?
__________________
regards,

wish to use AT 89c51 single chip computer to do remote sensing and send the data back from woods to office via nokie 3330 and internet. HOW?
first I must revise C program thro' example, what's next?

Last edited by pentium5 : July 30th, 2002 at 01:21 AM.

Reply With Quote
  #2  
Old July 30th, 2002, 02:58 PM
telex4's Avatar
telex4 telex4 is offline
Wacky hack
Dev Shed Novice (500 - 999 posts)
 
Join Date: Apr 2001
Location: London, England
Posts: 512 telex4 User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 h 25 m 29 sec
Reputation Power: 8
I'm not sure what this has to do with GNU/Linux, but...

Major search engines like Google and altavista don't use the "standard" PHP+MySQL solution, they usually use programs written in C running on a large cluster of powerful machines, using various database types as the backend. Smaller search engines, like site-specific ones, probably use something like PHP+MySQL or Perl+MySQL or ASP+Access, etc.

They work by fetching pages they already know exist, finding any links to other pages, and adding those new links to their list of pages to fetch. In this way they "crawl" across the Internet. They then keep limited information on each page (the title, url, some keywords, perhaps a little text from the site). Some engines like Google go so far as to keep a copy of the entire page (cache). You can also often submit URLs to search engines, which will be added to their list of pages to fetch. Some "search engines" actually work more like directories, only indexing pages people submit (and often checking each URL individually) and never re-checking them, or only doing os veyr occasionally. These are really web directories.

Google is immense; I've no idea how much information it holds, but I should think it would amount to many more bytes of hard drive space than I could afford in a lifetime Look on their site, they may say there how large they are. There are hundreds of search engines around the world, and they vary in their efficiency, the size of their databases, the types of sites they index, the way you can search them, and the kinds of results you get back.

Reply With Quote
  #3  
Old July 31st, 2002, 08:58 PM
StealthElephant's Avatar
StealthElephant StealthElephant is offline
Shes dancing (obviously)
Dev Shed Novice (500 - 999 posts)
 
Join Date: Jul 2002
Location: the far side
Posts: 526 StealthElephant User rank is Corporal (100 - 500 Reputation Level)StealthElephant User rank is Corporal (100 - 500 Reputation Level)StealthElephant User rank is Corporal (100 - 500 Reputation Level)StealthElephant User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 2 h 14 m 22 sec
Reputation Power: 8
IBM

google doesnt use a standard relational database, they use the older database "engine" that was forgoten about by the world but IBM kept making it better and better as time goes on, i forget the name but its much much faster for extremely large amount of data!.
a comparison of the 2 types of engines are similiar to comparing linux and windows, 1 is easy to setup and functional for home use, the other is harder but once its up its much better as everyone here knows
__________________

microsofts butterfly is their way off telling u their systems have a **** load of buggs
Advocating Linux Guide
Lesbian Linux
Great & Practical Computer Books

like the links?

Last edited by StealthElephant : July 31st, 2002 at 09:13 PM.

Reply With Quote
  #4  
Old August 7th, 2002, 11:16 PM
Ctb's Avatar
Ctb Ctb is offline
An Ominous Coward
Dev Shed Specialist (4000 - 4499 posts)
 
Join Date: Jan 2002
Posts: 4,425 Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level)Ctb User rank is Colonel (50000 - 60000 Reputation Level) 
Time spent in forums: 3 Weeks 10 h
Reputation Power: 0
I used to have a giant paper about the design and structure of google, how it operates on the web, and how it internally processes all that data. If I can find it, I'll let you know what the database system is called. I don't know how big google is, but they keep a running total of indexed pages at the bottom of the main page, and last I new it was over 2.07 billion.

Reply With Quote
Reply

Viewing: Dev Shed ForumsOperating SystemsLinux Help > what is searching engine?


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump


Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 1 hosted by Hostway