|
|
|||||||||
|
|||||||||
| |||||||||
|
|
|
| |||||||||
![]() |
|
|
«
Previous Thread
|
Next Thread
»
|
Thread Tools | Search this Thread | Rate Thread | Display Modes |
|
|
|
Stop making mediocre tutorials.The best tutorials are video! Camtasia Studio makes it easy to create engaging, buzz-building screen videos at any size, in any popular format. Download the free trial!
|
|
#1
|
||||
|
||||
|
Spidered data: sellable?
I have been writing and running this spider app of mine for a few months and I have fully indexed 50000 web pages, files, and images with it. It has found close to a million links so far for future indexing as well.
It has the full body text of indexed web each page, stripped of all html, thumbnails of most of the images and id3 tags from the mp3s etc etc etc in a 'description' categoy. If the document had a title, that's in the 'title' row, else it's the filename. I have attempted to mark spam, commercial and porn sites as well. The data is good for searching in but it bogs my system down too much to run an actual public search engine on my site. Is there any companies that buy this kind of stuff? I imagine email lists go well but I won't be spidering that.
__________________
Stuff: Regular expression tutorial|JavaScript DOM stuff|What's wrong with your JavaScript?|JSON is neato My projects: African music videos|Free bookmarklets|Obnoxious Facebook app|My bedroom |
|
#2
|
|||
|
|||
|
Just a gentle reminder that you could be infringing on copyright laws on some of the sites you spidered, you should definitely check on this area first before attempting to sell the stuff.
|
|
#3
|
||||
|
||||
|
I think Google beat you to it.
__________________
Raid1 in XP Pro My open source projects: ------------------------ Blobber - Add images as blobs to SQL Server ------------------------ |
![]() |
| Viewing: Dev Shed Forums > Web Site Management > Business Help > Spidered data: sellable? |
| Thread Tools | Search this Thread |
| Display Modes | Rate This Thread |
|
|
|
|