Dev Shed Lounge
 
#1
September 12th, 2002, 06:00 PM
abrandt
Spider - Web Extraction Tools

Hello!

I am looking for a spider or web extraction tool with the following capabilities.

Using KEYWORDS, I would like to accomplish the following without needing a programmer:

* TARGET technology segments of the Internet
* QUERY search engines using KEYWORDS that match company website keywords
* PARSE targeted Web pages
* EXTRACT from targeted websites:
corporate officer names, owners, email addresses, phone numbers, fax numbers, and business profiles
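For anyone who does end up scripting part of this, the PARSE and EXTRACT steps above could be roughly sketched as follows. This is only a minimal illustration using the Python standard library, not a description of any particular tool. The regular expressions are assumptions tuned for simple email addresses and US-style phone numbers such as "415 298-2722", and the names `TextExtractor` and `extract_contacts` are hypothetical.

```python
import re
from html.parser import HTMLParser

# Assumed patterns: deliberately simple, not a complete email or phone grammar.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\(?\b\d{3}\)?[ .-]\d{3}[ .-]\d{4}\b")


class TextExtractor(HTMLParser):
    """Collects the visible text of a page, skipping script and style blocks."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Only keep text that is outside script/style elements.
        if self._skip_depth == 0:
            self.chunks.append(data)


def extract_contacts(html):
    """Return the unique email addresses and phone numbers found in the page text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks)
    return {
        "emails": sorted(set(EMAIL_RE.findall(text))),
        "phones": sorted(set(PHONE_RE.findall(text))),
    }


# Made-up sample page for illustration only.
sample = (
    "<html><head><style>a{color:red}</style></head><body>"
    '<p>Contact: <a href="mailto:info@example.com">info@example.com</a></p>'
    "<p>Phone 415 298-2722</p>"
    '<script>var x = "hidden@example.org";</script>'
    "</body></html>"
)
contacts = extract_contacts(sample)
```

Officer names and business profiles are much harder to pull out reliably than emails or phone numbers, since they have no fixed pattern; a sketch like this would only cover the pattern-based fields.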


Here is an example of raw corporate data in tab-delimited format, which I would like to reproduce with a web extraction tool:

URL COOLBOARD.COM "Coolboard offers free, customized message boards that match the look-and-feel of individual web sites." COOLBOARD.COM San Francisco CA 415 298-2722 Co-Founder and CEO : Josh Duhl / Co-Founder and President/CTO : David Park / Co-Founder and Vice-President of Marketing/Operations : Adrian Fung / Vice President of Sales : Christopher Moore / Director of Business Development : Michael Simon / Director of Product Management : Doug Stotland / Director of Product Development : Chris Butler / Director of Quality Assurance : Deborah Zhirnova " Provides a free, Web-based message board service for small businesses. The company hosts and maintains a customized message board that aggregates content and traffic from other Web sites focused on similar themes. The service allows sites to share messages and traffic with a larger community of Web sites while maintaining control of their content and design. Future services will include site directories, affiliate programs, chat rooms, and auction capabilities. Customers will be small businesses. Competes with Be Seen, Bravenet, and Delphi.com." " Not profitable. Expects to become profitable in 2002. Raised a first round of $1.6 million in July 1999 from Venture Frogs, Draper Richards LP, angel investors, and private investors. CEO Joshua Duhl is former director of product management at Firefly Network. Prior to Firefly Network, Mr. Duhl was a consultant with McKinsey & Company. Mr. Duhl also held a summer associate position at First Analysis Venture Capital. President David Park is former database consultant, Web site designer, and management consultant at McKinsey & Company. Vice President of Operations and Finance Adrian Fung is former consultant at Booz-Allen & Hamilton."
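As a rough sketch of how a record like the one above might be written out in tab-delimited form, the snippet below uses Python's standard `csv` module with a tab delimiter. The column names in `FIELDS` are hypothetical (the sample record does not label its fields), only a few values from the sample are used, and the `to_tsv` helper is an assumption for illustration.

```python
import csv
import io

# Hypothetical column names; the original record does not label its fields.
FIELDS = ["url", "tagline", "city", "state", "phone", "officers"]


def to_tsv(records):
    """Serialize records to tab-delimited text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    for record in records:
        row = dict(record)
        # Officers are joined with " / ", mirroring the sample record.
        row["officers"] = " / ".join(
            f"{title} : {name}" for title, name in record["officers"]
        )
        writer.writerow(row)
    return buffer.getvalue()


# A subset of the values from the sample record above.
record = {
    "url": "COOLBOARD.COM",
    "tagline": "Coolboard offers free, customized message boards that match "
               "the look-and-feel of individual web sites.",
    "city": "San Francisco",
    "state": "CA",
    "phone": "415 298-2722",
    "officers": [
        ("Co-Founder and CEO", "Josh Duhl"),
        ("Vice President of Sales", "Christopher Moore"),
    ],
}
tsv_text = to_tsv([record])
```

A file written this way should open directly in a spreadsheet, with one company per row.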

Here's a Hoovers.com example page: (Yahoo! Inc.) URL

Could someone recommend a software application that can do the above, is user-friendly, and has a short learning curve?

THANK YOU in advance for any help in this matter!

Alan Brandt, SMP
Strategic IT Solutions Corp.
IT Cost Management
50%-70% Savings
