Software Design
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me

The Shed is going Social! Join us on FaceBook and Twitter and chime in on the conversation.

Go Back   Dev Shed ForumsProgramming Languages - MoreSoftware Design

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
  #1  
Old February 9th, 2010, 02:18 PM
davidmellor davidmellor is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jun 2009
Posts: 11 davidmellor User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 4 h 5 m 41 sec
Reputation Power: 0
Web site caching policy and outsourcing

Hi group,
I want to ask about web site caching policies.

I have built a website in C from scratch, and have got to the point where I need a fuzzy search facility. I want to put a job for this on Rent-a-coder, Find-a-guru or script lance but it occurs to me I really need to sort out a policy on caching first. There would be little point in trying to do fuzzy searching through built in MySQL facilities for example, if that’s possible, only to end up with the whole the database contents cached in main memory where it would be far faster to search it there. (my data will be fairly small for quite some time).

I believe that caching web pages, semi or fully formed, rather than caching the raw relational data is more usual.
Also I believe a simple expiry policy is often used rather than directly tracking dependencies between raw data and web pages but that doesn’t sound like an acceptable solution due to inconsistencies in the generated web pages.

Any one want a discussion about caching policies or the pros/cons of the above three contractor websites?

Very best wishes,
David

Reply With Quote
  #2  
Old February 11th, 2010, 06:55 AM
alienkinetics alienkinetics is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Feb 2010
Location: Australia
Posts: 35 alienkinetics User rank is Corporal (100 - 500 Reputation Level)alienkinetics User rank is Corporal (100 - 500 Reputation Level)alienkinetics User rank is Corporal (100 - 500 Reputation Level)alienkinetics User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 9 h 59 m 52 sec
Reputation Power: 5
Sounds like you need a search engine.
Quote:
I believe that caching web pages, semi or fully formed, rather than caching the raw relational data is more usual.

Yep if there is no direct relationship (definable/consistant) between a database and the webpages, then you need to index the fully formed web pages. But, then you have WordPress which does search the database and provides web links, because there is a direct relationship between the DB and the page. Hence, Wordpress's search is live and un-cached.
Quote:
...doesn’t sound like an acceptable solution due to inconsistencies in the generated web pages

Yep, how many times have you searched Google and received a result only to goto a page in question and it hasn't got what you want. This happends a lot on blog/news sites where old content is pushed onto another page, but Google still lists the page originally indexed.

If your "raw relational data" can be searched and URL's can be derived from the result, then I'd do that.

If not, then a spider is required to build a cached index of your website content. Ive coded a number of spiders in the past, but you can use Google's results to get a list of URL's on your site: ie: Google "site:www.microsoft.com"

The reason why a simple expiry policy is used is because by the time you head check (HTTP HEAD) a page and try to compare that with some cached values, you might as well get the entire HTML and re-index it.

So, the question is, can you provide a live spider search? Um, depends on the speed of your server and the amount of traffic you are planning to have I guess.

Those are just my views. Other people would have diferent views based on different experiences.

Reply With Quote
  #3  
Old February 11th, 2010, 10:01 AM
davidmellor davidmellor is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jun 2009
Posts: 11 davidmellor User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 4 h 5 m 41 sec
Reputation Power: 0
Many thanks for your thoughts.

The search I need is not to search the text of existing web pages but simply to search the product names / descriptions and artists names from my database.

I guess there are a few commercial / open source apps available that might impliment this directly on my database if I decided not to use caching.
Any suggestions here would be much appreciated.

I could build a cached page for each category / sub-category page, each artist page and each product page but then need to build cache coherancy logic so that when the relational data changes the corrisponding cached page(s) are invalidated or re-built.

Will MySQL's string matching function do an acceptable job of searching the database if I decided not to use caching?

If I do use caching and assume that at some point, not all the data will fit in main memory then each entry in the database would have an indication of which web cached page(s) must be invalidated when that data item (product or artist name) is altered or removed.

Any thoughts would be much appreciated.
Very best wishes,
David

Reply With Quote
  #4  
Old February 11th, 2010, 03:25 PM
jwdonahue's Avatar
jwdonahue jwdonahue is offline
Bellevue WA, USA
Dev Shed Loyal (3000 - 3499 posts)
 
Join Date: May 2004
Location: Bellevue Washington, USA
Posts: 3,398 jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level)jwdonahue User rank is Lieutenant General (80000 - 90000 Reputation Level) 
Time spent in forums: 3 Weeks 5 Days 6 h 48 m 17 sec
Reputation Power: 886
There isn't a C or derivative language question here so I have requested this thread be moved to the Software Design forum.

Quote:
Originally Posted by davidmellor
I want to put a job for this on Rent-a-coder, Find-a-guru or script lance but it occurs to me I really need to sort out a policy on caching first.


It's always easier for the bidders if they know what it is they are a bidding on and it will increase your odds of finding a good match for the task.
__________________
My worst nightmare was a pointless infinite loop.
Work in progress; don't poke the curmudgeon!
http://www.odonahue.com/

Reply With Quote
  #5  
Old February 12th, 2010, 06:23 AM
davidmellor davidmellor is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jun 2009
Posts: 11 davidmellor User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 4 h 5 m 41 sec
Reputation Power: 0
Sorry. I had actually not intended to post the message in this section. It may be better to delete it so I can repost a revised version elsewhere.... if possible.
David

Reply With Quote
  #6  
Old February 12th, 2010, 06:47 AM
alienkinetics alienkinetics is offline
Contributing User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Feb 2010
Location: Australia
Posts: 35 alienkinetics User rank is Corporal (100 - 500 Reputation Level)alienkinetics User rank is Corporal (100 - 500 Reputation Level)alienkinetics User rank is Corporal (100 - 500 Reputation Level)alienkinetics User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 9 h 59 m 52 sec
Reputation Power: 5
O, its ok david. jwdonahue runs around telling everyone off.

Basiclly what jwdonahue is saying is, if you are planning on paying someone, then you need to provide a job spec or pay someone to help you write a job spec.

Otherwise, you are asking people to contribute help to a private (possable commercial) code base which you arn't willing to post, in which case you are just here to save money.

Hey noone can blame you for wanting to save money, but yes, if you cant do it yourself, then coff up money like everyone else has too

Reply With Quote
  #7  
Old March 30th, 2010, 07:13 AM
RACNicole RACNicole is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Mar 2010
Posts: 2 RACNicole User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 2 m 55 sec
Reputation Power: 0
Hi, this is Nicole from Rent a Coder.

As others have suggested, providing a thorough spec is highly recommended. Fortunately, Rentacoder makes creating a proper spec rather easy with its Bid Request Wizard -- an easy, online question and answer type program.

If you have any questions, please let me know. You can also call in to talk to a facilitator 7 days a week, or email us.

Nicole

Reply With Quote
Reply

Viewing: Dev Shed ForumsProgramming Languages - MoreSoftware Design > Web site caching policy and outsourcing

Developer Shed Advertisers and Affiliates



Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump

Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 


Powered by: vBulletin Version 3.0.5
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.

© 2003-2013 by Developer Shed. All rights reserved. DS Cluster - Follow our Sitemap