Dev Shed Forums > Other > Beginner Programming

#1
February 23rd, 2002, 05:00 PM
nuk
ideas for WWW Mailing List archives

Hello,

Looking for ideas for making a mailing list available via a web page, and it needs to be searchable. Ideally it would be updated pretty frequently, perhaps as often as every quarter hour.

This is a high-volume list, averaging between 2000 and 3000 messages per month, sometimes as high as 200-300 per day.

Eventually, I'd like to add some other related lists, but they are *much* lower volume, perhaps a few hundred to a thousand per month total per list.

I was wondering if something could be set up w/ a SQL database or something to keep things indexed?

I really have no prior experience w/ this sort of thing, but I'm interested in learning what I can.

TIA,

nuk

#2
February 23rd, 2002, 07:25 PM
M.Hirsch
depends on where the messages are stored.
if they are in traditional "usenet" news, you could index them into a database once every 15 minutes (databases provide the fastest search, i think).

if you have plenty of cpu and hard disk power relative to the number of people using the search, just do a full-text search over the text files, newsgroups or whatever.

a search script over 200-300 mails per day should not be a problem for a current server... unless several dozen users run searches at the same time.

can you access the place where the mailing list is stored? then update the index while processing each new post. it will be as up to date as possible...
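
For the plain full-text option, a minimal sketch in PHP, assuming the archive is stored as one message per file in a single directory. The function name `search_messages` is made up for illustration, and nothing here is tuned for speed.

```php
<?php
// Naive full-text search: scan every message file for a phrase.
// Assumes one message per file in a single directory; search_messages()
// is a hypothetical helper, not part of any library.
function search_messages(string $dir, string $needle): array
{
    $hits = [];
    foreach (scandir($dir) ?: [] as $name) {
        $path = $dir . DIRECTORY_SEPARATOR . $name;
        if (!is_file($path)) {
            continue; // skips ".", ".." and subdirectories
        }
        // stripos() is a case-insensitive substring search
        if (stripos(file_get_contents($path), $needle) !== false) {
            $hits[] = $name;
        }
    }
    sort($hits);
    return $hits;
}
```

Every query reads the whole archive, so the cost grows with the number of stored messages. That is the trade-off against keeping an index.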

#3
February 23rd, 2002, 08:44 PM
nuk
The messages are received as part of a mailing list, so I could potentially have them in about any format I really wanted. IIRC, there are 'raw' message archives available for past months/years.

This is kind of an idea that I have running through my head at this stage: whether or not it would be doable on a hosted website w/ say, several hundred megs of space and perhaps PHP/MySQL available.

Right now, most of the existing / previous archives use Perl and something like MHonArc, Glimpse, or ht://Dig, and their maintainers claim that rebuilding the index takes so long that they can only afford (system resource-wise) to re-index once a day, sometimes once a week. I have no idea what other load may have been on those particular machines.

Thanks,

nuk

#4
February 23rd, 2002, 08:49 PM
M.Hirsch
the "format" i am talking about is the way you access the messages. do you get them via email, usenet, text file, ...?
you don't need to rebuild the whole index on every new post. just add an entry for the new message. this may mean you can't use ht://Dig et al. and have to write your own database-driven script.
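
To illustrate the "add to the index instead of rebuilding it" idea, here is a rough sketch of an incremental word index. The names `index_message` and `lookup` are hypothetical, and the index is kept in a PHP array only to keep the example short; a real archive would more likely keep it in a database table.

```php
<?php
// Incremental indexing: each new message only adds its own words to the
// index; nothing that was indexed earlier is touched.
// index_message() and lookup() are hypothetical names for this sketch.
function index_message(array &$index, int $id, string $text): void
{
    // lower-case the text and split on anything that is not a letter or digit
    $words = preg_split('/[^a-z0-9]+/', strtolower($text), -1, PREG_SPLIT_NO_EMPTY);
    foreach (array_unique($words) as $word) {
        $index[$word][] = $id; // append this message to the word's list
    }
}

function lookup(array $index, string $word): array
{
    return $index[strtolower($word)] ?? [];
}

$index = [];
index_message($index, 1, "Fetchmail and procmail setup");
index_message($index, 2, "Procmail recipe question");
print_r(lookup($index, "procmail")); // ids of the messages containing the word
```

The work per new message depends only on the size of that message, not on the size of the archive, which is what makes a 15-minute (or per-message) update cycle plausible.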

#5
February 23rd, 2002, 10:55 PM
nuk
Right now I get the messages via email. They are downloaded from my ISP w/ fetchmail every 5 minutes and processed through a series of procmail recipes, which sort my messages according to mailing list and then put them in a directory; my mail client uses the MH format for handling messages (each message is a separate file). It wouldn't be hard to fork off a copy of each message into a standard mbox file using procmail, though.

Thanks,

nuk

#6
February 24th, 2002, 05:22 AM
M.Hirsch
procmail is fine. it supports piping the mail to a script, which can then load it into e.g. a mysql database. you can set this up in your procmailrc. i would go this way.
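
A sketch of what the receiving end of such a pipe might look like. procmail hands the raw message to the command on standard input, so the script has to split headers from body itself. The helper name `parse_message`, the list address and the script path in the comment are placeholders, and the parsing is deliberately simple (no MIME or encoded-header handling).

```php
<?php
// A hypothetical recipe in ~/.procmailrc that pipes a copy of each list
// message to a script (address and path are placeholders):
//
//   :0 c
//   * ^TO_somelist@example\.org
//   | php /path/to/mail_in.php
//
// parse_message() is a made-up helper for this sketch.
function parse_message(string $raw): array
{
    // headers end at the first blank line
    $parts = preg_split('/\r?\n\r?\n/', $raw, 2);
    $head  = $parts[0];
    $body  = $parts[1] ?? '';
    // unfold continuation lines (lines that start with a space or tab)
    $head = preg_replace('/\r?\n[ \t]+/', ' ', $head);
    $headers = [];
    foreach (preg_split('/\r?\n/', $head) as $line) {
        // skip the mbox-style "From " separator and anything without a colon
        if (strncmp($line, 'From ', 5) === 0 || strpos($line, ':') === false) {
            continue;
        }
        [$name, $value] = explode(':', $line, 2);
        $headers[strtolower(trim($name))] = trim($value);
    }
    return ['headers' => $headers, 'body' => $body];
}

// In the real script the input would come from procmail:
//   $msg = parse_message(file_get_contents('php://stdin'));
$msg = parse_message("From: someone@example.org\nSubject: test\n\nHello list");
echo $msg['headers']['subject'], "\n";
```

Keeping From, Date and Message-ID alongside the subject costs little here and would be handy later for sorting or threading the archive.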

#7
February 24th, 2002, 10:32 AM
nuk
Do you know of anything 'pre-canned' that does this (the web/mysql part) or something similar that I could look at for learning purposes?

Thanks,

nuk

#8
February 24th, 2002, 01:11 PM
M.Hirsch
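sorry, i don't know many pre-made scripts. but i can give you some code to play with:

say php is installed on your system and is available as a command-line executable, not only as an apache module.
procmail pipes the whole email to your script on standard input, like this: | php /root/scripts/mail_in.php
(a command-line script gets no query string, so the script has to pick the subject and body out of the message itself.)

your database "news" has one table "news":
id: int(11) unsigned auto_increment primary key;
subject: text;
body: text;
(text rather than blob, so that LIKE compares case-insensitively.)

then your script should do this:
PHP Code:
$raw = file_get_contents('php://stdin'); // the raw message as delivered by procmail
list($head, $body) = array_pad(preg_split('/\r?\n\r?\n/', $raw, 2), 2, ''); // headers end at the first blank line
$subject = preg_match('/^Subject:[ \t]*(.*)$/mi', $head, $m) ? trim($m[1]) : '';
$db = mysqli_connect(); // add parameters according to manual
mysqli_select_db($db, "news");
$subject = mysqli_real_escape_string($db, $subject);
$body = mysqli_real_escape_string($db, $body); // these 2 for security reasons
$result = mysqli_query($db, "insert into news values(null,'$subject','$body')");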

to insert the mail into a database.
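
A variation on the insert step, as a hedged sketch: a prepared statement lets the driver do the quoting instead of escaping the values by hand. PDO is a standard PHP extension; the helper name `store_message` and the in-memory SQLite database (standing in for MySQL so the example runs without a server) are assumptions made for illustration.

```php
<?php
// Insert with a prepared statement: the values are bound as parameters, so
// quotes in a subject or body cannot break the SQL.
// Assumption: an in-memory SQLite database stands in for MySQL here. For
// MySQL the DSN would start with "mysql:" and take a user name and password.
$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$pdo->exec('CREATE TABLE news (id INTEGER PRIMARY KEY AUTOINCREMENT, subject TEXT, body TEXT)');

// store_message() is a hypothetical helper name.
function store_message(PDO $pdo, string $subject, string $body): int
{
    $stmt = $pdo->prepare('INSERT INTO news (subject, body) VALUES (?, ?)');
    $stmt->execute([$subject, $body]);
    return (int) $pdo->lastInsertId();
}

$id = store_message($pdo, "Re: it's broken", "Quotes like ' and \" need no manual escaping.");
echo "stored message $id\n";
```

The same `prepare`/`execute` calls work unchanged against MySQL through PDO; only the connection line and the table definition would differ.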

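your search.php can look like this:
PHP Code:
if (!isset($_GET['search'])) {
    // no query yet: show the search form (it submits back to this page via GET)
    echo '<form><input type="text" name="search"></form>';
} else {
    $db = mysqli_connect(); // see above
    mysqli_select_db($db, "news"); // see above
    $search = mysqli_real_escape_string($db, $_GET['search']); // see above
    $result = mysqli_query($db, "select * from news where subject LIKE '%$search%' OR body LIKE '%$search%'") or die("sql error.");
    while ($row = mysqli_fetch_array($result)) {
        // escape the text for html so a message cannot inject markup into the page
        echo "News id:" . $row['id'] . " Subject:" . htmlspecialchars($row['subject']) . " Body: " . htmlspecialchars($row['body']) . "<br>";
    }
}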



you probably need to modify a lot though...
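
As one example of such a modification, here is a rough sketch of the search step with a bound parameter, escaped LIKE wildcards and HTML-escaped output. The helper name `search_news`, the sample rows and the in-memory SQLite database (used so the example runs without a MySQL server) are all assumptions for illustration.

```php
<?php
// Search with bound parameters and escaped output.
// Assumption: in-memory SQLite stands in for MySQL; search_news() is a
// hypothetical helper and the two rows are made-up sample data.
$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$pdo->exec('CREATE TABLE news (id INTEGER PRIMARY KEY, subject TEXT, body TEXT)');
$pdo->exec("INSERT INTO news (subject, body) VALUES
    ('procmail recipes', 'how do i sort <b>lists</b>?'),
    ('fetchmail', 'polling every 5 minutes')");

function search_news(PDO $pdo, string $term): array
{
    // escape LIKE wildcards so "%" or "_" typed by the user match literally
    $like = '%' . addcslashes($term, '%_\\') . '%';
    $stmt = $pdo->prepare(
        "SELECT id, subject, body FROM news
         WHERE subject LIKE :s ESCAPE '\\' OR body LIKE :b ESCAPE '\\'
         ORDER BY id"
    );
    $stmt->execute([':s' => $like, ':b' => $like]);
    return $stmt->fetchAll(PDO::FETCH_ASSOC);
}

foreach (search_news($pdo, 'lists') as $row) {
    // htmlspecialchars() keeps message text from being interpreted as HTML
    echo 'News id: ' . $row['id']
        . ' Subject: ' . htmlspecialchars($row['subject'])
        . ' Body: ' . htmlspecialchars($row['body']) . "<br>\n";
}
```

On MySQL the backslash is the default LIKE escape character, so the ESCAPE clauses could be dropped there. A leading-wildcard LIKE still scans every row, so for the volumes mentioned earlier in the thread a real index would eventually be worth the effort.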

Last edited by M.Hirsch : February 24th, 2002 at 01:13 PM.

#9
February 24th, 2002, 01:20 PM
nuk
Thanks for the ideas! It'll probably be a little while before I get up to speed enough to tackle coding the whole thing myself, but I guess it'll give me a direction to head in.

Thanks,

nuk
