Re: hmrc

---------

From: Bill Moseley (moseley@hank.org)
Date: Fri Dec 28 2001 - 22:30:01 CST


At 07:22 PM 12/28/01 -0500, Jim Lill wrote:
>
>I use procmail to split up incoming stuff from a list into 10 sub-cats and
>then let HM makes 10 archives. I do this by subject. I need 10X hmrc files
>to do this although the bulk of the hmrc files is identical. Is there a
>function I'm missing that would allow include of a basic hmrc into the
>by-subject specific hmrc.

Hi Jim,

This is not really related but it solves a similar problem for me:

Just in the last few hours I wrote a perl script to automate hypermail to
some degree. I've got a bunch of lists that I need to archive. The lists
all have mbox files in gzip and text format (ezmlm generated lists) with
each collection of mbox files in their own directory. I'm also adding a
search feature to the archive using swish-e.

The way it works is you setup one config file where [text] marks a section
that defines a list. e.g.:

[Title of This List One]

    list_email = somelist@foo.org
    archive_dir = otherdir
    strip_subject = [Somelist Discussion]
    mbox_dir = /path/to/mbox/files
    hypermail_opts = gmtime=On, showhtml=1
    header_order = List-Post To Cc

[Title of This List Two]
    ...

Running the program like:

  ./mail_archive.pl --create

scans the config file and for each section (list archive) that doesn't
already exist, it creates a directory for the archive, and if mbox_dir
exists, reads in any mbox files and runs hypermail for each one building
the hypermail archive from scratch. Then it indexes the archive for
searching with swish-e.

In .procmailrc, I use:

    MAILDIR=$ARCHIVE_DIR
    : 0w
    | ./mail_archive.pl --mode=update

Then the mail headers are scanned to route it to the correct archive. It
uses the header_order config setting and looks for the address set by
list_email.

Then there's also a crontab entry:

0,10,20,30,40,50 * * * * ./mail_archive.pl --mode=index --chdir $HOME/archives

which reindexes (for swish-e) any archives that have been added to in the
last ten minutes.

For your setup I suppose you could set

   header_order = Subject
   match_string = foo

to match automatically for you. I suppose it would be helpful to make
match_string a regular expression.

It's probably overkill for your setup unless you want to provide a search
engine for the archive, as well.

I'm currently testing this at:

    http://search.apache.org/archives/

But that's only a temporary location.

If interested let me know by private email -- I can always use help testing...

I can also use pointers on how to best configure higher volume lists, and
how best way to purge old messages.

-- 
Bill Moseley
mailto:moseley@hank.org

---------

This archive was generated by hypermail 2.1.5.