owark/archiver/pipelines/actions
Eric van der Vlist 3bcb813cb7 Unpause Heritrix job. 2012-04-22 17:59:39 +02:00
mediatypes First version that can produce a packaged archive. 2012-04-13 19:08:04 +02:00
README.txt Moving action pipelines in their own directory 2012-04-13 10:53:25 +02:00
archive-resource.xpl First version that can produce a packaged archive. 2012-04-13 19:08:04 +02:00
archive-set.xpl Adding a basic skeleton to generate what should ultimately be a WARC archive 2012-04-13 18:01:53 +02:00
crawler-beans-template.cxml Unpause Heritrix job. 2012-04-22 17:59:39 +02:00
cxml.xslt Modifying the way the Heritrix (Spring) config file is generated, since it seems to be picky about whitespace and indentation... 2012-04-22 16:27:16 +02:00
heritrix-archive-set.xpl Unpause Heritrix job. 2012-04-22 17:59:39 +02:00
package-archive.xpl Still work in progress, but the WARC archive now validates with warc-tools' warcvalid.py... 2012-04-15 00:12:29 +02:00
warc-lib.xsl Still work in progress, but the WARC archive now validates with warc-tools' warcvalid.py... 2012-04-15 00:12:29 +02:00

README.txt

Pipelines in this directory are called by the scheduler.

Each pipeline is named after the action it handles.

Inputs:

    * data: the action

Outputs: None

Each pipeline is responsible for removing its action from the queue once it has finished.
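
The contract described here can be sketched outside of XPL. The Python below is only an illustration, not the scheduler's actual code: the in-memory queue, the "id" and "type" fields, and the names make_pipeline and run_scheduler are all hypothetical. It shows a scheduler picking the pipeline whose name matches the action, passing the action as the single "data" input, expecting no output, and relying on the pipeline to remove the action from the queue.

```python
from collections import deque

# Hypothetical in-memory stand-in for the scheduler's action queue.
# The "id" and "type" fields are assumptions made for this sketch.
queue = deque([
    {"id": 1, "type": "archive-set"},
    {"id": 2, "type": "package-archive"},
])

handled = []  # records which pipeline processed which action


def make_pipeline(name):
    """Build a stand-in for the pipeline file <name>.xpl."""
    def pipeline(data, queue):
        # "data" is the action itself, the pipeline's only input.
        handled.append((name, data["id"]))
        # The pipeline, not the scheduler, removes the action when done.
        queue.remove(data)
        # No return value: the pipelines have no output.
    return pipeline


# One pipeline per action name, mirroring "its name is the action's name".
PIPELINES = {name: make_pipeline(name)
             for name in ("archive-set", "package-archive")}


def run_scheduler(queue, pipelines):
    """Dispatch the action at the head of the queue until the queue is empty."""
    while queue:
        action = queue[0]
        # Look the pipeline up by the action's name and hand it the action.
        pipelines[action["type"]](action, queue)


run_scheduler(queue, PIPELINES)
```

In this sketch, a pipeline that forgets to remove its action would leave it at the head of the queue and the scheduler would run it again indefinitely, which is why the removal is part of the contract.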