source: @ f907af8

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @f907af8   8 years vdv Fixing #9
(edit) @5acb101   10 years vdv Rewriting resources with no archived out links
(edit) @4473ad6   10 years vdv Support HTML @background
(edit) @94d3351   10 years vdv Map application/xhtml+xml to .html
(edit) @5e2b674   10 years vdv Store the craw log into the archive
(edit) @c25b18f   10 years vdv Support HTML embed/@src
(edit) @16ef797   10 years vdv Trying to guess content types
(edit) @bc581fa   10 years vdv Adapting relative links to match the structure of the browsable archive
(edit) @bf29805   10 years vdv Cleaning the algorithm to compute friendly local names.
(edit) @cfaf8ae   10 years vdv Adding XSLTUnit tests for the local-name function.
(edit) @a7c3525   10 years vdv Hmmm... HTML should be serialized as HTML, of course!
(edit) @c79bd8e   10 years vdv Forcing HTML content type for XHTML documents
(edit) @9bce34f   10 years vdv Rewriting links in HTML and CSS resources within WARC archives
(edit) @5b162a6   10 years vdv WARC mail extract loop
(edit) @466d447   10 years vdv Generating a resource index to facilitate further processing.
(edit) @675ed04   10 years vdv Download and convert the crawl log
(edit) @6f64c7f   10 years vdv Handling payload content types
(edit) @be1a361   10 years vdv Implementing yet another WARC parser (the heritrix one didn't work well …
(edit) @307b6d2   10 years vdv Adding whois records
(edit) @22c3028   10 years vdv First stab of WARC packaging.
(edit) @51c2058   10 years vdv Queue an action to package the Heritrix WARC.
(edit) @b346236   10 years vdv Adding a mechanism to delay actions in the queue.
(edit) @3bcb813   10 years vdv Unpause Heritrix job.
(edit) @f25a924   10 years vdv Modifying the way the Heritrix (spring) config file is generated since it …
(edit) @a3fa073   10 years vdv Update to follow changes to Orbeon Forms experimental features…
(edit) @a1dc635   10 years vdv Update to follow changes to Orbeon Forms experimental features…
(edit) @57daa70   10 years vdv Now building and launching Heritrix jobs…
(edit) @be2f974   10 years vdv Update to follow changes to Orbeon Forms experimental features…
(edit) @c4c4108   10 years vdv Starting to write pipeline actions that interact with an Heritrix server
(edit) @ad35672   10 years vdv Still work in progress, but the WARC archive now validates with …
(edit) @ba51ddf   10 years vdv Starting to support content lengths in warc archives
(edit) @9d99928   10 years vdv Removing the last action from the queue
(edit) @01a6690   10 years vdv First version that can produce a packaged archive.
(edit) @5ac9ea9   10 years vdv Packaging resources that have not been rewritten…
(edit) @0e7bdd1   10 years vdv Adding a basic squeleton to generate what should ultimately be a WARC …
(edit) @3d18e9d   10 years vdv Adding a mechanism to avoid to archive multiple times the same resource …
(edit) @cf97a98   10 years vdv Fist version supporting CSS rewriting
(edit) @750ccaa   10 years vdv Dummy (passthrough) implementation of the CSS support…
(edit) @16cc943   10 years vdv Refactoring before supporting CSS
(edit) @11027c0   10 years vdv Moving action pipelines in their own directory
(edit) @a0bd1a5   10 years vdv Adding a priority mechanism
(edit) @6b10b3e   10 years vdv Removing an xsl:message.
(edit) @fd2ca8f   10 years vdv Adding timestamps to the archive indexes
(edit) @c71d5b2   10 years vdv Starting to implement a version based on Orbeon's XPL or the archiver.
(edit) @0424eed   10 years vdv Adding credential to the logo
(edit) @bbe3c7f   10 years vdv Logos by Michel Duperrier
(edit) @eef5297   10 years vdv Quick fix for Wikipedia archives issues #6.
(edit) @6332cf6   10 years vdv Adding an empty archives directory in git.
(edit) @1581728   10 years vdv Support wget 1.11 (ticket #5)
(edit) @be2719e   10 years vdv Support wget 1.11 (ticket #5)
(edit) @7543bba   11 years vdv Include wp-admin/includes/plugin.php when needed.
(edit) @1033614   11 years vdv #4 detection of the encoding used in the archives.
(edit) @5a50ccf   11 years vdv #3: supporting other filenames than index.html (enhancement)
(edit) @136dad5   11 years vdv #3: supporting other filenames than index.html
(edit) @dad0250   11 years vdv #2: trying to implement a semaphore with wp_options…
(edit) @30fae5a   11 years vdv Suppression des dernières références à perwarc (ancienne implémentation en …
(edit) @3c55dce   11 years vdv Implementing the archive retrieval using wget.
(edit) @bb2bfba   11 years vdv Checking that we can execute wget (no need of passthru for us).
(edit) @b341683   11 years vdv Checking that we can execute wget.
(edit) @5b7eecf   11 years vdv Checking that we can create an archives sub directory.
(edit) @3fe3b36   11 years vdv Checking that the Broken Link Checker plugin is installed.
(edit) @53f84f8   11 years vdv Adding a mechanism to display admin notices.
(edit) @77850f0   11 years vdv Changing table name in select queries.
(edit) @68b2159   11 years vdv Adding a table creation and update method.
(add) @86d7e64   11 years vdv Initial import of the Wordpress plugin
Note: See TracRevisionLog for help on using the revision log.