README.mdwn


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18

HTTRUTA Feed Crawler Project
============================

Download all links from a feed using httrack. This is the engine behind the
"Cache" feature used by https://links.fluxo.info Semantic Scuttle instance.

Usage
-----

Place this script somewhere and setup a cronjob like this:

`*/5 * * * * /var/sites/cache/httruta/httracker &> /dev/null`

Features
--------

- Integration with http://wkhtmltopdf.org
- Integration with [webkit2png](http://www.paulhammond.org/webkit2png/).