30 4 * * * /home/it2000/createSitemap-daily.sh >/home/it2000/logs/daily.log 2>&1
30 3 * * 0 /home/it2000/createSitemap-weekly.sh >/home/it2000/logs/weekly.log 2>&1
30 2 1 * * /home/it2000/createSitemap-monthly.sh >/home/it2000/logs/monthly.log 2>&1
For all spaces and groups:
daily.sh - threads 0-29 … change frequency: daily
weekly.sh - threads 30-59, docs 0-29 … change frequency: weekly
monthly.sh - threads 60-, docs 30- … change frequency: never
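The split above can be sketched as a small shell function. This is only an illustration: `sitemap_bucket` is a hypothetical name, not part of the actual scripts, and the sketch treats the numbers as an opaque index because the notes do not say what they count.

```shell
# Hypothetical helper: report which script covers a given thread or doc.
# Ranges are copied from the list above; the unit of the index is an assumption-free
# placeholder (the notes do not state what the numbers measure).
sitemap_bucket() {
    kind=$1   # "thread" or "doc"
    n=$2      # non-negative integer index
    case $kind in
        thread)
            if [ "$n" -le 29 ]; then echo daily
            elif [ "$n" -le 59 ]; then echo weekly
            else echo monthly
            fi ;;
        doc)
            # docs never appear in the daily sitemap
            if [ "$n" -le 29 ]; then echo weekly
            else echo monthly
            fi ;;
        *)
            echo "unknown kind: $kind" >&2
            return 1 ;;
    esac
}
```

For example, `sitemap_bucket thread 45` prints `weekly` and `sitemap_bucket doc 45` prints `monthly`.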
~/src … sources
~/conf … configuration file
~/libs … log4j libs
~/logs … log files
TODO: Add http://community.igniterealtime.org/sitemap/sitemap.xml to robots.txt so all spiders will use it.
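The TODO amounts to one `Sitemap:` line in robots.txt. A minimal sketch, assuming the file location is passed as an argument (the real path of robots.txt on the server is not stated in these notes, and `add_sitemap_line` is a hypothetical name):

```shell
# Hypothetical helper: append the Sitemap directive to a robots.txt file
# unless that exact line is already present.
add_sitemap_line() {
    robots=$1
    line='Sitemap: http://community.igniterealtime.org/sitemap/sitemap.xml'
    # -q: quiet, -x: match the whole line, -F: fixed string (no regex)
    if ! grep -qxF "$line" "$robots" 2>/dev/null; then
        printf '%s\n' "$line" >> "$robots"
    fi
}
```

Running it twice against the same file leaves a single `Sitemap:` line, so it should be safe to call from a deployment script. It assumes the existing file ends with a newline.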
Known issue: There will always be a gap (some missing threads) between the monthly, weekly and daily sitemap files when new threads are created. This should not be a problem, as the threads that fall into the gap were most likely indexed before.