Hi all,
I'm serving files for http://tvbrowser.org and want to analyze traffic to the TVB dedicated download site with AWStats (6.7) as good as possible. The (many, about 15,000) downloadable files are mostly *.gz updated once a day via 'rsync –delete ….' (they are changing every day, old ones get deleted, new ones are coming in, existing ones for the next 4 weeks of TV channels are updated often).
Access to them is via HTTP GET requests from the TVB Java application, which identifies itself e.g. as useragent
- TV-Browser 2.2.5 Java/1.4.2_12
- TV-Browser 2.6 Java/1.6.0
- TV-Browser 2.6.3 Java/1.6.0_07
- TV-Browser 2.7 Java/1.6.0_03
depending on the TVB and Java version installed.
Main Problem is: I had to remove the 'java' catchall in "lib/robots.pm", else the statistics were totally wrong: TV-Browser was not considered as a normal browser but instead summed up under 'robots'. But for my site (and some few others) TV-Browser accesses are the main/only important browsers.
Simply adding 'tv\\-browser' in "lib/browsers.pm" (at all neccessary places) did not do the job alone, the 'java' catchall seems to have precedence. I also took a look into "awstats.pl", confess, that I didn't grasp most of it, but it seems to me, that the useragent string is checked againsts the robots.pm before it is checked against the browsers.pm?
So, am I left alone with this workaround? This would be a bad compromise, because AWStats runs on a virtual host serving several other domains … at least for the other domains, the 'java' catchall should be in place!
Hints, anybody?
Second Problem: I am interested to get statistics for the TVB version used (like e.g. for Msie, Firefox or SVN). But the version handling seems to be buried deep inside "awstats.pl" and not configurable at all?
Thanks for your time, the great AWStats software and all helpers here on this faszinating forum!
Best regards,
Georg