| User | Post |
|
11:45 am April 21, 2008
| Vani
Guest
Hi,
One of the top 3 file types in our AWStats report shows as "Unknown". Is there a way to configure AWStats to identify these file types? We use Apache logs and LogFormat is set to 1.
Please help.
Thanks,
- Vani
|
|
|
12:47 pm April 21, 2008
| Jean-Luc
Admin
| posts 254 |
Hi,
If you do not know what this "unknown" file type is, create an extra section that will collect all file types:
ExtraSectionName1="File Types"
ExtraSectionCodeFilter1="200 304"
ExtraSectionCondition1="URL,.*"
ExtraSectionFirstColumnTitle1="File Type"
ExtraSectionFirstColumnValues1="URL,\.([^\.]*)$"
ExtraSectionFirstColumnFormat1="%s"
ExtraSectionStatTypes1=PHBL
ExtraSectionAddSumRow1=1
MaxNbOfExtra1=10
MinHitExtra1=1
Note: the forum software used here tends to convert regular double quotes into typographic ones, so make sure every value is enclosed in regular double quotes once it is in your config file.
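To cross-check what this extra section should collect, the same filter can be approximated outside AWStats. The following is only a rough Python sketch, not AWStats's own logic: it assumes Apache combined log lines, and the function name `tally_extensions` and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Request line, status code and response size of an Apache combined log entry.
LOG_RE = re.compile(r'"(?:[A-Z]+) (\S+)[^"]*" (\d{3}) (\d+|-)')
# Same pattern as in ExtraSectionFirstColumnValues1: the text after the last dot.
EXT_RE = re.compile(r'\.([^\.]*)$')

def tally_extensions(lines, codes=("200", "304")):
    """Count hits and bytes per file extension for the given status codes."""
    hits, size = Counter(), Counter()
    for line in lines:
        m = LOG_RE.search(line)
        if not m or m.group(2) not in codes:
            continue  # unparsable line, or status code outside the filter
        url, _, nbytes = m.groups()
        ext = EXT_RE.search(url)
        key = ext.group(1) if ext else "(no extension)"
        hits[key] += 1
        size[key] += 0 if nbytes == "-" else int(nbytes)
    return hits, size

# Made-up sample lines, for illustration only.
sample = [
    '1.2.3.4 - - [21/Apr/2008:11:45:00 +0000] "GET /img/a.gif HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
    '1.2.3.4 - - [21/Apr/2008:11:45:01 +0000] "GET /download HTTP/1.1" 200 4096 "-" "Mozilla/5.0"',
    '1.2.3.4 - - [21/Apr/2008:11:45:02 +0000] "GET /missing.js HTTP/1.1" 404 0 "-" "Mozilla/5.0"',
]
hits, size = tally_extensions(sample)
for ext, n in hits.most_common():
    print(ext, n, size[ext])
```

Requests without any dot in the URL end up under "(no extension)" here; large downloads served from such URLs are one plausible source of a big "Unknown" bandwidth share.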
|
|
|
11:33 am April 22, 2008
| Vani
Guest
Thank you, Jean-Luc. I am now able to see some additional file types that were not reported originally.
However, I still have this huge chunk of unknown file types, as shown below, and the ones reported in the extra section don't seem to add up to it (even after extrapolating to one month). Any ideas?
Thanks
- Vani
| File type | Description | Hits | Hits % | Bandwidth | Bandwidth % |
|---|---|---|---|---|---|
| gif | Image | 57045511 | 42.9 % | 46.10 GB | 4.8 % |
| js | JavaScript file | 37395090 | 28.1 % | 134.42 GB | 14.2 % |
| Unknown | | 12199138 | 9.1 % | 667.11 GB | 70.6 % |
| png | Image | 11807672 | 8.8 % | 8.90 GB | 0.9 % |
| css | Cascading Style Sheet file | 9493115 | 7.1 % | 51.54 GB | 5.4 % |
| jpg | Image | 4178513 | 3.1 % | 9.40 GB | 0.9 % |
|
|
|
11:51 am April 22, 2008
| Jean-Luc
Admin
There are two possible reasons for that:
1. MaxNbOfExtra1 is too small.
You can try with a larger value.
2. The ExtraSectionFirstColumnValues1 that I suggested does not count the URLs that end with a query string. This should be more general:
ExtraSectionFirstColumnValues1="URL,\.([^\.]*)$||URL,\.([^\.]*)\?"
Let me know if this helps.
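To see why the query string matters, the two alternatives can be tried separately on a few raw request strings. This is a minimal Python sketch under stated assumptions: the sample URLs are invented, the helper name `captured` is hypothetical, and how AWStats itself separates the URL from the query string before matching is not modeled here.

```python
import re

# First alternative: everything after the last dot, up to the end of the string.
END_PATTERN = re.compile(r'\.([^\.]*)$')
# Second alternative: a dot-free run that sits between a dot and a "?".
QUERY_PATTERN = re.compile(r'\.([^\.]*)\?')

def captured(pattern, url):
    """Return the text captured by the pattern, or None if it does not match."""
    m = pattern.search(url)
    return m.group(1) if m else None

# Invented URLs, for illustration only.
for url in ["/css/site.css", "/js/app.js?v=12", "/report.php?file=a.b"]:
    print(url, captured(END_PATTERN, url), captured(QUERY_PATTERN, url))
```

On a string such as `/js/app.js?v=12`, the end-anchored pattern captures `js?v=12` rather than `js`, so every distinct query string would produce its own value, while the second pattern captures just `js`.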
|
|
|
12:21 pm April 22, 2008
| Vani
Guest
The ExtraSectionFirstColumnValues1 is currently set as follows:
ExtraSectionFirstColumnValues1="URL,\.([^\.]*)$"
Also, I noticed that ExtraTrackedRowsLimit defaults to 500.
Should I increase that?
Thanks,
Vani
|
|
|
3:07 pm April 22, 2008
| Jean-Luc
Admin
In your case, the ExtraTrackedRowsLimit of 500 will be okay, as long as you don't have more than 500 different file extensions.
Did you try the ExtraSectionFirstColumnValues1 that I suggested in my last post?
|
|
|
4:10 pm April 22, 2008
| Vani
Guest
I guess the ExtraSectionFirstColumnValues1 value has gotten truncated in the post. Could you please post it again?
ExtraSectionFirstColumnValues1="URL,\.([^\
Thanks
Vani
|
|
|
4:30 pm April 22, 2008
| Jean-Luc
Admin
Yep, it is apparently truncated by Firefox, but the hidden part of the line appears if you select past the visible part.
Here is the part of the line from the = sign onward:
...="URL,\.([^\.]*)$||URL,\.([^\.]*)\?"
|
|
|
12:34 pm April 23, 2008
| Vani
Guest
Thanks Jean-Luc. The above regex captured more file types in the extra section.
My final question: is the extra section reporting in addition to the regular file types listing?
Or does the File Types report include the extra section statistics too?
Thanks
Vani
|
|
|
12:55 pm April 23, 2008
| Jean-Luc
Admin
The regular file types listing only counts the hits from browsers; hits from robots are excluded. The extra section does not distinguish browsers from robots; it includes hits from browsers and from robots. Thus the numbers should be bigger in the extra section.
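The difference between the two listings can be pictured with a small Python sketch. It is an illustration only: the `ROBOT_MARKERS` substrings and the sample requests are hypothetical, and AWStats relies on its own, much larger robots list rather than on this check.

```python
from collections import Counter

# Hypothetical user-agent substrings; AWStats uses its own robots database.
ROBOT_MARKERS = ("googlebot", "slurp", "msnbot")

def is_robot(user_agent):
    """Crude robot test based on user-agent substrings."""
    ua = user_agent.lower()
    return any(marker in ua for marker in ROBOT_MARKERS)

def count_hits(requests):
    """requests: iterable of (extension, user_agent) pairs.

    Returns (browser-only counts, all-traffic counts) per extension,
    mirroring the regular listing and the extra section respectively.
    """
    browsers_only, everything = Counter(), Counter()
    for ext, ua in requests:
        everything[ext] += 1
        if not is_robot(ua):
            browsers_only[ext] += 1
    return browsers_only, everything

# Made-up requests, for illustration only.
requests = [
    ("html", "Mozilla/5.0 (Windows; U) Firefox/2.0"),
    ("html", "Googlebot/2.1 (+http://www.google.com/bot.html)"),
    ("gif", "Mozilla/5.0 (Windows; U) Firefox/2.0"),
]
browsers_only, everything = count_hits(requests)
for ext in everything:
    print(ext, browsers_only[ext], everything[ext])
```

For every extension the all-traffic count is at least as large as the browser-only count, which matches the expectation that the extra section shows the bigger numbers.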
|
|