Changes to wwwstat ================== See the file README for wwwstat usage, licensing, and installation info. If you have any suggestions, bug reports, fixes, or enhancements, send them to the author Roy Fielding at . Known problems Problems? What problems? Things I'd like to do (or would welcome someone doing for me) Easier installation procedure and default user config files. A nice man file containing what is in the README/usage. Version 1.0 patchlevel 1 April 24, 1994 Minor change: new log format uses leading zero in day number field, so that is added to oldlog2new and blanked by space in wwwstat. Version 1.0 April 23, 1994 Now supports the NCSA httpd_1.2 "common" log format. As a result, all attempts to figure out file size are gone and there is no longer any need for all those fstat tests. Code for srm parsing of aliases and scripts has been removed. Basically, the entire log parsing section was rewritten and then placed in a subroutine to allow for multiple logfiles. Bunches of unnecessary backslashes removed from print statements. Time of last update now includes GMT offset instead of full GMT. Tries to estimate size of headers and error messages to account for bytes that are not included in the log entry byte count. Allows perl regular expressions (where possible) in all searches. Allows multiple logfiles to be analyzed in sequence, with any compressed logfiles automatically recognized by their file extension. Removed -f and -z options because they are no longer needed. Added -c option for searching based on server response code. Added the uppercase options -A, -C, -D, -T, and -N which perform the negation of the corresponding lowercase letters, i.e. they force wwwstat to not include any log entries with the given pattern in the address, response code, date, time, or archive name. Version 0.4 (now called oldwwwstat) April 19, 1994 Removed escapes to allow regular expressions in -d and -t searches. Fixed minor bug of outputing instead of . Made use of $startTag and $endTag explicit for report output. Added option to append subdomain info on end of local hosts. Added support for IdentityCheck (rfc931) logfile format. Added output of Totals by Remote Identifier when Do_Ident is requested. Added -r option to select Do_Ident when IdentityCheck is enabled. NOTE: For security reasons, you should not publish to the web any report that lists the Remote Identifiers. This option is intended for server maintenance only. Version 0.3 March 9, 1994 Added links for last server summary, table-of-contents, and a reference to the standard distribution site (all because similar things looked good in Kevin Hughes' getstats output). Automatically determines URL of previous month's summary. Now allows extra spaces on Alias directive lines in srm.conf. Now recognizes Redirect directives and estimates size of message. No longer counts automatically redirected directory names twice -- it estimates size of redirect message and counts that instead. Now uses normal printf's instead of perl forms. Reversed order of printed fields to allow for long names and the ability to read its own output (see the -i option below). Updated the country-codes file to reflect latest standards/spelling. Added the following options (phew!): Display Options: -h Help -- just display the usage message and quit. -e Display all invalid log entries on STDERR; -- this is great for finding trashed log entries for cleaning. -l Do display full IP address of clients in my domain. -L Don't display full IP address of clients in my domain. -o Do display full IP address of clients from other domains. -O Don't display full IP address of clients from other domains. -u Do display IP address from unresolved domain names. -U Don't display IP address from unresolved domain names. -v Verbose display (to STDERR) of each log entry processed; -- useful, but not recommended for long logs. -x Display all requests of nonexistant files to STDERR; -- this is great for finding misadvertized or moved URLs. Input Options: -f Read from the following access_log file instead of the default; -- allows you to read archived (or test) logfiles. -z Use zcat to uncompress the log file while reading [requires -f]; -- allows you to read compressed archive logfiles; use "gzip -9" to get factor of 10 reduction in file sizes. -s Get the server directives from the following srm.conf file; -- allows you to archive the configuration along with the log. -i Include the following file (assumed to be a prior wwwstat output); -- incredibly great, allows you to keep partial summary periods in wwwstat output files and purge the logfile. Inventive admins can find many uses for this, such as being used by scripts to provide fast, up-to-the-minute stats. Search Options (include in summary only those log entries): -a Containing the following "substring" in the IP address. -d Containing the following "substring" in the date. -t Containing the following "substring" in the time. -n Containing the following "substring" in the archive (URL) name. -- allows you to restrict logfile summaries to an area of particular interest; great for custom author summaries; Search strings are matched as substrings, prefix (if string starts with a caret "^"), or suffix (if string ends with "$"). Note that strings containing $ must be enclosed in single quotes for most shell command lines. Version 0.2 January 21, 1994 Added support for the /~username form of files. Added general support for Alias and ScriptAlias configurations. Now reads the server config file to get site configuration. Sped up the process by caching file sizes (fewer file stats). Added options to display full IP addresses in subdomain listing. Expanded some form field sizes. Now sorts archive section by name. Version 0.1 January 14, 1994 Added support for HTML output. Added reversed subdomain statistics. Added the logic for grouping files in archive sections. Rewrote the whole damn thing. Version 0.0 Originally from fwgstat 0.035 (jem@sunsite.unc.edu) with all the extra options stripped out and many bugs fixed. In turn, fwgstat was heavily based on xferstats, which is packaged with the Wuarchive FTP daemon. Fwgstat is for multi-server stats.