How often should websites be cached

In this way you can find out how often your Internet pages have been visited

You can find detailed statistics and log files about how often and by whom your website has been visited in your STRATO customer login. The service is available to you from the PowerWeb Starter hosting package (older packages from BasicWeb XL).


Useful background knowledge:

 

A web server logs every single access to an element (e.g. an HTML file or an image) within your web presence. Failed or denied access (e.g. due to password protection) is also noted. The server writes in a so-called "log file" from which domain the access is made, at what time and which element (e.g. which page or which image) is accessed. Our server also logs which browser was used to access the item.

 

For data protection reasons, the host name or IP address of the client calling your website is anonymized in the log files. Only the entries for the client's host or, if this cannot be determined, the client's IP address are anonymized in the log files. The format of all other entries does not change.

 

The first 9 bits of the IP address of the entry are converted into a hash value. The IP address 123.123.123.123 could become the "anonymous" IP address 123.123.122.243. An anonymized IP address is valid for a maximum of 24 hours. Then the same starting IP address would become another "anonymous" IP address.

Example of host name anonymization:

If there is a host name in the log file, the domain portion is retained. Everything else is replaced with the string "anon-123-123-122-243.domain.tld". The character string "anon" stands for "anonymous". The groups of digits represent the anonymized IP address.

 

Example:

 

out: ascend-tk-p66.rhrz.uni-berlin.de - - [15 / Jan / 2016: 12: 28: 23 +0100] "GET /Urlaub-mit-mir.gif HTTP / 1.1" 200 8032 "http://www.wunschname-1.de/urlaub/ibiza. html "" Mozilla / 5.0 .... etc ....

 

becomes: anon-123-123-122-243.uni-berlin.de - - [15 / Jan / 2016: 12: 28: 23 +0100] "GET /Urlaub-mit-mir.gif.gif HTTP / 1.1" 200 8032 "http://www.wunschname-1.de/urlaub/ ibiza.html ".... etc ...

It can be seen that a server at the University of Berlin accessed the "ibiza.html" page in the "vacation" folder on January 15, 2016 at 12:28 pm with a Mozilla browser version 5.0. However, the information on the IP address that accessed your website remains anonymous.

Example of non-resolvable IP addresses:
If there is an IP address in the log file because no host name could be resolved, a host name is generated according to the following scheme: "anon-123-456-165-41.invalid". Here, too, the string "anon" stands for "anonymous" and the groups of digits stand for the anonymized IP address. The top-level domain ".invalid" is appended because it is never assigned as a real TLD and is intended for such purposes.
out: 123.456.789.001 - - [15 / Jan / 2016: 12: 28: 23 +0100] "GET /Urlaub-mit-mir.gif HTTP / 1.1" 200 8032 "http://www.wunschname-1.de/seite.htm" "Mozilla / 5.0 .... etc ....
becomes: anon-123-456-165-41.invalid - - [15 / Jan / 2016: 12: 28: 23 +0100] "GET /Urlaub-mit-mir.gif HTTP / 1.1" 200 8032 "http://www.wunschname-1.de/seite.htm" "Mozilla / 5.0 .... etc. ....
In this example you can see the information that the call came from a server that is within the IP range 123.456.000.01 to 123.456.255.99. The exact information about which specific IP it was, however, remains anonymous here as well.

Important basic terms
The most important terms related to statistics are Accesses (English Hits) and Page queries (English Page Views).
The number of Hits / hits indicates how often inquiries have been made to your Internet presence.
Each page, but also each image, is counted as one access. So if you have a page with 5 pictures and a stored music (e.g. midi file), 7 accesses (1 page + 5 pictures + 1 music file) are generated. Requests for pages that do not exist or pages for which access has been denied are also reported as hit counted.
The term Page Views however, only describes the page views (i.e. how often HTML and TXT files have been accessed).
The mentioned page with pictures and music files creates only one Page view / page view. Unsuccessful requests are not counted here.
So if you want to compare your own access statistics with others, you should always put the right values ​​in relation to one another. If you hear a very high number from others, say several thousand hits a month, it is very often only about Hits / hits. However, these can only provide limited information about the attractiveness of an Internet presence, as the numbers quickly become very large if many images and frames are used.
If you want to know how intensively your online offer is accepted by the Internet users, it is better to count the number of Page views / page views to use.

Please note that a maximum of 400 days can be displayed.
In general, you have to allow for a certain amount of distortion in the number of hits.
This is mainly caused by so-called proxy servers such as those used for B. be used by online services. Proxy servers are used to temporarily store all requested Internet documents and graphics so that the next request for a page no longer has to be fetched from the Internet, but directly from the provider's local memory (proxy server). This is much faster. However, because the file is no longer requested directly from our server, no access can be logged. So these readers are, so to speak, invisible to you. The "real" number of hits on your website can therefore be significantly higher than the log files suggest.
Conversely, search engine indexing robots distort your statistics in a positive way. Keyword search engines use small programs ("bots") that call up your entire Internet presence and read them into the search engine database. This will give you logged access, but this will not be done by a real reader.

More terms related to retrieval statistics

 

In addition to the basic terms hits and page views mentioned, our retrieval statistics show you a few other variables:

 

Files, (Files)

indicates how many files were successfully retrieved. Every internet page, every picture and every other retrievable file (e.g. midi file, video sequence) is counted as a "file". The example page with 5 pictures and a stored music is counted as 7 files (1 page + 5 pictures + 1 music file).

Each time an Internet page is accessed, the browser first checks (depending on the cache) whether it still has the page in its own cache. If so, it compares whether the document on the server is more up-to-date. Only in this case does it actually pick it up from the server.

 

If it is not more up-to-date, the page is fetched from the cache and does not have to be transmitted over the network. The same process takes place with providers who use proxy servers.

 

The server logs this request, in which the desired file was not transferred because it was still available locally, with the code 304.

Sessions, German Meetings

indicates how many different domains have accessed your Internet presence. If a domain accesses your internet presence again, this will not be counted as a new session within one day.

 

The number of "sessions" allows some conclusions to be drawn about the number of visitors. Because large providers use a proxy server and some visitors access your Internet pages several times within a day, a certain amount of distortion must be taken into account.

 

indicates how much transfer volume the retrieved files caused. An example page with a file size of 14 kilobytes (KB) and 5 images of 10 KB each causes a transfer volume of 64 KB. Our statistics also give you information about:

  • The browser ("Browser type") with which your Internet presence was accessed most frequently.
  • The pages from which a user accessed your pages (so-called "Referrer URL")
  • The domain from which the pages were accessed (so-called "Client domain"). This domain usually specifies the access provider, e.g. AOL
  • About the most frequently accessed pages ("Urls") or elements ("Items", e.g. images, CGI scripts)

The aforementioned data are important information for the further development of your Internet presence. Depending on the distribution of the accessing browser types, it can make sense, for example, to use certain HTML extensions that can only be displayed by individual browser types, or to stop using them.

 

The "Referrer URL"is interesting information about how your visitors became aware of your Internet presence. If there are no major search engines here, this could be an indication for you to check your registration there or to optimize your entry.

 

The "Client domain"Finally, conclusions can be drawn about the countries from which your viewers are accessing your Internet pages.

 

The hit list of the most frequently accessed pages or elements tells you what your viewers particularly like. This feedback shows where there is potential for expansion of your Internet presence and which areas are not so well received and may need to be improved.


A) Use of the evaluated access statistics

Log into the STRATO customer login and select your package. Then select in the left menu statistics under your Package. You will now receive an overview of the most important statistical evaluation parameters in a graphical representation.

 

Further information: How to use the statistics analysis in your hosting package

 



B) Use of the original log files

If you need the unevaluated original access statistics ("log files"), you can download them from the Log files tab. All dates on which your Internet presence was accessed in the last 6 weeks are available.

 

If you only want to evaluate individual domains or subdomains, select the option using the radio button.

 

 

You will now see the browser dialog box known from Downloads, in which you specify what the file should be called and where it should be saved. The storage location and name can be freely selected. However, you should use .gz maintained.

 

In order to view the file afterwards, it must first be decompressed. So that you can download the data faster, the statistics are automatically saved by us in .gzFormat filed. This is a common compression format similar to .zip. In order to be able to read the file, you have to decompress it again with an appropriate unpacking program.

 

Common programs such as 7-Zip can process the file without any problems. If you have not yet installed any of these programs, you will also find links to the corresponding programs below the link to the log file download.

 

In the following we will show you the manual decompression process using 7-Zip. The handling is comparable in other compression programs.

 

If you have installed 7-Zip, you only have to double-click on the log file and the program will open it automatically.

 

 

 

Select the file acces_log_xxx and then choose Unzip. Enter the location you want and choose OK.

 

 

Open the folder in which the log file was unzipped and double-click on the acces_log_xxx file.

 

The unzipped file is a text file and can be opened with any text program. In the dialog window select your text program, for example Notepad or Notepad ++.

 

 

The opened log file could look like this, for example:

 

 

Then save the file as a .TXT file. You can then open the file as usual.