How to reduce the size of log files
The size of access log files can be easily reduced by 95% with pre-filtering and compression

Reducing the size of access log files before sending them makes them much faster to transfer and to process. This can be achieved with compression and pre-filtering.

Tools like "grep google" and "gzip" are your friends.


Compression

On average, compressing text files reduces their size by 85%.

Oncrawl can read multiple compression formats:

  • zip

  • gzip

  • tar (+gz)

  • bzip2 (bz2)

  • 7z (however the 7z ppmd compression format is NOT supported. Please disable this option in 7z if necessary.)

  • xz

However, in some cases, certain compression options within these formats may not be supported.
You can use your favorite compression program to compress access log files before sending them. Please note that RAR files are NOT supported.
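
As an illustration, here is a minimal sketch of compressing a log file with gzip. The file name access.log and the sample log line are hypothetical and only there so that the commands can be run as-is; replace them with your own file.

```shell
# Create a tiny sample log (hypothetical content) so the example runs as-is.
printf '%s\n' \
  '66.249.66.1 - - [10/Oct/2023:13:55:36 +0000] "GET / HTTP/1.1" 200 512 "-" "Googlebot/2.1"' \
  > access.log

# Compress to access.log.gz and keep the original file.
# -9 selects the highest compression level, -c writes to standard output.
gzip -9 -c access.log > access.log.gz

# Check that the compressed file is intact.
gzip -t access.log.gz
```

The original access.log is left untouched, so you can compare the two sizes with ls -l before uploading.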


Pre-filtering

On average, pre-filtering reduces log files by 90%, and sometimes by as much as 99%. Access logs contain a lot of information that is discarded by Oncrawl because it's not relevant for SEO analysis.

Pre-filtering is as easy as keeping only the lines that contain the word "google", in lowercase or uppercase.

If you work on Linux or macOS, this can be achieved with grep, using a case-insensitive match (the -i option).
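
A minimal sketch of this filtering step is shown here. The file names access.log and access.google.log, as well as the two sample log lines, are hypothetical and only serve to make the example runnable.

```shell
# Create a tiny sample log (hypothetical content): one Googlebot hit, one regular visitor.
printf '%s\n' \
  '66.249.66.1 - - [10/Oct/2023:13:55:36 +0000] "GET / HTTP/1.1" 200 512 "-" "Googlebot/2.1"' \
  '203.0.113.7 - - [10/Oct/2023:13:55:40 +0000] "GET /about HTTP/1.1" 200 734 "-" "Mozilla/5.0"' \
  > access.log

# Keep only the lines containing "google", ignoring case (-i).
grep -i 'google' access.log > access.google.log
```

With this sample, only the Googlebot line is written to access.google.log; the other line is dropped.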

Combining both

When compression and pre-filtering are combined, the file size is usually reduced by 95%. That's why we recommend using both.

On Linux or macOS, you can pipe the output of grep into gzip to do both in a single step.
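
One possible way to combine the two steps is sketched here. As before, the file names and the sample log lines are hypothetical and only there so that the commands can be run as-is.

```shell
# Create a tiny sample log (hypothetical content): one Googlebot hit, one regular visitor.
printf '%s\n' \
  '66.249.66.1 - - [10/Oct/2023:13:55:36 +0000] "GET / HTTP/1.1" 200 512 "-" "Googlebot/2.1"' \
  '203.0.113.7 - - [10/Oct/2023:13:55:40 +0000] "GET /about HTTP/1.1" 200 734 "-" "Mozilla/5.0"' \
  > access.log

# Filter first (case-insensitive match on "google"), then compress the result.
# gzip reads the filtered lines from the pipe and writes the compressed file.
grep -i 'google' access.log | gzip -9 > access.google.log.gz
```

Filtering before compressing means gzip only has to process the lines that are kept, and no intermediate uncompressed file is written to disk.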

Happy Uploading

