From f7df467eab2c280c7e97bf2bf342aa09bd2ac40b Mon Sep 17 00:00:00 2001
From: benoit74
Date: Sun, 11 Aug 2024 20:40:59 +0200
Subject: [PATCH] Document capabilities and known limitations

Signed-off-by: benoit74
---
 README.md | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/README.md b/README.md
index ab83ce6..6b770bc 100644
--- a/README.md
+++ b/README.md
@@ -11,6 +11,13 @@ Zimit adheres to openZIM's [Contribution Guidelines](https://github.com/openzim/
 
 Zimit has implemented openZIM's [Python bootstrap, conventions and policies](https://github.com/openzim/_python-bootstrap/docs/Policy.md) **v1.0.1**.
 
+Capabilities and known limitations
+--------------------
+
+While we would like to support as many websites as possible, making an offline archive of any website with a versatile tool obviously has some limitations.
+
+See for instance capabilities and known limitations of warc2zim in its [README](https://github.com/openzim/warc2zim/blob/main/README.md). There are also some limitations in Browsertrix Crawler (used to fetch the website) and wombat (used to properly replay dynamic web requests), but these are not (yet?) clearly documented.
+
 Technical background
 --------------------
 