436 Commits

Author SHA1 Message Date
benoit74
1287351c1d
Upgrade to browsertrix crawler 1.5.6 2025-02-27 19:37:21 +00:00
benoit74
b85a6b7e4e
Merge pull request #480 from openzim/upgrade
Upgrade to Browsertrix Crawler 1.5.5
2025-02-27 08:50:05 +01:00
benoit74
eebc75f868
Pin warc2zim version in preparation for 3.0.2 release 2025-02-27 07:33:34 +00:00
benoit74
00f0e475ae
Upgrade to browsertrix crawler 1.5.5 2025-02-27 07:33:33 +00:00
benoit74
363ff40767
Prepare for 3.0.2 2025-02-24 09:40:04 +00:00
benoit74
dd65902556
Release 3.0.1 (tag: v3.0.1) 2025-02-24 09:37:40 +00:00
benoit74
3e2ad5fede
Merge pull request #476 from openzim/upgrade
Upgrade to browsertrix crawler 1.5.4
2025-02-24 10:34:08 +01:00
benoit74
5e53be6fa4
Pin warc2zim version in preparation for 3.0.1 release 2025-02-24 09:33:44 +00:00
benoit74
1b5b9bb80b
Upgrade to browsertrix crawler 1.5.4 2025-02-24 09:33:44 +00:00
benoit74
bce22ceac1
Prepare for 3.0.1 2025-02-17 10:08:49 +00:00
benoit74
e3cd12b0d1
Release 3.0.0 (tag: v3.0.0) 2025-02-17 10:02:43 +00:00
benoit74
ee0f4c6cec
Use released warc2zim 2.2.2 2025-02-17 09:52:55 +00:00
benoit74
91d5edda4a
Merge pull request #472 from clach04/patch-1
Correct link in README.md
2025-02-15 09:42:49 +01:00
clach04
3eb6c09046
Correct link in README.md
Signed-off-by: clach04 <clach04@gmail.com>
2025-02-14 22:02:17 -08:00
benoit74
a9efec4797
Merge pull request #471 from openzim/fix_browsertrix_args
Enhance support of Browsertrix Crawler arguments
2025-02-14 15:35:32 +01:00
benoit74
2f7a83e187
Fixes following review 2025-02-14 14:28:40 +00:00
benoit74
96c4c3bdfd
Clarify args variables/functions names 2025-02-14 14:28:39 +00:00
benoit74
7bfb4b25f0
Remove confusion between zimit, warc2zim and crawler stats filenames 2025-02-14 14:27:28 +00:00
benoit74
ed1a8a0aa9
Use preferred Browsertrix Crawler arguments and fix multiple/file seeds support 2025-02-14 14:27:26 +00:00
benoit74
dc6b5aafb7
Enhance support of Browsertrix Crawler arguments 2025-02-14 14:23:19 +00:00
benoit74
4f9085b10e
Merge pull request #470 from openzim/keep_tmp_folder
Keep temporary folder when crawler or warc2zim fails, even if not asked for
2025-02-14 09:46:12 +01:00
benoit74
b4ec60f316
fixup! Keep temporary folder when crawler or warc2zim fails, even if not asked for 2025-02-13 15:31:51 +00:00
benoit74
ee82837aaa
Keep temporary folder when crawler or warc2zim fails, even if not asked for 2025-02-13 13:19:23 +00:00
benoit74
bc73193ce0
Merge pull request #469 from openzim/crawler_1_5_3
Upgrade to crawler 1.5.3 and better indicate/handle interruptions
2025-02-13 13:03:27 +01:00
benoit74
101fb71a0b
Better processing of crawler exit codes with soft/hard limits 2025-02-13 10:51:14 +00:00
benoit74
3a7f583a96
Upgrade to Browsertrix Crawler 1.5.3
Include restore of total number of pages, following upstream fix.
2025-02-13 10:44:20 +00:00
benoit74
8b4b18bfb7
Prepare for 2.1.9 2025-02-07 08:59:54 +00:00
benoit74
d228e9f346
Release 2.1.8 (tag: v2.1.8) 2025-02-07 08:57:14 +00:00
benoit74
2e48ea1af6
Merge pull request #466 from openzim/prepare_release
Pin warc2zim for release
2025-02-07 09:50:49 +01:00
benoit74
a7e1026b2e
Pin warc2zim for release 2025-02-07 08:38:20 +00:00
benoit74
cc84848c32
Merge pull request #464 from openzim/upgrade
Upgrade to Browsertrix Crawler 1.5.1
2025-02-07 09:36:24 +01:00
benoit74
6ec53f774f
Upgrade to Browsertrix Crawler 1.5.1 2025-02-07 08:24:27 +00:00
benoit74
5af981c01c
Remove ARM64 job temporarily, still not working 2025-02-07 08:07:23 +00:00
benoit74
b4c0495f48
Fix arm runner selector 2025-02-06 21:19:08 +00:00
benoit74
cea10bd3b5
Add second build job on native arch for ARM64 2025-02-06 21:17:46 +00:00
benoit74
4ef9a0d380
Remove support for ARM64, this is not working anymore and was painfully slow 2025-02-06 21:11:40 +00:00
benoit74
bf0dcd2ffc
Merge pull request #462 from openzim/upgrade_py
Upgrade Python 3.13, Crawler 1.5.0 and others
2025-02-06 14:45:03 +01:00
benoit74
9396cf1ca0
Alter crawl statistics following 1.5.0 release 2025-02-06 13:39:33 +00:00
benoit74
0f136d2f2f
Upgrade Python 3.13, Crawler 1.5.0 and others 2025-02-06 13:39:32 +00:00
benoit74
0cb84f2126
Prepare for 2.1.8 2025-01-10 12:46:51 +00:00
benoit74
4835adbdd7
Prepare for 2.1.8 2025-01-10 12:41:01 +00:00
benoit74
14670d4c69
Release 2.1.7 (tag: v2.1.7) 2025-01-10 10:24:47 +00:00
benoit74
8cddcf0666
Merge pull request #450 from openzim/upgrade_crawler
Upgrade to browsertrix crawler 1.4.2, fix integration tests and fix docker label
2025-01-09 13:45:22 +01:00
benoit74
97ea6dfd7b
Fix Docker label to follow new convention 2025-01-09 10:41:22 +00:00
benoit74
8d42a8dd93
Move integration tests to test website 2025-01-09 10:41:05 +00:00
benoit74
00d2433383
Upgrade to browsertrix crawler 1.4.2 2025-01-09 09:06:08 +00:00
benoit74
b5dac3c309
Merge pull request #434 from openzim/upgrade_crawler
Upgrade to browsertrix crawler 1.4.0-beta.0
2024-11-15 16:49:26 +01:00
benoit74
16a4f8d4d8
Upgrade to browsertrix crawler 1.4.0-beta.0 2024-11-15 15:46:40 +00:00
benoit74
e9adc38856
Merge pull request #430 from openzim/set_return_code
Properly exit with code
2024-11-08 15:40:50 +01:00
benoit74
bfa226bf81
Properly exit with code 2024-11-08 14:22:35 +00:00