Remove trailing characters when retrieving Browsertrix Crawler version

2025-09-22 19:38:32 -04:00 · 2024-03-27 13:18:04 +00:00 · e24479945f
commit e24479945f
parent 3070fe9724
2 changed files with 8 additions and 1 deletions
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -12,6 +12,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 - New `--version` flag to display Zimit version
 - New `--logging` flag to adjust Browsertrix Crawler logging (#273)
 - Use new `--scraper-suffix` flag of warc2zim to enhance ZIM "Scraper" metadata (#275)
+- New `--noMobileDevice` CLI argument

 ### Changed

@@ -21,6 +22,12 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 - Adopt Python bootstrap conventions
 - Removed handling of redirects by zimit, they are handled by browsertrix crawler and detected properly by warc2zim
 - Upgrade to Python 3.12 + upgrade dependencies
+- `--userAgent` CLI argument overrides again the `--userAgentSuffix` and `--adminEmail` values
+- `--userAgent` CLI argument is not mandatory anymore
+
+### Fixed
+
+- Fix support for Youtube videos (#291)

 ## [1.6.3] - 2024-01-18

--- a/src/zimit/zimit.py
+++ b/src/zimit/zimit.py
@@ -370,7 +370,7 @@ def run(raw_args):
    except Exception:
        logger.error("Failed to get Browsertrix crawler version")
        raise
-    crawler_version = crawl.stdout
+    crawler_version = crawl.stdout.strip()
    logger.info(f"Browsertrix crawler: version {crawler_version}")

    # pass a scraper suffix to warc2zim so that both zimit, warc2zim and crawler
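
Note on the `zimit.py` change: text captured from a subprocess usually ends with the newline the child process printed, which would otherwise end up inside the log line and anything built from the version string. The following is a minimal sketch of that behaviour, not the zimit code itself. `get_tool_version` and `fake_version_cmd` are hypothetical names, and the child process is a Python one-liner standing in for the real crawler command, whose exact invocation is not shown in this diff.

```python
import subprocess
import sys


def get_tool_version(cmd: list[str]) -> str:
    """Run `cmd` and return its stdout without surrounding whitespace."""
    result = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return result.stdout.strip()


# Hypothetical stand-in for the crawler's version command:
# a child Python process that prints a version followed by a newline.
fake_version_cmd = [sys.executable, "-c", "print('1.0.0')"]

# Raw capture, as before the fix: the trailing newline is kept.
raw = subprocess.run(
    fake_version_cmd, check=True, capture_output=True, text=True
).stdout

# Stripped capture, as after the fix.
clean = get_tool_version(fake_version_cmd)

print(repr(raw))
print(repr(clean))
```

Comparing the two `repr` outputs shows the difference the one-line fix makes: only the stripped value is safe to embed in a log message or a metadata suffix.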