Remove trailing characters when retrieving Browsertrix Crawler version

This commit is contained in:
benoit74 2024-03-27 13:18:04 +00:00
parent 3070fe9724
commit e24479945f
No known key found for this signature in database
GPG Key ID: B89606434FC7B530
2 changed files with 8 additions and 1 deletions

View File

@ -12,6 +12,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- New `--version` flag to display Zimit version
- New `--logging` flag to adjust Browsertrix Crawler logging (#273)
- Use new `--scraper-suffix` flag of warc2zim to enhance ZIM "Scraper" metadata (#275)
- New `--noMobileDevice` CLI argument
### Changed
@ -21,6 +22,12 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- Adopt Python bootstrap conventions
- Removed handling of redirects by zimit, they are handled by browsertrix crawler and detected properly by warc2zim
- Upgrade to Python 3.12 + upgrade dependencies
- `--userAgent` CLI argument overrides again the `--userAgentSuffix` and `--adminEmail` values
- `--userAgent` CLI arguement is not mandatory anymore
### Fixed
- Fix support for Youtube videos (#291)
## [1.6.3] - 2024-01-18

View File

@ -370,7 +370,7 @@ def run(raw_args):
except Exception:
logger.error("Failed to get Browsertrix crawler version")
raise
crawler_version = crawl.stdout
crawler_version = crawl.stdout.strip()
logger.info(f"Browsertrix crawler: version {crawler_version}")
# pass a scraper suffix to warc2zim so that both zimit, warc2zim and crawler