[PR #60] [CLOSED] Add config option to ignore bad SSL certs when archiving #2556

Closed
opened 2026-03-01 17:59:54 +03:00 by kerem · 0 comments

📋 Pull Request Information

Original PR: https://github.com/ArchiveBox/ArchiveBox/pull/60
Author: @pirate
Created: 1/9/2018
Status: Closed

Base: master ← Head: ssl-check


📝 Commits (2)

  • 57276a7 add option to skip ssl validity check
  • 249d177 Merge branch 'master' into ssl-check

📄 Description

In the modern age of HTTPS everywhere, it's becoming increasingly common for sites to set up SSL badly or forget to renew their certificates (especially older '90s-era sites), rendering the sites inaccessible.
When browsing I always want to be alerted to bad certs, but when archiving, 90% of the time I want it to attempt archiving sites with bad certs anyway in case the content is important. After all, archiving a certificate error page is useless 100% of the time, so instead of throwing our hands up and skipping the site, it's nice to at least attempt getting a snapshot.

TODO:

  • [x] Add option to skip checking SSL certificates for validity
  • [x] Implement fetch_wget cli option to skip cert check
  • [ ] Implement fetch_pdf cli option to skip cert check
  • [ ] Implement fetch_screenshot cli option to skip cert check

The last two steps just require some research into the headless Chrome command-line interface to find the right flags.

More Info:
- https://bugs.chromium.org/p/chromium/issues/detail?id=721739
- headless --ignore-certificate-errors
- https://groups.google.com/a/chromium.org/forum/#!topic/headless-dev/eiudRsYdc3A
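
As a rough illustration of the idea, the sketch below shows how a single setting could toggle certificate checking for both the wget and headless Chrome fetchers. It is not the code from this PR: the option name `CHECK_SSL_VALIDITY`, the helper names `wget_args` and `chrome_args`, the `chromium-browser` binary name, and the other wget flags are all assumptions made for the example. `--no-check-certificate` is the GNU wget flag for skipping certificate validation; `--ignore-certificate-errors` is the Chrome flag mentioned above, and whether headless mode honors it is exactly what the linked Chromium issue discusses.

```python
import os

# Hypothetical option name; the PR text does not say what the setting is called.
CHECK_SSL_VALIDITY = os.environ.get('CHECK_SSL_VALIDITY', 'True').lower() == 'true'


def wget_args(url, check_ssl=CHECK_SSL_VALIDITY):
    """Build a wget command line, optionally skipping certificate validation."""
    cmd = ['wget', '--page-requisites', '--convert-links']
    if not check_ssl:
        # GNU wget flag that disables server certificate verification
        cmd.append('--no-check-certificate')
    return cmd + [url]


def chrome_args(url, output_flag, check_ssl=CHECK_SSL_VALIDITY,
                binary='chromium-browser'):
    """Build a headless Chrome command line for a PDF or screenshot fetch.

    output_flag is e.g. '--print-to-pdf' or '--screenshot'.
    """
    cmd = [binary, '--headless', output_flag]
    if not check_ssl:
        # Flag named in the links above; support in headless mode may
        # depend on the Chrome version.
        cmd.append('--ignore-certificate-errors')
    return cmd + [url]
```

The resulting lists can be passed to `subprocess.run` by whichever fetch function needs them, so the default (checking enabled) stays the safe one and skipping validation is an explicit opt-out.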


🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.
