mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 09:06:02 +03:00
[GH-ISSUE #141] Add link from lifehacker.com destroys index.html due to wrong title detection #96
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#96
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @Strubbl on GitHub (Feb 13, 2019).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/141
Describe the bug
when i archive a page from lifehacker, e.g. https://lifehacker.com/stop-recycling-amazons-plastic-packaging-1832536576
i only see garbage in the log output. here is it in a shortened form:
I would have expected it to be something like this:
So i guess title detection fails for this page.
Steps to reproduce
Steps to reproduce the behavior:
Software versions (please complete the following information):
e6d5cd4432@Strubbl commented on GitHub (Feb 13, 2019):
Yeah, it's because of multiple
@Strubbl commented on GitHub (Feb 13, 2019):
A workaround suggestion could be to remove all content which is between
@pirate commented on GitHub (Feb 19, 2019):
I think I've fixed it, try the latest master (
1b36d5b) and let me know if you have any issues.