mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2026-04-25 17:16:00 +03:00
[GH-ISSUE #1167] Bug: Youtube Download Issues (Videos don't play) #726
Labels
No labels
expected: maybe someday
expected: next release
expected: release after next
expected: unlikely unless contributed
good first ticket
help wanted
pull-request
scope: all users
scope: windows users
size: easy
size: hard
size: medium
size: medium
status: backlog
status: blocked
status: done
status: idea-phase
status: needs followup
status: wip
status: wontfix
touches: API/CLI/Spec
touches: configuration
touches: data/schema/architecture
touches: dependencies/packaging
touches: docs
touches: js
touches: views/replayers/html/css
why: correctness
why: functionality
why: performance
why: security
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ArchiveBox#726
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @taylor-schneider on GitHub (Jun 27, 2023).
Original GitHub issue: https://github.com/ArchiveBox/ArchiveBox/issues/1167
Describe the bug
Downloading a youtube video shows complete with no errors. The problem is that the singlefile.html and the wget related resources do not play the video (it just hangs). And the capture did not grab comments for the video. The media page has a webm file that will play, but the controls on the player do not work (scrolling or drag and drop do not work).
Steps to reproduce
Screenshots or log output
Here are the logs from adding a video
Here are the logs from launching the server, opening the webpage, clicking on the wget page, and clicking play in the resulting page:
ArchiveBox version
@pirate commented on GitHub (Jun 28, 2023):
This is expected behavior, singlefile does not run the video player js / rewrite it to request from your local archive, so most non-native players don't work in singlefile archives, but you can go to the
./archive/<timestmap>/mediafolder to find the webm/mp4/mp3 media files extracted by youtube-dl/yt-dlp.If you need higher fidelity archiving with inline videos working embedded in the archived page, I recommend ArchiveWeb.page / WebRecorder.net instead. (which we'd love to integrate with in the future)
@melyux commented on GitHub (Jul 12, 2023):
It would be quite rad to have the embedded videos actually work on the Wget downloads at least, and Singlefile. Right now the media just sits in the folder but is not accessible.
@pirate commented on GitHub (Jul 18, 2023):
You should be able to click the files in the folder listing displayed in the media area of the snapshot details page and your browser will play the videos. Embedding the video back in the original page is beyond the scope of what I'm willing to implement.
@melyux commented on GitHub (Jul 19, 2023):
I get it. One of the advantages of this project over Web Archive is the lack of space and rate limiting constraints, so it makes it actually possible to archive a page completely, including embedded videos/audios, without relying on outside resources. Could be a cool thing to have
@pirate commented on GitHub (Oct 19, 2023):
Embedded videos do work with my friends projects here, you should check them out if you need that: https://webrecorder.net/
They did an incredible job with high fidelity preservation, so you can even replay YouTube right in the native player interface. Might integrate with their engine someday but for now I'm focusing more on "digesting content" out of pages (slightly easier) instead of preserving it perfectly in its original format (a much harder problem).
sincehigh-water-mark gets set even when indexing fails #1970sincehigh-water-mark gets set even when indexing fails #3482