[PR #1727] Add caddl plugin for 3D/CAD asset extraction #2986

Open
opened 2026-03-01 18:01:19 +03:00 by kerem · 0 comments

Original Pull Request: https://github.com/ArchiveBox/ArchiveBox/pull/1727

State: open
Merged: No


Implements a new plugin to download 3D and CAD assets from web pages.

Features

  • Detects and downloads files with 3D/CAD extensions (.blend, .stl, .obj, .gltf, .glb, .fbx, .vrm, .usdz, etc.)
  • Parses HTML from singlefile/dom extractors to find asset URLs
  • Configurable timeout, max file size, SSL verification, and user agent
  • Uses curl for downloads (already available on most systems)
  • Isolated plugin that doesn't depend on ArchiveBox core
  • Includes tests and UI templates
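
The detection step in the list above can be illustrated with a minimal sketch. This is not the plugin's actual code: the extension set is copied from the formats named in this PR, and the names `ASSET_EXTENSIONS`, `AssetLinkParser`, `is_asset_url`, and `find_asset_urls` are hypothetical. It assumes the saved HTML is available as a string and that matching on the URL path's extension is sufficient.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Assumption: extension list taken from the formats named in this PR.
ASSET_EXTENSIONS = {
    ".blend", ".stl", ".obj", ".gltf", ".glb",
    ".fbx", ".vrm", ".usdz", ".step", ".stp",
}


def is_asset_url(url):
    """Return True if the URL path ends in a known 3D/CAD extension."""
    path = urlparse(url).path  # ignores query string and fragment
    ext = posixpath.splitext(path)[1].lower()
    return ext in ASSET_EXTENSIONS


class AssetLinkParser(HTMLParser):
    """Collects href/src attribute values that point at 3D/CAD files."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.found = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                # Resolve relative links against the page URL.
                url = urljoin(self.base_url, value)
                if is_asset_url(url) and url not in self.found:
                    self.found.append(url)


def find_asset_urls(html, base_url):
    """Return de-duplicated asset URLs in document order."""
    parser = AssetLinkParser(base_url)
    parser.feed(html)
    return parser.found
```

Matching on the parsed path rather than the raw URL means a link such as `/models/chair.STL?dl=1` is still recognized despite the query string and the uppercase extension.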

Fixes #668

Generated with Claude Code


Summary by cubic

Adds a new caddl plugin that detects and downloads 3D/CAD assets from captured pages using Chrome + Puppeteer via the existing browser session. Addresses issue #668 by enabling capture of common 3D formats with simple UI views and configurable limits, including on sites that require a login or captcha.

  • New Features

    • Finds asset links in the DOM (href/src and page text); resolves relative URLs.
    • Supports .blend, .stl, .obj, .gltf/.glb, .fbx, .vrm, .usdz, .step/.stp, and more.
    • Downloads through Chrome CDP with timeout and max size; no curl or extra SSL/cookie config needed.
    • Emits ArchiveResult per file; includes embed, thumbnail, icon, and fullscreen templates.
    • Isolated plugin with tests and filename sanitization; no ArchiveBox core dependency.
  • Bug Fixes

    • Fixes thumbnail layout by adding display:flex so the emoji and label stack and center correctly.

Written for commit e937688be0ae5e8cecbabb70cd6598d40312c687. Summary will update automatically on new commits.
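
The summary mentions filename sanitization without saying how it is done. One plausible approach, sketched here under assumptions, is to derive the local filename from the last URL path segment and restrict it to a conservative character set. The function name `sanitize_filename`, the allowed characters, and the `"asset"` fallback are all hypothetical and are not taken from the plugin.

```python
import posixpath
import re
from urllib.parse import unquote, urlparse


def sanitize_filename(url, fallback="asset"):
    """Derive a safe local filename from a download URL (illustrative only)."""
    # Decode %XX escapes first so encoded slashes cannot smuggle in
    # directory components, then keep only the last path segment.
    name = posixpath.basename(unquote(urlparse(url).path))
    # Replace anything outside a conservative whitelist with "_".
    name = re.sub(r"[^A-Za-z0-9._-]", "_", name)
    # Drop leading dots so the result is never "." or ".." or a hidden file.
    name = name.lstrip(".")
    return name or fallback
```

Decoding before taking the basename is the key design choice: a path like `..%2F..%2Fetc%2Fpasswd` collapses to `passwd` instead of escaping the output directory.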
