[New Service] ArchiveBox for Archiving Web Pages
date
Nov 2, 2021
slug
ArchiveBox-WebArchive
status
Published
summary
Saves web pages in formats such as HTML, PDF, and JPG
tags
service
type
Post
data:image/s3,"s3://crabby-images/ccf9e/ccf9e73d7f84b55c096b7ce1b20f8f5d08e81f3e" alt="notion image"
Summary
Steps
data:image/s3,"s3://crabby-images/5406d/5406dbf232d6d8d1fe722fe3c5c44633afd249ea" alt="notion image"
data:image/s3,"s3://crabby-images/194a9/194a9709c6b7e1f263c06dcfe443887d0a5befd6" alt="notion image"
mkdir -p /data/archivebox && cd /data/archivebox
mkdir data
# The remote Google Drive is already mounted at /data/gd_stanford
mkdir -p /data/gd_stanford/_service/archivebox/data/archive
# Symlink the archive directory on the Google Drive mount into ./data
ln -s /data/gd_stanford/_service/archivebox/data/archive ./data/archive
chown -R 999:999 data && chmod 755 data
curl -O 'https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/master/docker-compose.yml'
vi docker-compose.yml
# Note the port you configure
# Optional: download and edit the Sonic full-text search config
curl -O https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/master/etc/sonic.cfg
vi sonic.cfg
docker-compose run archivebox init --setup
docker-compose run archivebox schedule --every=day --depth=1 https://www.nine.im
docker-compose run archivebox config --set PUBLIC_INDEX=False
docker-compose run archivebox config --set PUBLIC_SNAPSHOTS=False
docker-compose run archivebox config --set PUBLIC_ADD_VIEW=False
# Everyday commands: check status, add URLs, list snapshots
docker-compose run archivebox status
docker-compose run archivebox add https://example.com/some/page
docker-compose run archivebox add --depth=1 ~/Downloads/bookmarks_export.html
docker-compose run archivebox list --sort=timestamp --csv=timestamp,url,is_archived
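The `add` commands above take one URL or one export file at a time. For a plain-text list of URLs, a small wrapper along the following lines may help. This is only a sketch: `clean_urls`, `add_urls`, and the `ARCHIVEBOX_CMD` override are hypothetical names introduced here, not part of ArchiveBox, and it assumes `archivebox add` reads URLs from standard input when run with `docker-compose run -T`.

```shell
#!/bin/sh
# Hypothetical helper: drop blank lines and '#' comment lines from a URL list.
# $1: path to a plain-text file with one URL per line.
clean_urls() {
    grep -v -e '^[[:space:]]*$' -e '^[[:space:]]*#' "$1"
}

# Hypothetical helper: pipe the cleaned list into `archivebox add` over stdin.
# ARCHIVEBOX_CMD is an assumed override hook (not an ArchiveBox setting) so the
# pipeline can be tried without Docker; -T disables the pseudo-TTY so that
# stdin can be piped into the container.
add_urls() {
    clean_urls "$1" | ${ARCHIVEBOX_CMD:-docker-compose run -T archivebox} add
}
```

With a file such as `urls.txt` (a made-up name), `add_urls urls.txt` would be run from /data/archivebox, next to docker-compose.yml.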
data:image/s3,"s3://crabby-images/cc694/cc69463cce81e32059806d48e99bab8afd3fdab2" alt="notion image"
Verification
data:image/s3,"s3://crabby-images/4286e/4286ef3d8b814e73915441cbf1631c975e85f8ba" alt="notion image"
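Besides opening the web UI, it can be worth confirming that the symlink created in the steps still resolves, since a Google Drive mount that has dropped would leave `data/archive` dangling. The following is a minimal sketch; `check_archive_link` is a hypothetical helper name, and the default base path is simply the one used in this post.

```shell
#!/bin/sh
# Hypothetical check: confirm that <base>/data/archive is a symlink that
# resolves to an existing directory (for example on the Google Drive mount).
# $1: base directory, defaulting to the path used in this post.
check_archive_link() {
    base=${1:-/data/archivebox}
    link="$base/data/archive"
    if [ ! -L "$link" ]; then
        echo "not a symlink: $link"
        return 1
    fi
    # -d follows the symlink, so this fails when the target is missing.
    if [ ! -d "$link" ]; then
        echo "dangling symlink: $link -> $(readlink "$link")"
        return 1
    fi
    echo "ok: $link -> $(readlink "$link")"
}
```

Calling `check_archive_link` with no argument checks /data/archivebox; a non-zero exit status means the link is missing or its target is unavailable.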