Command Line Tools
Full-fledged web archiving systems and archiving utilities, right in your terminal.
pywb
Python
pywb is a Python 3 web archiving toolkit for creating and replaying
high-fidelity web archives.
Browsertrix Crawler
TypeScript
Docker
Browsertrix Crawler is a standalone browser-based high-fidelity crawling
system, designed to run a complex, customizable browser-based crawl in a
single Docker container
WARCIT
Python
WARCIT converts local directories into WARCs.
CDXJ Indexer
Python
CDXJ Indexer generates CDXJ (and CDX) indexes from WARC and ARC files.
Libraries
Integrate web archiving functionality into your codebase.
WARCIO
Python
JavaScript
WARCIO is a fast, standalone Python and JavaScript library for
interacting with WARC files.
wabac.js
JavaScript
wabac.js provides a full client-side web archive replay system (or
“wayback machine”).
Wombat
JavaScript
Wombat is a client-side JavaScript-based URL rewriting system.
HAR2WARC
CLI
Python
Convert HAR (HTTP Archive) files to WARCs. Also available as a command
line tool.
Browsertrix Behaviors
JavaScript
A set of automated actions injected into the browser to perform certain
operations on a page, such as scrolling or interacting with elements to
load more content.
AuthSigner
Python
A signing & verifying server for arbitrary data over HTTP.
Labs
Our experiments in web archiving!
OldWeb.Today
Connects emulated web browsers to web archives, allowing users to browse
the old web exactly as it was!
OEmbed.link
Use OEmbed.link to render embeds from social media sites that support
the OEmbed API to a unique URL that can be archived.
Specs
Specifications for building interoperable web archiving tools.
WACZ
Web Archive Collection Zipped (WACZ) is a packaging standard for web
archives on the web.
WACZ Signing and Verification
The mechanics for signing and verifying WACZ files for proof of
authenticity.
Support our sustainability
Webrecorder is committed to releasing quality free and open source software that supports web archiving for all. We strive to build tools that empower individuals creating decentralized archives and aid global cultural institutions in their digital preservation endeavors.
If our goals align with yours, please support the long-term sustainability of Webrecorder tools.