Developer Tools

Utilities, packages, and scripts for building your own web archiving tools.

Command Line Tools

Full-fledged web archiving systems and archiving utilities, right in your terminal.

pywb

Python
pywb is a Python 3 web archiving toolkit for creating and replaying high-fidelity web archives.

Browsertrix Crawler

TypeScript
Docker
Browsertrix Crawler is a standalone, high-fidelity, browser-based crawling system designed to run a complex, customizable crawl in a single Docker container.

WARCIT

Python
WARCIT converts local directories into WARCs.

CDXJ Indexer

Python
CDXJ Indexer generates CDXJ (and CDX) indexes from WARC and ARC files.
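To make the index format concrete, here is a minimal sketch of reading one CDXJ line using only the Python standard library. It assumes the common CDXJ layout of a sort key, a 14-digit timestamp, and a JSON block separated by single spaces. The `parse_cdxj_line` helper, the `CdxjEntry` type, and the sample line are hypothetical illustrations, not part of CDXJ Indexer and not output from a real crawl.

```python
import json
from typing import NamedTuple


class CdxjEntry(NamedTuple):
    key: str        # SURT-style sort key, e.g. "com,example)/"
    timestamp: str  # capture time as YYYYMMDDhhmmss
    fields: dict    # JSON block (url, status, offset, filename, ...)


def parse_cdxj_line(line: str) -> CdxjEntry:
    """Split one CDXJ line into its key, timestamp, and JSON fields."""
    # Split only on the first two spaces; the JSON block may contain spaces.
    key, timestamp, raw_json = line.strip().split(" ", 2)
    return CdxjEntry(key, timestamp, json.loads(raw_json))


# Hypothetical sample line for illustration only.
SAMPLE = (
    'com,example)/ 20240101120000 '
    '{"url": "https://example.com/", "mime": "text/html", "status": "200", '
    '"offset": "0", "length": "1024", "filename": "example.warc.gz"}'
)

entry = parse_cdxj_line(SAMPLE)
print(entry.key, entry.timestamp, entry.fields["filename"])
```

The `offset`, `length`, and `filename` fields are what let a replay tool seek directly to one record inside a WARC file instead of scanning the whole file.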

Libraries

Integrate web archiving functionality into your codebase.

WARCIO

Python
JavaScript
WARCIO is a fast, standalone Python and JavaScript library for interacting with WARC files.

wabac.js

JavaScript
wabac.js provides a full client-side web archive replay system (or “wayback machine”).

Wombat

JavaScript
Wombat is a client-side JavaScript-based URL rewriting system.

HAR2WARC

CLI
Python
HAR2WARC converts HAR (HTTP Archive) files to WARCs. It is also available as a command-line tool.
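As a rough illustration of the input side of such a conversion, the sketch below walks the request/response pairs in a HAR document using only the standard library. It is not HAR2WARC's implementation or API. The `iter_har_exchanges` helper and the sample HAR are hypothetical, and the sketch assumes the usual HAR layout of `log.entries[]` with `request` and `response` objects.

```python
import json
from typing import Iterator


def iter_har_exchanges(har_text: str) -> Iterator[dict]:
    """Yield a small summary for each request/response pair in a HAR file."""
    har = json.loads(har_text)
    for entry in har["log"]["entries"]:
        request = entry["request"]
        response = entry["response"]
        yield {
            "method": request["method"],
            "url": request["url"],
            "status": response["status"],
            # The body's MIME type lives under response.content in HAR.
            "mime": response.get("content", {}).get("mimeType", ""),
        }


# Hypothetical two-entry HAR document for illustration only.
SAMPLE_HAR = json.dumps({
    "log": {
        "version": "1.2",
        "entries": [
            {
                "request": {"method": "GET", "url": "https://example.com/"},
                "response": {"status": 200, "content": {"mimeType": "text/html"}},
            },
            {
                "request": {"method": "GET", "url": "https://example.com/logo.png"},
                "response": {"status": 404, "content": {}},
            },
        ],
    }
})

for exchange in iter_har_exchanges(SAMPLE_HAR):
    print(exchange["status"], exchange["method"], exchange["url"])
```

A converter would turn each such pair into WARC request and response records; that step is omitted here.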

Browsertrix Behaviors

JavaScript
A set of automated actions injected into the browser to perform certain operations on a page, such as scrolling or interacting with elements to load more content.

AuthSigner

Python
AuthSigner is a server for signing and verifying arbitrary data over HTTP.

Labs

Our experiments in web archiving!

OldWeb.Today

Connects emulated web browsers to web archives, allowing users to browse the old web exactly as it was!

OEmbed.link

Use OEmbed.link to render embeds from social media sites that support the OEmbed API to a unique URL that can be archived.

Specs

Specifications for building interoperable web archiving tools.

WACZ

Web Archive Collection Zipped (WACZ) is a packaging standard for web archives on the web.
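Because a WACZ file is a ZIP archive, its layout can be inspected with any ZIP library. The sketch below builds a tiny in-memory stand-in and groups its entries by top-level directory. It assumes the commonly used layout of `archive/`, `indexes/`, and `pages/` directories plus a root `datapackage.json`. The file contents are empty or simplified placeholders, and both helper functions are hypothetical, so the result is not a valid WACZ.

```python
import io
import json
import zipfile


def build_demo_wacz() -> bytes:
    """Create an in-memory ZIP that mimics a WACZ layout (placeholder data)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_STORED) as zf:
        zf.writestr("archive/data.warc.gz", b"")  # placeholder, not a real WARC
        zf.writestr("indexes/index.cdx", b"")     # placeholder index
        zf.writestr("pages/pages.jsonl", b"")     # placeholder page list
        # Simplified manifest; a real one describes every file in the package.
        zf.writestr("datapackage.json", json.dumps({"resources": []}))
    return buf.getvalue()


def group_wacz_entries(data: bytes) -> dict:
    """Group ZIP entry names by their top-level directory."""
    groups = {}
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        for name in zf.namelist():
            top, sep, _rest = name.partition("/")
            # Entries without a directory part are filed under "(root)".
            groups.setdefault(top if sep else "(root)", []).append(name)
    return groups


for directory, names in group_wacz_entries(build_demo_wacz()).items():
    print(directory, names)
```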

WACZ Signing and Verification

The mechanics for signing and verifying WACZ files for proof of authenticity.
