Blog

What’s new in the world of web archiving?

An icon representation of Deduplication

Reduce Storage with Crawl Deduplication

Product

A new feature to save you storage space.

By Emma Segal-Grossman and Tessa Walsh
The Browsertrix logo with versions 1.19 through 1.21 displayed next to it

Execution Time Addons, Robots.txt, Profile Refreshes, Custom Schedules, and More

Product

An overview of exciting new features from Browsertrix 1.19, 1.20, and 1.21.

By Tessa Walsh
The Browsertrix logo with version 1.18 displayed next to it, overlaid on an email icon and a file icon

Browsertrix 1.18: Large URL Lists and Beautiful Emails

Product

Browsertrix 1.18 brings support for large URL lists, new email templates, and UX improvements for crawling and curating.

By Emma Segal-Grossman
The Browsertrix logo with version 1.17 displayed next to it, overlaid on a pause and resume button

Browsertrix 1.17: Crawl Pause/Resume and Lower Numbers of Browser Windows

Product

Crawl pause/resume and lower number of browser windows

By Tessa Walsh
Screenshot of the Page Behavior crawl workflow section in Browsertrix

Create, Use, and Automate Actions With Custom Behaviors in Browsertrix

Product

It is now easier than ever to automate custom page actions in Browsertrix.

By Tessa Walsh
A screenshot of the home page of https://govarchive.us/ listing available collections

Introducing GovArchive.us & Mirroring Entire Sites with Web Archives

Product

Introducing GovArchive.us and tooling to mirror web sites using web archives.

By Ilya Kreymer
The words “Public Collections” sandwiched between a grid of website thumbnails on a glowing dark blue background

Introducing Public Collections in Browsertrix

Product

Now you can curate, personalize, and share all your crawls in one place.

By Webrecorder Team
The Browsertrix logo with the text “1.13”, atop a blue glowing background and a darkened globe icon

Browsertrix 1.13: The Translations and Internationalization Release

Product

¡Browsertrix para todos! With your help, we’re translating Browsertrix into new languages.

By Ilya Kreymer, Emma Segal-Grossman, Sua Yoo, and Clara Itzel
A screenshot of the crawler proxy server dropdown menu in the Browsertrix crawl workflow editor

Browsertrix 1.12: Proxies, Crawling Defaults, and Simplified Workflow Creation

Product

Proxies, crawling defaults, and simplified workflow creation!

By Tessa Walsh and Emma Segal-Grossman
A screenshot of the archived item list dropdown menu with a new option titled "Download Item" selected

Browsertrix 1.11: Self Sign-Up, QA Improvements, Easier Downloading and new APIs

Product

Self sign-up, easier downloads, and better crawl analysis stats!

By Emma Segal-Grossman, Henry Wilkinson, Tessa Walsh, and Ilya Kreymer
A screenshot of Browsertrix's Quality Assurance tab with an analysis run in progress

Browsertrix 1.10: Now with Assistive QA!

Product

Tired of visiting every single page in your web archive to ensure it was captured properly? So are we!

By Henry Wilkinson

ReplayWeb.page 2.0

Product

New branding, adblock for embeds, code refactor, and so much more!

By Henry Wilkinson and Ilya Kreymer

Browsertrix 1.9

Product

Browsertrix has had some big improvements since our last blog post. Let's take a look at some of the more recent ones!

By Henry Wilkinson

An update on the WACZ format

Product

It has been over two years since we first introduced the WACZ format, and I wanted to give a brief update on exciting new tools and integrations of WACZ, and also provide a glimpse of what's next in the evolution of the format.

By Ilya Kreymer

Announcing pywb 2.7.0 release

Product

We are excited to announce the release of pywb 2.7, with a new interactive banner and calendar interface!

By Tessa Walsh and Ilya Kreymer