A deep dive into the craft of building data compressors

Summary
Dmitri Pavlichin (Amazon)
Packard 202; Zoom
Date(s): Oct 14
Content

Zoom ID: 92716427348; Passcode: 032264

Abstract

General-purpose compressors like Gzip and Zstandard perform well on many kinds of data and are standard tools of the computing trade. We can often do better with algorithms specialized for particular data domains, such as genomic or numeric data, but building new compressors tends to be ad hoc and requires both domain expertise and familiarity with compression. This talk dives into the craft of building new data compressors, focusing on examples from tabular and genomic datasets.

Bio

Dmitri Pavlichin is an applied scientist at Amazon. In an earlier life, he did research in information theory and bioinformatics as a postdoc with Tsachy Weissman at Stanford and co-founded a data compression startup.