Creating a Faster TAR Extractor

Michael Xu · Jan. 26, 2022, 9:03 p.m.
Tarballs are used industry-wide for packaging and distributing files, and Databricks is no exception. Every day we launch millions of VMs across multiple cloud providers. One of the first steps on each of these VMs is extracting a fairly sizable tar.lz4 file containing a specific Apache Spark™ runtime. As part of an...

The post Creating a Faster TAR Extractor appeared first on Databricks.