Helios: hyperscale indexing for the cloud & edge (part II)

1 · Adrian Colyer · Nov. 2, 2020, 5 a.m.
Helios: hyperscale indexing for the cloud & edge, Potharaju et al., PVLDB’20 Last time out we looked at the motivations for a new reference blueprint for large-scale data processing, as embodied by Helios. Today we’re going to dive into the details of Helios itself. As a reminder: Helios is a distributed, highly-scalable system used at Microsoft for flexible ingestion, indexing, and aggregation of large streams of real-time data that is designed to plug into relationals engines. The system coll...