Machine Learning for a Better Developer Experience

1 · Netflix, Inc. · July 21, 2020, 3:04 a.m.
Summary
Stanislav Kirdey, William HighImagine having to go through 2.5GB of log entries from a failed software build — 3 million lines — to search for a bug or a regression that happened on line 1M. It’s probably not even doable manually! However, one smart approach to make it tractable might be to diff the lines against a recent successful build, with the hope that the bug produces unusual lines in the logs.Standard md5 diff would run quickly but still produce at least hundreds of thousands candidate l...