Scalability Improvement of Apache Impala 2.12.0 in CDH 5.15.0

1 · Cloudera · Jan. 30, 2019, 10:24 p.m.
Key Takeaways We have significantly improved Impala in CDH 5.15.0 to address some of the scalability bottlenecks in query execution. 64 concurrent streams of TPC-DS queries at 10TB scale in a 135-node cluster now run at 6x query throughput compared to previous releases. In addition to running faster, the query success rate also improved from 73% to 100%. Overall, Impala in CDH 5.15.0 provides massive improvements in throughput and reliability while reducing the resource usage significantly. Rea...