Data Science & Engineering Platform: Data Lineage and Provenance for Apache Spark

1 · Cloudera · Dec. 11, 2018, midnight
This blog post was published on Hortonworks.com before the merger with Cloudera. Some links, resources, or references may no longer be accurate. This is the third in a series of data engineering blogs that we plan to publish. The first blog outlined the data science and data engineering capabilities of Hortonworks Data Platform. Motivation Apache […] The post Data Science & Engineering Platform: Data Lineage and Provenance for Apache Spark appeared first on Cloudera Blog....