The Data Cards Playbook: A Toolkit for Transparency in Dataset Documentation

1 · Google AI Research · Nov. 17, 2022, 6:32 p.m.
Summary
Posted by Mahima Pushkarna, Senior Interaction Designer, and Andrew Zaldivar, Senior Developer Relations Engineer, Google Research As machine learning (ML) research moves toward large-scale models capable of numerous downstream tasks, a shared understanding of a dataset’s origin, development, intent, and evolution becomes increasingly important for the responsible and informed development of ML models. However, knowledge about datasets, including use and implementations, is often distributed ac...