Cross-Modal Contrastive Learning for Text-to-Image Generation

2 · Google AI Research · May 26, 2021, 7:05 p.m.
Posted by Han Zhang, Research Scientist and Jing Yu Koh, Software Engineer, Google Research Automatic text-to-image synthesis, in which a model is trained to generate images from text descriptions alone, is a challenging task that has recently received significant attention. Its study provides rich insights into how machine learning (ML) models capture visual attributes and relate them to text. Compared to other kinds of inputs to guide image creation, such as sketches, object masks or mouse tra...