>
Description: Imagen 4 is a latent diffusion model that generates high-quality images from text prompts. It performs well in photorealistic composition settings. Compared to previous Imagen models, it offers improved spelling and typography, better instruction following, and richer colors, textures and details.
Inputs: Natural-language text strings (e.g. instructions for creating a synthetic image from a visual description) or image files.
Outputs: High-quality images generated in response to text and image inputs.
Architecture: Imagen 4 uses latent diffusion, the de facto standard approach for modern image and video models, which achieves high-quality results in generative media applications.
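For readers unfamiliar with the term, the following is a minimal, generic sketch of what latent diffusion sampling looks like: noise is iteratively denoised in a small latent space under some conditioning signal, and the final latent is decoded to pixels. It is not Imagen 4's implementation. The names `toy_denoiser`, `toy_decoder`, `make_schedule`, and `sample`, the linear noise schedule, the deterministic DDIM-style update, and the 8x8 latent size are all assumptions made for illustration; in a real system the denoiser and decoder would be trained neural networks and the conditioning would come from a text encoder.

```python
import numpy as np


def make_schedule(num_steps: int) -> np.ndarray:
    """Cumulative signal-retention factors (alpha_bar) for a linear beta schedule."""
    betas = np.linspace(1e-4, 0.02, num_steps)
    return np.cumprod(1.0 - betas)


def toy_denoiser(z: np.ndarray, t: int, conditioning: np.ndarray) -> np.ndarray:
    """Hypothetical stand-in for a learned noise predictor.

    A real model would be a neural network taking the timestep t and encoded
    prompt; this toy version ignores t and treats the offset from the
    conditioning array as the "noise" estimate.
    """
    return 0.1 * (z - conditioning)


def toy_decoder(z: np.ndarray) -> np.ndarray:
    """Hypothetical stand-in for a latent-to-pixel decoder.

    Upsamples the latent 8x by repetition and squashes values into [0, 1].
    """
    upsampled = np.kron(z, np.ones((8, 8)))
    return np.clip(0.5 + 0.5 * np.tanh(upsampled), 0.0, 1.0)


def sample(conditioning: np.ndarray, num_steps: int = 50, seed: int = 0) -> np.ndarray:
    """Run a deterministic DDIM-style reverse process in latent space, then decode."""
    rng = np.random.default_rng(seed)
    alpha_bar = make_schedule(num_steps)

    # Start from pure Gaussian noise in the (small) latent space.
    z = rng.standard_normal(conditioning.shape)

    for t in range(num_steps - 1, -1, -1):
        eps = toy_denoiser(z, t, conditioning)
        # Estimate the clean latent implied by the current noise prediction.
        z0_hat = (z - np.sqrt(1.0 - alpha_bar[t]) * eps) / np.sqrt(alpha_bar[t])
        # Step to the previous (less noisy) timestep; the last step has no noise.
        ab_prev = alpha_bar[t - 1] if t > 0 else 1.0
        z = np.sqrt(ab_prev) * z0_hat + np.sqrt(1.0 - ab_prev) * eps

    # Only the final latent is decoded to pixel space.
    return toy_decoder(z)


if __name__ == "__main__":
    conditioning = np.zeros((8, 8))  # placeholder for an encoded prompt
    image = sample(conditioning)
    print(image.shape)
```

The point of working in a latent space is that the iterative loop runs on a much smaller array (here 8x8) than the final image (here 64x64), with the decoder applied only once at the end.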