AI that creates net-new content (images, audio, video) rather than just classifying or predicting.
- Diffusion Models: The technology behind Midjourney and DALL-E. They learn to generate images by reversing a process of adding noise to data.
- GANs (Generative Adversarial Networks): Two networks (a generator and a discriminator) compete to create highly realistic synthetic data.
- Multimodality: Modern models seamlessly blend text, vision, and audio in a single architecture.