Meet new Amazon Nova AI models that help build highly reliable AI agents

Nova 2 Omni is a unified multimodal reasoning and generation model that processes text, image, video, and speech inputs and generates both text and images, an industry first. It handles up to 750,000 words, hours of audio, long videos, and hundred-page documents, so it can analyze entire product catalogs, testimonials, brand guidelines, and video libraries at once. This eliminates the cost and complexity of connecting multiple specialized models. For example, marketing teams can analyze product details across all formats and instantly generate complete campaigns, including headlines, copy, social posts, and visuals, in one workflow. Although no other model in the industry is directly comparable to Nova 2 Omni, it demonstrates strengths in public benchmarks of multimodal reasoning on documents, images, videos, and audio, and it can generate high-quality images similar to those from other leading image-generation models.



