close
close

Amazon announces Nova, a new family of multimodal AI models

Amazon announces Nova, a new family of multimodal AI models

At its re:Invent conference on Tuesday, Amazon Web Services (AWS), Amazon’s cloud computing division, announced a new family of multimodal generative AI models that it calls Nova.

There are four text-generating models in total: Micro, Lite, Pro and Premier. Micro, Lite and Pro will be available to AWS customers on Tuesday, while Premier will arrive in early 2025, Amazon CEO Andy Jassy said on stage.

In addition, there is an image generation model, Nova Canvas, and a video generation model, Nova Reel. Both also launched on AWS this morning.

“We have continued to work on our own boundary models,” Jassy said, “and those boundary models have made tremendous progress in the last four to five months. And we thought if we got value out of it, you would probably get value out of it.”

The text-generating Nova models, which are optimized for 15 languages ​​(but mainly English), vary greatly in size and performance.

Micro can only record and output text, but offers the lowest latency of all – it processes text and generates responses the fastest.

Lite can process image, video and text input reasonably quickly. Pro offers a balanced combination of accuracy, speed and cost for a range of tasks. And Premier is the most powerful and designed for complex workloads.

Like Lite, Pro and Premier can analyze text, images and videos. All three are good for tasks like digesting documents and summarizing charts, meetings, and diagrams. However, AWS positions Premier as a “teacher” model for building fine-tuned custom models, rather than a model that can be used alone.

Micro has a context window with 128,000 tokens, meaning it can handle up to around 100,000 words. Lite and Pro have context windows with 300,000 tokens, which is equivalent to about 225,000 words, 15,000 lines of computer code, or 30 minutes of footage.

In early 2025, certain Nova models’ context windows will be expanded to support over 2 million tokens, AWS says.

Jassy claims the Nova models are among the fastest in their class – and among the most cost-effective to run. They are available in AWS Bedrock, Amazon’s AI development platform, where they can be tuned to text, images, and videos and distilled for improved speed and greater efficiency.

“We optimized these models to work with proprietary systems and APIs, making it much easier for you to perform multiple orchestrated automated steps – agent behavior – with these models,” Jassy added. “So I think these are very compelling.”

Leave a Reply

Your email address will not be published. Required fields are marked *