This AI Tool Aids Enterprise Workflows by Analyzing Images

Palmyra-Vision from AI startup Writer can check an ad to see if it is compliant with regulations, among other use cases

Ben Wodecki, Jr. Editor

February 28, 2024

At a Glance

  • Writer launches new multimodal AI model to provide image-based insights for businesses.

Generative AI startup Writer made waves in January after its Palmyra X V3 outpaced Google's PaLM 2. Now, it has unveiled a multimodal model capable of generating text based on images.

Palmyra-Vision is a multimodal model that can analyze images and generate content based on the objects and visuals in the images. It can extract handwritten text, classify objects and examine graphs and charts.

The model is aimed at enterprise customers. The startup said that customers have routinely asked for image-to-text capabilities to support their workflows.

Enterprises could use Palmyra-Vision, for example, to extract information from charts or to help create alt text descriptions. The model can also generate new content based on visuals, such as suggesting compliant copy for marketing images.

The model can also answer specific questions about an image, so users could, for instance, ask whether an advertisement meets particular legal or regulatory requirements.

Palmyra-Vision was evaluated on VQAv2, a dataset of open-ended questions about more than 265,000 images that requires an understanding of vision, language, and common-sense knowledge. The model scored 84.4%, outperforming both OpenAI’s GPT-4V and Gemini 1.0 Ultra from Google.

Palmyra-Vision can be accessed via Writer’s image analyzer app, which can be found in the startup’s library of prebuilt apps. The startup can also build custom apps to fit Palmyra-Vision to specific enterprise use cases.

Writer’s multimodal unveiling follows recent updates to its text generation models that added multilingual support for 30 languages, including Spanish, French and Chinese, alongside English.

Last September, Writer raised $100 million in a Series B funding round that included participation by clients Accenture and Vanguard.

About the Author(s)

Ben Wodecki is the Jr. Editor of AI Business, covering a wide range of AI content. Ben joined the team in March 2021 as assistant editor and was promoted to Jr. Editor. He has written for The New Statesman, Intellectual Property Magazine, and The Telegraph India, among others. He holds an MSc in Digital Journalism from Middlesex University.
