Salesforce Developers Blog

How to Build a Multimodal Agent in Salesforce

Avatar for Charles WatkinsCharles Watkins
Unlock multimodal agentic workflows with Prompt Builder and Agentforce.
How to Build a Multimodal Agent in Salesforce
July 10, 2025
Listen to this article
0:00 / 0:00

Large language models (LLMs) excel at understanding and acting on written language. When combined with an agentic layer like Agentforce, they can take action on a user’s behalf. Until recently, they have been limited to text. But with the rise of multimodal LLMs, you can create agents that consider images and files when answering a question or building a plan.

In our latest Agentforce Decoded video, you’ll learn how to build a general-purpose action that enables your agents to use images and PDFs as context. We’ll walk you through a practical example — parsing receipts to create expense reports — and show you how this capability opens up new possibilities for your agentic workflows. You can use this same approach to allow your agents to troubleshoot issues from screenshots, parse contracts, or analyze documents at runtime.

To learn more about building multimodal workflows and agent actions, check out the resources below.

Resources

About the author

Charles Watkins is a Lead Developer Advocate at Salesforce. You can follow him on LinkedIn and GitHub.

More Blog Posts

Create Smarter Conversations in MuleSoft with Einstein AI

Create Smarter Conversations in MuleSoft with Einstein AI

Streamline AI interactions within your integration flows by using the Chat Generate From Messages connector in MuleSoft.August 07, 2025

Agentify Your Website Content with the Data Cloud Web Content (Crawler) Connector

Agentify Your Website Content with the Data Cloud Web Content (Crawler) Connector

Add a conversational agent to your website that’s grounded with your website's content using the Web Content (Crawler) connector for Data Cloud.August 14, 2025

Integrate Data Cloud's Document AI with Agentforce

Integrate Data Cloud's Document AI with Agentforce

Extending Data Cloud's Document AI with Agentforce opens up new possibilities for automating and streamlining your data processing workflows.September 11, 2025