MuleSoft Direct Connector Reference (Beta)
The MuleSoft Direct connector supports various external sources and file formats for ingestion into Data Cloud.
The external sources are provided as MuleSoft application assets published to MuleSoft Anypoint Exchange. Each external source supports these file formats for ingestion.
Source | Supported Content Types | Ingested format |
---|---|---|
Confluence | Pages | HTML |
Google Drive | TXT, PDF, HTML, ASPX, Log, Google Docs |
|
Sharepoint | TXT, PDF, HTML, ASPX, Log | Same as source |
Sitemap | XML | Same as source |
Source | Credentials |
---|---|
Confluence |
|
Google Drive |
|
Sharepoint |
|
Sitemap | Access for public sitemaps that don't require authentication only |
When you create unstructured data lake objects (UDLOs), you can use these filters to define the files and content you want to ingest.
Source | File Name pattern (input) | Expected output |
---|---|---|
Confluence, Google Drive | * | All files |
Only PDF files | ||
[A specific word] | Files matching [a specific word] exactly. | |
[A specific word]* | Files starting with [a specific word] | |
[A specific word] | Files containing [a specific word] | |
Sitemap | * | Data from all URLs in Sitemap |
/amf.* | Data from all URLs in Sitemap with a path starting with /amf. | |
[A specific word] | Data from all URLs in Sitemap with [a specific word] | |
[A specific word]* | Data from URLs in Sitemap with a path starting with [a specific word]* | |
[A specific word] | Data from all URLs in Sitemap with a path which contains [a specific word] |