Create a Microsoft SharePoint Unstructured (Site Pages & Site Assets) Data Lake Object

Create an Unstructured Data Lake Object (UDLO) to ingest your organization's content from Microsoft SharePoint into Data 360.

The data stream ingests data from your selected repositories into a UDLO and maps it to an Unstructured Data Map Object (UDMO). Data 360 uses the UDMO to create a search index, and then uses that index to ground AI-generated responses.

For a list of supported file formats, see Unstructured Data File Formats and Connectors.

This feature is a Beta Service. A customer may opt to try a Beta Service in its sole discretion. Any use of the Beta Service is subject to the applicable Beta Services Terms provided at Agreements and Terms. If you have questions or feedback about this Beta Service, contact the Data 360 Connector team at datacloud-connectors-beta@salesforce.com.

User Permissions Needed
To connect unstructured data: Data Cloud Architect

Before you begin:

  1. From the App Launcher, select Data Cloud.

  2. Click Data Lake Objects and then click New.

  3. Select the From External Files tile, and click Next.

  4. From the New Data Lake Object screen, select Microsoft SharePoint Unstructured Data (Site Pages & Site Assets) and click Next.

  5. From the Select connection dropdown list, choose an available connection. Data 360 auto-populates the source based on the connection that you select.

  6. Add the SharePoint Site Name that you collected earlier.

  7. Select one or both of these SharePoint libraries.

    • Site Pages: Modern SharePoint pages used to create and display unstructured content such as announcements, knowledge articles, and dashboards within a SharePoint site. The page content is stored in the .aspx format, and each page is ingested as a separate content item by the connector.
    • Site Assets: A SharePoint library that stores supporting files used by Site Pages, such as images, documents, media files, and other resources required to render page content.
  8. Click Next.

  9. Add an Object Name and an Object API Name for the UDLO. See Data Lake Object Naming Standards.

  10. From the Data Space dropdown list, select a data space in which to create the new UDMO or a data space from which to select an existing UDMO.

  11. Map the UDLO to a UDMO.

    • To create a new UDMO, click New.
    • To use an existing UDMO, click Existing, and select a UDMO from the list.
  12. Optionally, leave the checkbox selected to create a search index configuration for the UDMO. The configuration uses system defaults that automatically select text fields and a chunking strategy for each field. If you prefer, deselect the checkbox and create a search index configuration later.

  13. To enable content harmonization (Beta) for the UDMO, select the checkbox. This Beta feature requires that content harmonization and rendering are enabled in the feature manager. If you prefer, leave the checkbox deselected and enable content harmonization later.

    Note: When you enable content harmonization, you enable the collection of Content Viewer engagement data.

  14. Click Next, or if you created a search index configuration, review the details.

  15. To use the default settings, leave the remaining fields as-is. To customize the configuration, rename or change the Search Configuration details and objects.

  16. Save your work.

The connector begins to ingest your content. Ingestion takes several minutes or longer, depending on the size and number of the Microsoft SharePoint assets that you're ingesting. To check ingestion progress, see the Monitoring guide.
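
The values that the steps above ask for can be gathered ahead of time as a simple worksheet. The following Python sketch is only an illustration of that idea: it is not a Salesforce or Data 360 API, and the class name `UdloPlan`, its field names, and the helper `missing_inputs` are all hypothetical. It checks only that required values are present and that the selected libraries are among the two named in step 7. It doesn't enforce the Data Lake Object Naming Standards.

```python
from dataclasses import dataclass
from typing import List, Optional, Set

# Library names as they appear in step 7 of the procedure.
SUPPORTED_LIBRARIES = {"Site Pages", "Site Assets"}


@dataclass
class UdloPlan:
    """Hypothetical worksheet for the values the wizard asks for.

    Not a Salesforce API; every field name here is an assumption.
    """
    connection: str                      # step 5
    site_name: str                       # step 6
    libraries: Set[str]                  # step 7
    object_name: str                     # step 9
    object_api_name: str                 # step 9
    data_space: str                      # step 10
    existing_udmo: Optional[str] = None  # step 11; None means create a new UDMO
    create_search_index: bool = True     # step 12 checkbox
    content_harmonization: bool = False  # step 13 checkbox (Beta)


def missing_inputs(plan: UdloPlan) -> List[str]:
    """Return a list of problems to resolve before starting the wizard."""
    problems = []
    required = [
        ("connection", plan.connection),
        ("SharePoint site name", plan.site_name),
        ("object name", plan.object_name),
        ("object API name", plan.object_api_name),
        ("data space", plan.data_space),
    ]
    for label, value in required:
        if not value.strip():
            problems.append(f"{label} is empty")
    if not plan.libraries:
        problems.append("select at least one library")
    unknown = plan.libraries - SUPPORTED_LIBRARIES
    if unknown:
        problems.append(f"unknown libraries: {sorted(unknown)}")
    return problems
```

An empty list from `missing_inputs` means that every value the wizard asks for is at hand. The placeholder values that you pass in, such as a connection or data space name, are your own.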
