Main Settings (Remote Content Crawler)

To learn about content crawlers and content Web services, click here.

To specify the destination folder and security for documents imported by this content crawler:

Note: Depending on what type of remote content crawler you are creating, you might see additional settings. To see online help for those settings click the help button on the associated page.

  1. Under Destination Folders, specify into which folders you want to import content. The content crawler attempts to import a link to every document it finds into the most subordinate subfolder within the destination folder that allows the link to pass. Click here for a flow chart showing how the content crawler determines into which folders it will import content.

  2. If the content Web service used by this content crawler supports folder mirroring (specified on the Advanced Settings page of the Content Web Service Editor), you can have this content crawler create Directory folders that duplicate the folder structure of the content repository being crawled by selecting Mirror the source folder structure.

    Notes:

  3. To require that documents pass the filters of destination folders before the documents are imported into those folders, select Apply Filter of Destination Folder. By default, documents do not need to pass the filters of destination folders, so all documents will be imported into all destination folders.

    Note:
    This feature is not available if you mirror the source folder structure.

  4. To accept all imported documents into the portal and make them immediately available to users, select Automatically approve imported documents. By default, documents require approval. This means that before the link to the imported document is available to users, it must be approved by a portal administrator with at least Edit access to the destination folder.

    If you are mirroring the folder structure, you might want to set imported documents to be approved automatically and restrict users to Read access (users in the Administrators group always have Admin access). If you set imported documents to require approval, be aware that any portal administrator who has at least Edit access can also modify the folders and content, and can therefore make your portal folders and content out of sync with your source repository.

  5. If the content Web service used by this content crawler supports security importation and the source repository users and groups correspond to portal users and groups (specified in the Global ACL Sync Map), you can have this content crawler import the security settings for each document by selecting Import security with each document. This automatically makes documents that are available to source repository users available to the mapped portal users.

    Note:
    Because read access is equivalent in the source repository and the portal, but write access is not, only read access is imported; write access is ignored because write access to a document in an external repository allows you to edit the document, but write access (referred to as Edit access) in the portal allows you to edit the properties and security settings of that document.

  6. Under Document Access Privileges, you can perform the following actions to manually grant users and groups access to the content imported by this content crawler:


  1. Click Administration.
  2. Open the Content Crawler Editor: