A content source provides access to external content repositories, allowing users to import content into the portal through the use of content crawlers and document submission. Each content source is configured to access a document repository. For example, a content source for a secured Web site can be configured to fill out the Web form necessary to gain access to that site.
This topic discusses:
To learn how to create or edit administrative objects (including content sources), click here.
A Web content source allows users to import content from the Web into the portal through Web content crawlers or Web document submission.
When you install the portal, the World Wide Web content source is created. This content source provides access to any unsecured Web site.
To learn about the Web Content Source Editor, click one of the following editor pages:
A remote content source allows users to import content from an external content repository into the portal through remote content crawlers or remote document submission.
Some crawl providers are installed with the portal and are readily available to portal users, but others require you to manually install them and set them up. For example, Oracle provides the following crawl providers:
Note: For information on obtaining crawl providers, refer to the Oracle Technology Network at http://www.oracle.com/technology/index.html. For information on installing crawl providers, refer to the Installation Guide for Oracle WebCenter Interaction (available on the Oracle Technology Network at http://www.oracle.com/technology/documentation/bea.html) or the documentation that comes with your crawl provider, or contact your portal administrator.
To create a remote content source:
To learn about the Remote Content Source Editor, click one of the following editor pages:
The following crawl providers, if installed, include at least one extra page to the Remote Content Source Editor:
Depending on users' permissions, they might be able to view, submit, or crawl documents from a content source.
Action |
Permissions Needed |
Access documents imported into the portal |
|
Crawl documents into the portal |
|
Submit a document into the portal |
|
If you have content sources that access sensitive information, be aware that users that have access to the content source and have the additional permissions listed in the table could access anything that the user that the content source impersonates can access. For this reason, you might want to create multiple content sources that access the same repository but that use different authentication information and for which you allow different users access.
If you delete a content source from which documents have been imported into the portal, the links to the documents will still exist, but users will no longer be able to access these documents.