Note: The content crawler job does not perform the actions associated with the settings on this page (refreshing or deleting documents); the Document Refresh Agent does. Every time the Document Refresh job runs, it looks at the settings for each document to determine whether anything needs to be done. Therefore, documents are only refreshed or deleted as frequently as the Document Refresh job runs.
To set options that affect how the documents imported by this content crawler are refreshed or deleted:
Under Document Expiration, specify whether these documents should be deleted after a specified period. Choose one of the following options:
To
specify that documents should not be deleted due to expiration, choose
Never expire.
Note: If you set documents to
be refreshed (described in step 2) and set documents to be deleted if
their source documents are not found (described in step 3), even documents
set to never expire can be deleted.
To specify that documents should be deleted
after a specified period, choose Delete
after, type a number in the box, and choose a period in the drop-down
list.
Tip: If you want to delete all documents previously imported
by this content crawler, you can set the documents to expire immediately
(for example, setting them to delete after 1 minute) and apply these settings
to existing documents as described in step 4. The next time the Document
Refresh job runs, it deletes all documents previously imported by this
content crawler.
Under Link and Property Refresh, specify whether these documents and their associated properties should be periodically refreshed.
Choose one of the following options:
To specify that documents and their associated properties should not be refreshed, choose Never.
To specify that documents should be periodically refreshed, choose Every, type a number in the box, and choose a period in the drop-down list.
If you specified that documents should be
refreshed, you can also specify whether the associated properties should
be refreshed. By default, when a document is refreshed, the associated
property values are also refreshed from the source document.
To avoid updating the document properties, select Only
confirm the validity of the links to these documents. The Document
Refresh Agent checks to see if the source document still exists. If it
does exist, nothing happens. If the document is missing, the settings
you specify for how to handle broken links (described in step 3) are applied.
If you run the Document Refresh Agent every day, this feature is
useful for removing broken links quickly; otherwise, running the Document
Refresh Agent on an enterprise-scale portal can take more than a day.
If you specified that documents should be refreshed, under Broken Links, specify what to do if source documents are missing upon refresh. Choose one of the following options:
To specify that documents should remain in the portal, choose Left alone.
To
specify that documents should be deleted as soon as the source documents
cannot be found, choose Deleted immediately.
Note: In case the document has
been only temporarily moved or deleted, you might want to wait before
deleting a document when the source document is missing (described next).
To specify that documents should be deleted if the source documents cannot be found within a specified period, choose Deleted after, type a number in the box, and choose a period in the drop-down list.
If you change the settings on this page after this content crawler has run and you want to apply these new settings to previously imported documents, select Apply these settings to existing documents created by this content crawler. These settings will be applied when you click Finish.
To display the page associated with this help topic: