Manage content sources

A content source is a collection of addresses that are the seeds of the content that you want to crawl. A content source also specifies settings that define the crawl behavior and the schedule on which the content will be crawled.

To manage content sources, you must first open the Manage Content Sources page:

  • On the Search Administration page, under Crawling, click Content sources.

What do you want to do?

Add a content source

Edit a content source

Start, stop, pause, or resume crawling of content sources

Delete a content source

Add a content source

  1. On the Manage Content Sources page, click New Content Source.

  2. On the Add Content Source page, in the Name box in the Name section, type a name for the content source.

  3. In the Content Source Type section, select the type of content that you want to crawl using this content source.

  4. In the Start Addresses section, in the Type start addresses below (one per line) box, type the URLs from which the search system should start crawling.

  5. In the Crawl Settings section, select the behavior for the type of content that you selected.

  6. In the Crawl Schedules section, you can specify when to complete full and incremental crawls.

    To schedule full crawls, on the Full Crawl drop-down list, click a schedule. You can create a custom schedule by clicking Create Schedule. A full crawl crawls the entire content source whether or not the content source has changed.

    To schedule incremental crawls, on the Incremental Crawl drop-down list, click a schedule. You can create a custom schedule by clicking Create Schedule. An incremental crawl crawls content in the content source that has changed since the last crawl.

  7. If you want to begin a full crawl immediately, in the Start Full Crawl section, select the Start full crawl of this content source check box.

  8. Click OK.

Top of Page

Edit a content source

You can edit a content source to change the schedule on which the content is crawled, the seed addresses, or the crawl settings. However, you cannot change the content type by editing a content source.

  • On the Manage Content Sources page, in the list of content sources, point to the content source that you want to edit, click the arrow that appears, and then click Edit on the menu that appears.

    Information about the settings for content sources can be found in the Add a content source section.

Top of Page

Start, stop, pause, or resume crawling of content sources

You can start, stop, pause, or resume the crawls of individual content sources.

Paused crawls can be resumed, while stopped crawls cannot. Stopping a crawl causes the next crawl to be a full crawl.

  • To start, stop, resume, or pause crawling of a single content source, in the content sources list, select one of the following on the menu of the content source that you want to configure:

    • Start Full Crawl     

    • Start Incremental Crawl     

    • Resume Crawl     

    • Pause Crawl     

    • Stop Crawl      When you select this option, you will need to click OK in the message box that appears asking whether you want to stop the crawl.

Top of Page

Delete a content source

When you delete a content source, all content crawled from that source is removed from the search index and will therefore be unavailable during searches.

  1. On the Manage Content Sources page, in the list of content sources, click Delete on the menu of the content source that you want to delete.

  2. In the message box, click OK to confirm that you want to delete the content source.

Top of Page

Share Facebook Facebook Twitter Twitter Email Email

Was this information helpful?

Great! Any other feedback?

How can we improve it?

Thank you for your feedback!

×