Steps to remove documents from the ESP index

Symptom

There are specific documents in a given collection that need to be removed from the ESP index.    

Cause

A user may want or need to manually remove documents from the ESP index for any number of reasons. For example, the documents may be old and outdated, they may be test documents, they may be formatted improperly, etc.

Resolution

There are two simple approaches to removing specific documents from a given collection.  The difference between the two methods depends on whether the user has contentids of the documents to be deleted or internalids of the documents to be deleted. 

1. If the user has contentids of the documents to be deleted, the filetraverser can be used with a text file containing the contentids of the documents.  In this case, the contentids should appear in the text file as one per line.  For example:
http://contoso.com/mydoc1.html
http://contoso.com/mydoc2.pdf
http://contoso.com/mydoc3.html

The command for the filetraverser must include the collection name and the path to the text file containing the contentids of the documents to be deleted:
filetraverser -c <collection> -t <delete_file>

For example (collection = mycollection, text file = delete.txt):
filetraverser -c mycollection -t C:\temp\delete.txt


2. If the user has internalids of the documents to be deleted, the indexeradmin command can be used with a text file containing the internalids of the documents.  In this case, the internalids should appear in the text file as one per line.  For example:
975d02ee431a7a022a5bc3abc53fb8ed_mycollection
bd9c3e19343f4d18c8d5d37b1160b0cf_mycollection
9beb827015e00ffd709667693566fc76_mycollection

The indexeradmin command must include the rdocs flag, the path to the text file containing the internalids of the documents to be deleted, the collection name and a sessionid:
indexeradmin rdocs <delete_file> <collection> <sessionid>

For example (text file = delete.txt, collection = mycollection, sessionid = 99):
indexeradmin rdocs delete.txt mycollection 99

Note:The sessionid is an integer, usually 99 or 100.  The number used is not important to the functionality. A warning message may be displayed indicating the sessionid is in use.  In this case, the command should be run again with the sessionid increased by one.

Rekvizīti

Raksta ID: 2522209. Pēdējo reizi pārskatīts: 2011. gada 14. apr.. Pārskatījums: 1

Atsauksmes