Saturday, March 19, 2016

Endeca Basics : What is Stemming?

The Stemming feature expands search results to include root words and variants of root words.Stemming enables words with a common root (such... thumbnail 1 summary
The Stemming feature expands search results to include root words and variants of root words.Stemming enables words with a common root (such as the singular and plural forms of nouns) to be used interchangeably by searches. 

Stemming enables the MDEX engine to consider alternate forms of individual words as equivalent for the purpose of search query matching.

For example
search results for the word "shirt" will include the plural form "shirts", while a search for shirts will include the singular form shirt.

Enable Stemming

Stemming can be enabled as follow :-
1. Forge based system -  Open CRS.stemming.xml under <<Endeca_App>>/config/pipeline folder
2. CAS based system - Open CRS.stemming.xml under <<Endeca_App>>/config/mdex folder

add the following snippet to enable for English:

<STEMMING>
  <STEM_EN_US ENABLE="TRUE" />

</STEMMING>

Note:-  Endeca OOTB provides the stemming dictionary for many languages.




No comments

Post a Comment

Text Widget