Editing the Lists |
|
Five word lists are maintained within the Translate Table editor. To switch between lists, simply click the list name in the LISTS task bar along the left side of the translate table editor.
Besides simply typing words into a particular list. Data can be manipulated in the following ways:
Method |
A translate table consists of three groups of lists: SEARCH & LOAD, LOAD ONLY and SEARCH ONLY.
The SEARCH & LOAD group consists of five lists: Suffix List, Stop List, Exception List, Start List and Synonym List.
The suffix list is used to define a list of passes containing suffix patterns used in the stemming process.
The word list consists of three columns:
Column |
Description |
Threshold |
Must be numeric. Number of characters required to be in a word for current pattern-replacement processing to take place. (Overrides Process Threshold if greater) |
Pattern |
ASCII pattern to be matched as a suffix for replacement. |
Replacement |
ASCII string to be used as a suffix to replace matched pattern. |
The stop list is used to define an alphabetized list of stop words.
The exception list is used to define a list of words that are not to be stemmed.
The start list is used to define only specific words to be indexed.
The word list consists of two columns:
Column |
Description |
Pattern |
Words to be found in the data |
Replacement |
Words to be indexed in place of their corresponding Pattern |
The synonym list is used to define a thesaurus or to handle special plurals such as “mice” which is the plural of “mouse”.
The word list consists of two columns:
Column |
Description |
Pattern |
Words to be found in the data |
Replacement |
Words to be indexed in place of their corresponding Pattern |
The LOAD ONLY group consists of four lists: Stop List, Exception List, Start List and Synonym List.
The stop list is used to define an alphabetized list of stop words.
The exception list is used to define a list of words that are not to be stemmed.
The start list is used to define only specific words to be indexed.
The word list consists of two columns:
Column |
Description |
Pattern |
Words to be found in the data |
Replacement |
Words to be indexed in place of their corresponding Pattern |
The synonym list is used to define a thesaurus or to handle special plurals such as “mice” which is the plural of “mouse”.
The word list consists of two columns:
Column |
Description |
Pattern |
Words to be found in the data |
Replacement |
Words to be indexed in place of their corresponding Pattern |
The SEARCH ONLY group consists of four lists: Stop List, Exception List, Start List and Synonym List.
The stop list is used to define an alphabetized list of stop words.
The exception list is used to define a list of words that are not to be stemmed.
The start list is used to define only specific words to be indexed.
The word list consists of two columns:
Column |
Description |
Pattern |
Words to be found in the data |
Replacement |
Words to be indexed in place of their corresponding Pattern |
The synonym list is used to define a thesaurus or to handle special plurals such as “mice” which is the plural of “mouse”.
The word list consists of two columns:
Column |
Description |
Pattern |
Words to be found in the data |
Replacement |
Words to be indexed in place of their corresponding Pattern |