Field Token |
|
The field token in the data parsing information part of the Load Definition File may contain a data token, a literal, a dynamic data item, a conditional expression, a multiple choice expression or a special token.
Data Token
Data Tokens are simple strings used to identify data fields within the
Data Source File. Data within
the Data Source File must be
stored as token:value pairs. When the token is matched, the value portion
is loaded as field data.
Example: Data Tokens
Related Data Field |
Field Token |
Special Processing |
_PURGE_DATA |
"May 31, 2004" |
(d) |
_ADDRESS |
address |
|
_LONGITUDE |
longitude |
(x4) |
_LATITUDE |
latitude |
(y4) |
_CATEGORY_ID |
"64" |
|
The data tokens in this example are address, longitude, and latitude.
Literals
Literals may be used in place of data tokens. Literals may be numeric values
or date-time constants for numeric fields, or they may be text constants
for string and text fields. All literals must be enclosed within a pair
of quotation marks.
Example: Literals
Related Data Field |
Field Token |
Special Processing |
_CREATE_DATE |
"today" |
(d) |
_PURGE_DATA |
"May 31, 2004" |
(d) |
_ADDRESS |
address |
|
_LONGITUDE |
longitude |
(x4) |
_LATITUDE |
latitude |
(y4) |
_CATEGORY_ID |
"64" |
|
_SOURCE |
"The Daily News" |
|
The literals "today" and "May 31, 2004" are treated as dates because the extract date (d) special processing value has been specified. The literal "today" will evaluate to the current date that this record is processed.
Dynamic
Data Items
Dynamic Data Items may be used in place of data tokens. A dynamic data
item may be dynamic value or a dynamic date-time value for numeric fields,
or a dynamic text string for string or text fields. Relative date-time
values and simple date-time expressions must be enclosed within a pair
of quotation marks. All other dynamic data items must be enclosed within
a pair of grave accents.
Example: Dynamic Data Items
Related Data Field |
Field Token |
Special Processing |
_CREATE_DATE |
"today" |
(d) |
_PURGE_DATA |
"today + 7 days" |
(d) |
_ADDRESS |
address |
|
_LONGITUDE |
longitude |
(x4) |
_LATITUDE |
latitude |
(y4) |
_CATEGORY_ID |
`(_FLAGS << 4 )` |
|
_FULL_PATH |
path |
|
_CONTENT |
`_@SCAN("192.168.1.2", 1048, _FULL_PATH, "BODY" )` |
|
The first dynamic data item, "today + 7 days", evaluates to seven days after the current date. The second dynamic data item, `(__FLAGS << 4 )` shifts the bits in __FLAGS four to the left. The third dynamic item utilizes the _@SCAN function to retrieve content for indexing from the document file.
Conditional
Expressions
Conditional Expressions may be used in place of data items. Conditional
expressions must always be enclosed within a pair of grave accents. Any
data token, literal, or dynamic data item that can be used directly in
the second field can be used as the conditional result.
Example: Conditional Expressions
Related Data Field |
Field Token |
Special Processing |
_DATE |
copy_date |
(d) |
_PURGE_DATE |
`IF (_DATE) ( _DATE + "7 days" ) ENDIF` |
(d) |
_ADDRESS |
address |
|
The conditional expression used in this example, checks if _DATE is defined and increments it by 7 days if it is.
Multiple
Choice Expressions
Multiple Choice Expressions may be used in place of data items. Similar
to conditional expressions, multiple choice expressions must always be
enclosed within a pair of grave accents. Any data token, literal, or dynamic
data item that can be used directly in the second field can be used as
the conditional result or the alternate result.
Example: Multiple Choice Expressions
Related Data Field |
Field Token |
Special Processing |
_CREATE_DATE |
copy_date |
(d) |
_DATE |
`IF (_CREATE_DATE) _CREATE_DATE ELSE @CDT ENDIF` |
(d) |
_PURGE_DATE |
`( _DATE + "7 days" )` |
|
_ADDRESS |
address |
|
The multiple choice expression used in this example, checks if _CREATE_DATE is defined and uses it if it is, if it is not, @CDT is used which evalutes to the current date and time.
Special
Token
Special tokens are simple strings beginning with an at sign @, used to
access meta data not stored within a document, or in one case, the entire
document.
The following special tokens are supported:
Token |
Description |
@IDT |
Returns the identity value generated for the document by the scanning software. |
@CDT |
Returns the current system date and time at thetime the document is being indexed. When properly controlled, the indexed date provides an ideal value for a timeline used as a primary index, since an additional scan to obtain the value is not required. Normally, a document must be partially scanned to obtain the value being used by the primary index prior to the full scan for storage and indexing of data. |
@FDT |
Returns the file system date and time stampassociated with the document by the operatingsystem where the document is being stored. When properly controlled, the file system date provides an ideal value for a timeline used as a primary index, since an additional scan to obtain the value is not required. Normally, a document must be partially scanned to obtain the value being used by the primary index prior to the full scan for storage and indexing of data. |
@ALL |
Returns the entire document for storage and indexing. |
@URL |
Returns the URL which may be used to access the document from the internet. The URL is generated by replacing the path to the directory tree with the base URL within the file path. |
@FNM |
Returns the filename for the document being processed. |
@CNT |
Returns the number of individual folders, beyond the root, in the path to the document being processed. |
@D01 |
Returns the first folder, beyond the root, in the path to the document being processed. |
@D02 |
Returns the second folder, beyond the root, in the path to the document being processed. |
@D0n |
Returns the nth folder, beyond the root, in the path to the document being processed. |
@U01 |
Returns the last folder, prior to the filename, in the path to the document being processed. |
@U02 |
Returns the next to the last folder, prior to the filename, in the path to the document being processed. |
@U0n |
Returns the nth folder from the last in the path to the document being processed. |
Example: Special Tokens
Related Data Field |
Field Token |
Special Processing |
_DATE |
@FDT |
(d) |
_PURGE_DATE |
"today + 7 days" |
(d) |
_AUTHOR |
byline |
|
_DOCUMENT |
@URL |
|
_CATEGORY_ID |
"64" |
|
_SOURCE |
"The Daily News" |
|
The special tokens used in this example are @FDT and @URL.