usage_information same_as _information about usage
usage is_a _entity
web_site_usage_information same_as _information about web_site_usage
web_site_information is_a site_information
web is_a _entity
site_information same_as _information about site
web_site is_a site
The tag sequence is encoded using the following rules:
Each word is assigned a letter codes as follows: T = thing or entity, A = agent, M = modifier, E = event or action, P = property, X = information, or _ (underscore) = the word is part of a compound word that inherits the last word’s code.
Normal processing treats adjacent words as hypernym-hyponym pairs, such that W1W2 yields the interpretation: W1W2W3 is_a W2W3, and W2W3 is_a W3, unless an exception is indicated by an operator below.
| (bar) indicates that preceding words (up to the latest bar) should be processed independently, such that W1W2|W3 yields the interpretation: W1W2 is_a W2, and W1W2W3 is_a W2W3 (e.g., see social media presence)
If previous words should be excluded from the concept, then / (slash) can be used to move the previous word marker to the current word position, which excludes previous words from normal processing.
> (greater than) indicates that a word should be copied forward one position for each repetition of this operator and independently interpreted with the remaining word sequence, such that W1>W2W3 yields the interpretation: W1W2W3 is_a W2W3, W2W3 is_a WW3, and W1W3 is_a W3 (e.g., see real-time geographic location)
If | (bar) precedes (greater than), then the preceeding words will all be copied forward to the to the position indicated, such that W1W2|>W3W4 yields the interpretation: W1W2W4 is_a W4 (e.g., see web site usage information)
If a word W1 should be copied to two or more forward positions, then the | (bar) can be used to introduce an additional copy operation, such that W1>>|>W2W3W4 yields the interpretation: W1W4 is_a W4 and W1W3W4 is_a W3W4 (e.g., anonymous user usage data)
If a word W1 should be re-located forward, then the / (slash) can be used to exclude the word from normal processing, such that W1/>W2|W3 yields the interpretation: W1W3 is_a W3, but not W1W2 is_a W2 (e.g., see aggregated behavior information).
- (hyphen) indicates that the adjacent words should be considered as an inseperable whole when modifying a subsequent word, such that W1-W2W3 yields the interpretation: W1W2W3 is_a W3, but not W2W3 is_a W3 (e.g., see server provided cookie)
_ (underscore) indicates that the matching word(s) are a continuation of the previous word in a multiword phrase that should not be processed normally (e.g., see social media presence, but without the interpretation W1W2 is_a W2)