Help for Searching the SAS Web


The search engine handles many types of queries. The simplest query is a single keyword, such as:

	lightbulb
Searching for common words (like "computer" or "html") may take a lot of time. Please be considerate of other users.

It is often helpful to use more powerful queries. The search system supports many different index/search engines, with varying capabilities. At present, our engine uses an indexing system called Glimples which supports:

The different types of queries (and how to use them) are discussed below. The different options - case-sensitivity, the ability to show matched lines vs. entire matching records, and the ability to specify match count limits - can all be specified with buttons and menus in the search forms.

A structured query has the form:

	tag-name : value

where tag-name is a Content Summary attribute name, and value is the search value within the attribute. If you click on a Content Summary, you will see what attributes are available for a particular Broker. A list of common attributes is shown here.

Keyword searches and structured queries can be combined using Boolean operators (AND and OR) to form complex queries. Lacking parentheses, logical operation precedence is based left to right. For multiple word phrases or regular expressions, you need to enclose the string in double quotes, e.g.,

	"internet resource discovery"
or
	"discov.*"

Examples

Simple keyword search query:
	Arizona
This query will return all objects containing the word Arizona.

Boolean query:

	Arizona AND desert
This query will return all objects that contain both words anywhere in the object in any order.

Phrase query:

	"Arizona desert"
This query will return all objects that contain Arizona desert as a phrase. Notice that you need to put double quotes around the phrase.

Boolean queries with phrases:

	"Arizona desert" AND windsurfing
Simple Structured query:
	Title : windsurfing
This query will return all objects where the Title attribute contains the value windsurfing.

Complex query:

	"Arizona desert" AND (Title : windsurfing)
This query will return all objects that contain the phrase Arizona desert and where the Title attribute of the same object contains the value windsurfing.

Query options selected by menus or buttons

These checkboxes allow some control of the query specification.

Case insensitive:
By selecting this checkbox the query will become case insensitive (lower case and upper case letters differ). Otherwise, the query will be case senstive. The default is case insensitive.
Keywords match on word boundaries:
By selecting this checkbox, keywords will match on word boundaries. Otherwise, a keyword will match part of a word (or phrase). For example, "network" will matching "networking", "sensitive" will match "insensitive", and "Arizona desert" will match "Arizona desertness". The default is to match keywords on word boundaries. Note: The previous two options do not apply to attribute names. Attribute names are always case insensitive and must match fully.

Result set presentation

These checkboxes allow some control of presentation of the query return.

Display matched lines (from content summaries):
By selecting this checkbox, the result set presentation will contain the lines of the Content Summary that matched the query. Otherwise, the matched lines will not be displayed. The default is to display the matched lines.
Display object descriptions (if available):
Some objects have short, one-line descriptions associated with them. By selecting this checkbox, the descriptions will be presented. Otherwise, the object descriptions will not be displayed. The default is to display object descriptions.
Verbose display:
This checkbox allows you to set whether results are displayed listing the filename, host, path, and Content Summary each on separate lines, or just with two lines listing the filename (without a label) and the Content Summary (with a label). The default is verbose.

Regular Expressions

Some types of regular expressions are supported by Glimpse. A regular expression search can be much slower that other searches. The following is a partial list of possible patterns. (For more details see the Glimpse manual pages.) Regular expressions are currently limited to approximately 30 characters, not including meta characters. Regular expressions will generally not cross word boundaries (because only words are stored in the index). So, for example, "lin.*ing" will find "linking" or "flinching," but not "linear programming."

List of Common Attribute Names

Below we list the set of attributes. Clicking on a hypertext link below will provide a brief explanation about each.


The search engine is built using the Harvest system.