At the
heart of any search endeavour, no matter what
kind of search tool you are using, there are three
areas that can affect your search result significantly:
-
Content of search engine
-
Search logic or algorithm
-
Presentation of search result
We discussed last week how content
of search engines can significantly affect your
search result. This week, we look at search logic.
When confronted with a search engine,
most of us type one or few words in search window
and hope the search result shows most relevant
information.
Now, what can you do besides typing
a few relevant words into the search window ?
Well, actually a lot ! You can specify
that words must be in the title of a page, or
that specifying words must be in an URL, or in
a special HTML tag or you can use logical operators
between words like AND, OR, and NOT....
Most search engines employ keyword-based
search system, that work on occurrence or non-occurrence
of search word to identify relevant documents
in its database. Though a few search engines have
started using fuzzy logic based searches - still
these are in experimental basis.
While keyword based searches are
extremely powerful and ideally suited for database
search - its major weakness is its inability to
understand word meaning. As a result, it searches
mechanically for words - leaving the problem of
synonyms squarely on searcher's shoulder.
To give an example - searching for
'garment' will not get you documents dealing in
'apparel' or 'clothes' or 'T-Shirt'. Worse, the
search engine may even overlook the keyword 'garments'
- unless it has facility for truncation.
So, as searcher, it becomes your
responsibility to construct search expression
in such a way that it gets maximum relevant documents
from search engine database.
Every search engine provides a set
of tools for constructing efficient search expression.
Perhaps the most important of all is the Boolean
search operators.
What
is Boolean Search ?
Boolean searching is based on a system
of symbolic logic developed by George Boole, a
19th century English mathematician. Most keyword
searchable computer databases support Boolean
searches. Boolean search techniques may be used
to perform accurate searches without producing
many irrelevant documents.
When you perform a Boolean search,
you search the computer database for the keywords
that best describe your topic. The power of Boolean
searching is based on combinations of keywords
with connecting terms called operators. The three
basic operators are the terms AND, OR, and NOT.
AND
This operator combines two search
words in a search expression that retrieves documents
containing BOTH the words.
For example: The search expression
'Fabrics AND Buyer' will retrieve all documents
that have both the words 'fabrics' and 'buyer
AND operator narrows a search. More
words you combine with AND - greater will be precision
in search result. However, you may also miss out
relevant documents that do not contain ANY OF
THE words specified.
OR
The OR operator combines two search
words in a search expression that retrieves documents
containing ANY of the words.
For example, the search expression
'Agent OR Agents OR Distributor OR Distributors'
will fetch all documents that have ANY of the
search words.
Obviously, OR operator broadens or
widens a search to include documents containing
any of the search words. This operator is particularly
useful when there are several common synonyms
for a concept or variant spellings of a word.
However, indiscriminate use of OR operator may
fetch junk or unwanted documents.
NOT
NOT operator excludes unwanted documents
having the specified search word. For example,
the search expression 'Buyer NOT Agent' first
fetches all documents that have the word Buyer
then goes on to remove all documents from this
collection that have the word 'agent' in it. The
result is a collection of documents that have
the word buyer but not the word agent.
Boolean search terms may be combined
in various ways to carefully refine searches.
For example:
(Buyer OR 'Buying Agent') AND (UK
OR England OR Britain)
NOTE: A
phrase (i.e more than one word) is always marked
with inverted comma (e.g. 'Buying Agent')
This place is too short for more
explanation. Interested readers may check following
tutorials for graphical illustration of Boolean
operators.
|