Title of Invention

PROGRESSIVE RELAXATION OF SEARCH CRITERIA

Abstract A method for managing an information search, the method comprising the steps of: receiving a query (402) at a database server (106), the query comprising a sequence of sub-queries and a specification of a progression in which to execute the sub-queries; and executing (404) one or more of the sub-queries in an order specified by the progression, characterized in that each sub-query is executed only if the results of the previously executed sub-queries are not sufficient to fulfill a search request.
Full Text

PROGRESSIVE RELAXATION OF SEARCH CRITERIA
FIELD OF THE INVENTION
[0001] The present invention relates generally to information retrieval and, more
specifically, to techniques for specifying progressive relaxation of search criteria.
BACKGROUND OF THE INVENTION
[0002] Applications for servicing client, or end user, queries often operate such that
not only are exact matches for a user-specified query identified if they exist, but the
closest non-exact matches are also identified and returned to the end user. In this context,
the term "query" is not limited to a conventional database query, such as a query in SQL
(Structured Query Language). Generally, a query includes any search for information
through any search mechanism, such as a conventional search engine or search function.
Typically, the user's search request is eventually transformed into a structured database
query.
[0003] One approach to servicing search requests or queries, in order to identify
existing exact matches and non-exact matches, involves: (1) rewriting or reconstructing
the user query to include all allowable variations of the original query; (2) retrieving a
"hit-list" for the reconstructed concatenated query by submitting the query to a database
server; and (3) ordering the hit-list in an order based on the relevance to the original
search criteria (sometimes referred to as "relevance ranking").
[0004] For example, if a user initiates a search for information on "cheap pen" on
some form of information repository, such as a database or the collection of information
that is accessible via the Internet, an "expanded query" is constructed to include both the
original query and to include one or more sub-queries that relax the requirements of the
original query. An expanded query associated with a search for "cheap pen" might
include sub-queries for other allowable versions of "cheap pen," such as "cheap NEAR
pen," "cheap AND pen," "$cheap AND $pen" (where "$" represents a grammatical stem
operation), "cheap OR pen" and the like. A hit-list is produced based on this set of sub-
queries, and the hit-list is then ordered. The ordering may be based on, for example, the
specific sub-query that produced a given hit and a speculative relevance to the end user
that is requesting the information.
[0005] In such an approach, useless work may be performed because all of the sub-
queries are executed, whether or not necessary to actually fulfill the user's request and
interest. That is, the first sub-query executed may produce a sufficient number of hits or

sufficiently relevant results to satisfy the user's interest. Furthermore, if a given sub-
query is particularly unselective, it may produce many more hits than are necessary to
satisfy the user's interest and unnecessary work is performed by parsing the query
statement, querying the information repository, and producing and ordering the results.
[0006] Another approach involves: (1) executing sub-queries associated with
allowable variations of the original query, in series in descending order of priority; and
(2) retrieving hits until enough hits have been located, based on some criteria. This
approach involves an entity other than the database server, such as an end user or a search
mechanism, issuing a query to the database server based on the original search criteria,
receiving results from the query, issuing another query to the database server that expands
the original search criteria, receiving results, and continuing this iterative process until the
search request has been satisfied according to some quantitative criteria. Query response
time and network performance suffer when using this approach due to the potential for
multiple complete round-trip communications between the entity and the database server,
which unnecessarily load the system. In this context, and throughout the specification, a
complete round-trip communication refers to the network communication between a
client entity and a database server, as well as the processing performed by the database
server, which often includes: (1) parsing the query; (2) constructing a query execution
plan; (3) optimizing the query execution; and the like. Secondary client-server
communications refer to communications between client and server applications which do
not incur the same processing overhead as complete round-trip communications.
[0007] Both of the foregoing approaches are inefficient in terms of response time,
processing, and network loading. Furthermore, these approaches are cumbersome for
developers of search applications and mechanisms because they require such applications
and mechanisms to speculatively relax the search requirements and process results with
respect to relevance ranking. Furthermore, they provide limited capabilities, if any, for
end users to affect the priority of search term variations in the context of relaxation of the
original search criteria and, consequently, the relevance of associated results.
[0008] Based on the foregoing, it is clearly desirable to provide an improved
mechanism for servicing information searches. There is a more specific need to provide
more control to an end user that is requesting a search for particular information to
provide more efficient and more relevant performance.

BRIEF DESCRIPTION OF THE DRAWINGS
[0009] The present invention is illustrated by way of example, and not by way of
limitation, in the figures of the accompanying drawings and in which like reference
numerals refer to similar elements and in which:
[0010] FIG. 1 is a block diagram that illustrates a simplified example of an operating
environment in which an embodiment may be implemented;
[0011] FIG. 2 is a block diagram that illustrates relevant functional components of a
database server and a search mechanism;
[0012] FIG. 3 is a flow diagram that illustrates a process for managing an information
search;
[0013] FIG. 4 is a flow diagram that illustrates a process for managing an information
search; and
[0014] FIG. 5 is a block diagram that illustrates a computer system upon which an
embodiment of the invention may be implemented.
DETAILED DESCRIPTION
[0015] A method and mechanism are described for specifying progressive relaxation
of search criteria. In the following description, for the purposes of explanation, numerous
specific details are set forth in order to provide a thorough understanding of the present
invention. It will be apparent, however, that the present invention may be practiced
without these specific details. In other instances, well-known structures and devices are
shown in block diagram form in order to avoid unnecessarily obscuring the present
invention.
OVERVIEW
[0016] In order to provide a more efficient search mechanism, control over the
relaxation of a search query is provided to users that are requesting searches. Progressive
relaxation of queries allows for complex queries without compromising performance.
Generally, complex queries are processed to provide increased recall, or completeness, of
results without compromising the precision, or quality, of the results. Through such a
mechanism, a user can specify a sequence of sub-queries that is associated with variations
of the main search term, and specify a progression in which to execute the sub-queries.
Hence, users can impart their priorities with respect to search term variations used in
relaxing the main search criteria, which further allows the users to impart their notion of
the relevance of results that may be returned by particular sub-queries. Unlike prior

approaches, the sub-queries are not all immediately and fully executed by the database
server when they are received. Rather, each sub-query is only executed when the
previously executed sub-query has not produced results that satisfy the user.
[0017] In an embodiment, the sub-queries select data from a database based on search
criteria associated with the information being requested, which may include variations of
original search terms, and the progression according to which the sub-queries are
executed corresponds to a progressive relaxation of the original search terms.
[0018] According to one aspect, a series of sub-queries specified by a user are
received at a server, for example, and executed in an order based on the progression
specified by the user. Hence, multiple sub-queries may be executed by the server in
response to only a single complete round-trip communication between the user/client and
the server, and the server need only process a single query. Network loading and
response time and are thereby improved.
[0019] In an embodiment, a series of sub-queries specified by a user are received at
the database server in a single round-trip communication. The server then executes the
sub-queries in an order based on the progression specified by the user, however, each sub-
query is executed only if the results of the previously executed sub-queries were not
sufficient to fulfill the search request. For example, if the user requests a specific number
of result hits, the sub-queries are executed in series until the number of requested hits
have been produced, but no more sub-queries than are necessary are executed. Thus, for
example, if execution of the first sub-query provides enough hits or sufficiently relevant
hits to satisfy the user request, then none of the other sub-queries are executed and,
consequently, computational resources are conserved. In such an embodiment, the user
may specify "stop criteria" (e.g., a minimum number of hits, an amount of time, a volume
of data, or a combination of such criteria) through interaction with a user interface prior to
submission of a search. Once the search results satisfy the stop criteria, the database
server does not execute any further sub-queries. Hence, this user-specified stop criteria
approach would only require one complete round-trip communication between the client
and server, and the server need only process a single query.
[0020] Alternatively, the user may iteratively request more hits through interaction
with a user interface, as needed to satisfy the user's interest. In this case, the user
interaction is not sending a new query, but indicating that the server is to proceed to the
next sub-query in the original received query. Similarly to the user-specified stop criteria
approach, only one complete round-trip communication and associated processing is
required. This scenario may require multiple secondary communications between the

client and server because the server does not have knowledge of the stop criteria prior to
formulation of a query execution plan. However, these secondary client-server
communications are not computationally expensive, complete round-trip communications
that require significant query processing overhead.
[0021] Non-limiting examples of some benefits provided by the techniques described
herein are (1) a reduction in network load due to a reduction in computationally expensive
complete round-trip communications between a client and a server; (2) a reduction in
unnecessary load on a database server due to executing queries that are not actually
required to meet an end-user's needs; (3) an elimination or reduction in work from a
search engine or middleware application with respect to relevance ranking processing;
and (4) ease of use, by shifting work to a database server from an end user, and/or ease of
application development, by shifting work to a database server from a search engine or
middleware application, both with respect to construction of queries based on original
search terms.
OPERATING ENVIRONMENT EXAMPLE
[0022] FIG. 1 is a block diagram that illustrates a simplified example of an operating
environment in which an embodiment of the invention may be implemented.
[0023] Embodiments comprise techniques for managing information searches, which
includes managing database queries. Hence, an example of an operating environment
includes a client 102, a search mechanism 104, a database server 106 and a database 108.
[0024] Client 102 is a client computer software application that executes on a
computing platform, such as a desktop or laptop computer, to communicate with a server
computer software application, such as search mechanism 104 or database server 106.
Client 102 facilitates a process in which an end user, such as a person requesting a search
for information, communicates with search mechanism 104 in order to request data or
information. For example, client 102 may be a conventional web browser that facilitates
communication over a network, such as the Internet. Thus, client 102 may display on a
display terminal various web pages, data input frames, query results, and the like. Client
102 displays information or pages served from search mechanism 104.
[0025] Search mechanism 104 is typically a combination of a computer software
application and the computer hardware on which the application executes, such as a
server computer. Search mechanism 104 provides a user interface and searching
functionality to an end user using client 102. For non-limiting examples, search
mechanism 104 may be what is commonly referred to as a search engine, for searching

the Internet for information contained in web pages, or may be an interface to a more
specialized information source, such as a search function on an e-commerce or corporate
web page, or may be an application server.
[0026] Search mechanism 104 serves as an interface between client 102 and a
database server 106, and can be considered a client of database server 106. Search
mechanism 104 and database server 106 can be configured together on a single
computing platform and can be configured as related software modules that provide
integrated functionality to client 102. Thus, search mechanism 104 and database server
106 are not required to be separate entities as depicted in FIG. 1.
[0027] Database server 106 provides the data requested by an application server, such
as search mechanism 104, on behalf of a client 102. The database server does all of the
remaining query processing not performed by the search mechanism 104 or by a user of
client 102. Thus, database server 106 is a highly functional interface to a data repository,
such as database 108, for managing a large amount of data in a multi-user environment.
In broad terms, database server 106 accesses data in database 108 pursuant to a request.
[0028] Database 108 is a collection of data treated as a unit. Generally, the purpose
of a database 108 is to store and retrieve related information. Database 108 comprises
some type of data storage unit or data container, such as a data table with associated rows
and columns or an object class with associated objects. Database 108 typically comprises
multiple other structures for storing, accessing and manipulating data, for example,
indexes on tables. Embodiments do not require any specific logical or physical structure
for database 108.
TEXT QUERIES
[0029] According to one embodiment, a CONTAINS operator is used to implement a
progressive relaxation mechanism as described herein. A CONTAINS operator is used in
the WHERE clause of a SELECT statement to specify a query expression for a text query.
The CONTAINS operator returns a relevance score for every row selected, which is
obtainable with a SCORE operator. One grammar for the CONTAINS operator is called
CONTEXT. Another applicable grammar is CTXCAT. An example of a query that
includes a CONTAINS operator is as follows:

CONTAINS (
[schema.]column
textquery VARCHAR2
[,label NUMBER])
RETURN NUMBER;
where [schema.]column specifies the text column to be searched, which preferably has a
text index associated therewith. The textquery parameter specifies (1) the query
expression that defines the search in the column, or (2) a marked-up string that specifies a
query based on the CTXCAT grammar. In specifying a query based on the CTXCAT
grammar, via a query template, the query string uses the following tags:
, which signals that this query is to be interpreted as a query
template;
, which specifies the query string;
grammar= , which specifies the grammar of the query;
, which specifies the score preference; and
datatype= , which specifies the type of number returned as score.
[0030] For each row selected, CONTAINS returns a number between 0 and 100 that
indicates how relevant the document row is to the query, with 0 indicating that no
matches are found in a given row.
[0031] The CONTEXT grammar is the default grammar for the CONTAINS operator
and allows use of query operators in a query expression. For example, the logical
operator AND allows for searching for all documents that contain two different words;
the ABOUT operator allows for searching on concepts; the WITHIN operator allows for
section searching, such as in sections of XML or HTML documents; and the NEAR
operator allows for proximity searches, stem searches, fuzzy, and thesaural operators for
expanding a query expression, or stated differently, for relaxing the search criteria
associated with a query. In addition, an index on a table that stores text is typically of
indextype "context", if the CONTEXT grammar is expected to be utilized for text
searches.
FUNCTIONAL COMPONENTS
[0032] FIG. 2 is a block diagram that illustrates relevant functional components of
database server 106 and search mechanism 104.

[0033] Search mechanism 104 comprises a rule interpreter 220. Rule interpreter 220
operates to interpret and translate rules associated with an information search. For
example, in one embodiment, which is described in more detail below, a user of client
102 (FIG. 1) specifies a search term or value, referred to as search criteria, in the form of
rules which are interpreted by rule interpreter 220 and translated into another format. For
example, a user might provide a set of search rules such as (search terms, AND, OR),
which specifies search criteria (i.e., original search terms in original form, conjunctive
form of search terms; disjunctive form of search terms) and an order in which to execute
related sub-queries (i.e., execute original form first, conjunctive form next, disjunctive
form next).
[0034] Upon receiving a set of rules, rule interpreter 220 constructs a query template
based on the set of rules. According to one embodiment, the query template is in the
form of a template, as described above. According to another embodiment,
the query template is in the form of a "rewrite template," which is described below. In
both embodiments, the query template represents search criteria and an order in which
sub-queries associated with the search criteria are to be executed.
[0035] In another embodiment, rule interpreter 220 generates a relaxation query 250a
based on the rules received, for submission to database Server 106. The relaxation query
250a includes search criteria and an order in which constituent sub-queries associated
with the search criteria are to be executed.
[0036] Rule interpreter 220, or other modules within search mechanism 104, may be
configured to provide other functionality to make users' interaction with database server
106 user-friendly.
[0037] Database server 106 functionally comprises a query relaxation engine 202 and
a query execution engine 210. Query relaxation engine 202 comprises a template
interpreter 204 and a query constructor 206, and query execution engine 210 comprises a
context module 208.
[0038} Template interpreter 204 is configured to interpret various forms of search
criteria, such as query templates, that specify or otherwise represent a sequence of sub-
queries and an order in which to execute the sub-queries. Template interpreter 204 passes
information to query constructor 206, which processes the received information into a
query. The query is executed to fetch information from database 108 (FIG. 1), possibly in
conjunction with context object 208.
[0039] Template interpreter 204 is configured to interpret query templates, such as
what may be output from rule interpreter 220 or some other source. For example, rule

interpreter 220 receives a set of rules from an end user and generates a query template
based on those rules, which serves as a request to construct a query based on the query
template. In turn, template interpreter 204 receives and interprets the query template and
passes information to query constructor 206 that is necessary to construct a query
according to a query language that database server 106 supports, such as SQL or PL/SQL.
In one embodiment, a rewrite template is received from rule interpreter 220 and includes
statements that effectively request that a query be constructed based on the set of rules.
An example of a rewrite template, with respect to a search for "cheap pen", is as follows:

where the arguments of the TRANSFORM function include a prefix associated with the
search terms, a suffix associated with the search terms, and an operator related to the
search terms. Thus, a prefix or suffix can be specified in association with the search
terms, in addition to a relevant operator associated with the search terms, such as "&"
which represents logical operator "AND " or "/" which represents logical operator "OR".
The prefix and suffix fields provide support for variations such as stem and wildcard
searches. Hence, use of this embodiment in conjunction with rule interpreter 220
pro\/ides a user-friendly capability for end users to simply provide a set of rules as input
for a search, whereby through a series of processes an associated query is constructed for
execution by database server 106 to fetch information from database 108 (FIG. 1).
[0040] Based on the foregoing rewrite template, template interpreter 204 parses the
search phrase "cheap pen" into multiple words, "cheap" and "pen", and each word is

transformed according to the specified rules and passed to query constructor 206 for
generation of a query on database 108 (FIG. 1). For example, processing of the function
TRANSFORM(TOKENS,' ','$','&'), with respect to the search terms "cheap pen",
functions to construct a sub-query based on suffix stems of "cheap" and "pen", in the
conjunctive form. Thus, a sub-query is constructed based on the REWRITE operator and
the TRANSFORM function, which searches for (1) "cheaper" AND "pen"; and (2)
"cheapest" AND "pen".
[0041] The preceding delineation of functionality between template interpreter 204
and query constructor 206 is not limiting, for these functions may be performed by an
integrated interpreter 204/constructor 206 software module.
[0042] Query constructor 206 receives various forms of information that define a
search, such as text strings, text query templates, rewrite templates, and non-text
parameters. Query constructor 206 then constructs queries from the information received,
such as relaxation query 250c. Such queries are, in turn, submitted to query execution
engine 210 for execution against database 108 (FIG. 1).
[0043] Query execution engine 210 interfaces with and executes queries, such as
relaxation queries 250a, 250b, 250c, against database 108 (FIG. 1). Query execution
engine 210 comprises a context module 208 that implements the functionality of the
CONTEXT grammar, when searching for text. Hence, if a CONTAINS operator is
identified in the process of executing a SQL query, the CONTAINS argument text string
or template is passed to the context module 208, possibly via template interpreter 204, for
processing according to the functionality of the CONTEXT grammar. In addition, query
execution engine 210 supports execution of a relaxation query 250b, received directly
from an end user.
USER-SPECIFIED PROGRESSIVE RELAXATION OF QUERY REQUIREMENTS
[0044] According to an aspect of the invention, native support for progressive
relaxation of database queries is provided in a database server application. Hence,
applications and end-users are able to specify a sequence of sub-queries that is associated
with a data search, and an order in which to execute the sub-queries to produce a desired
result. Therefore, users can affect the manner and extent of the search performed to
enhance the performance of the search. In addition, users can affect the quality of the
relevance ranking of related results or hit-lists, without significant degradation in query
response time or throughput. Such a technique improves query response time and

throughput by reducing unnecessary query/search term expansions and computationally
expensive complete round-trip communications between client and server applications.
[0045] In an embodiment, capabilities are provided for a user to specify a progression
sequence of sub-queries to be executed with respect to different sections of a document,
such as HTML and XML documents. For example, a user may specify that a term be
searched for first in the title of one or more documents and next in the body of the one or
more documents and next in the description of graphic objects that appear in the one or
more documents. Hence, refined search capabilities are provided.
[0046] In an embodiment, a query that includes a CONTAINS clause can be issued
against a database, wherein the search string is not required to adhere to conventional text
query grammar. Instead, a search string may adhere to text query template format, as the
following example illustrates, in which a query template is used as the query expression
argument for a CONTAINS operator. The following provides an example query for
searching text stored in a data table that is indexed with a context index type:
SELECT * from tablename
WHERE CONTAINS (columnname, 'search criteria', 1) > 0.
[0047] Thus, according to this embodiment, a query template is provided as the
argument to the CONTAINS clause, depicted in the preceding query as the "search
criteria." In such an embodiment, the following example mark-up document, or query
template, can be used as an argument to a CONTAINS clause. That is, the query
template is used to represent the search criteria. The query template specifies a sequence
of sub-queries, i.e., probes into one or more tables, and a progressive order in which to
execute the sub-queries against a specified column of a table.


[0048] The preceding query and template illustrate a search on text, however, any
data value or data type stored in one or more databases may be searched by applying the
techniques described herein. Furthermore, the databases may be distributed and
associated with multiple distributed servers, such as in the context of searching the
Internet.
[0049] In an alternative embodiment, the progression is implemented with SQL or
PL/SQL extension operators instead of a mark-up template. For example, a
PROGRESSION operator is defined as an extension to existing set of SQL operators, thus
providing native database support for specification of progressions according to the
techniques described herein.
[0050] During execution of a query that includes a clause such as a CONTAINS
clause for text or a similar clause for non-textual data, upon identification of a
tag, the execution engine will not process the relevant portion of the query as a typical
query. Rather, a database server, such as database server 106 (FIG. 1), extracts the
information between the tags, determines whether a progression is specified,
such as via a tag, and executes related sub-queries in an order
specified by the progression.
[0051] In the example above, a sub-query would first be executed to search for
documents that contain the phrase "cheap pen". Next, a sub-query could be executed to
search for documents that contain both words "cheap" and "pen". Finally, a sub-query
could be executed to search for documents that contain either of the words "cheap" or
"pen".
[0052] In one embodiment, each of the three sub-queries may be automatically run in
series according to the specified progression, with the results ordered according to which
sub-query produced the, given results. For example, hits returned from the first sub-query
are produced or displayed with higher relevance ranking than hits returned from the
second and third sub-queries.
[0053] The manner in which the relevance rankings are presented is not limited. For
example, the relevance ranking of the results may be identified simply by the order of
presentation to the search requestor or may be assigned numerical rankings, percentages,
etc. Such a scenario eliminates multiple round-trip communications between the client
and the server and consequent network overhead.
[0054] In an alternative embodiment, each of the sub-queries may only be executed, if
at all, when the previously execute sub-queries did not fulfill the information search

request. For example, if tens or hundreds of documents that include the phrase "cheap
pen" are located by executing only the first sub-query, then execution of the remaining
sub-queries is suspended or foregone completely. Thus, such a scenario eliminates
multiple complete round-trip communications between the client and the server and
consequent query processing and network overhead, as well as unnecessary probing.
However, if the user specifically requests additional results, then the second and/or third
queries can then be executed as necessary to fulfill the specific requirements of the search
request. In such a scenario, subsequent client-server communications are not what is
referred to above as complete round-trip communications, but are considered secondary
communications because they require little or no "overhead" processing of the query,
such as parsing, optimization, and the like.
PROCESS FOR MANAGING AN INFORMATION SEARCH-CLIENT
[0055] FIG. 3 is a flow diagram that illustrates a process for managing an information
search. According to one aspect of the invention, this process is performed by an end
user who is requesting an information search, typically via a user interface.
[0056] At block 302, one or more information values are specified relating to
information being searched for. At block 304, an order in which to execute sub-queries
associated with the one or more values is specified. For example, a user may submit a
query template, as described above, or a set of rules from which a query is constructed, as
described above. Blocks 302 and 304 are preferably performed concurrently, however
the steps may be performed in sequence. In one embodiment, the values and order are
specified via a user interface, and in one embodiment, in the form of an XML document.
[0057] At block 306, the one or more information values and the order are submitted
for execution of one or more of the sub-queries in the order specified. This information
may be submitted directly to a database server, such as database server 106 (FIG. 1), or
indirectly through a middleware application such as a search engine or function.
[0058] In one embodiment, an information value that is associated with one or more
particular sections of a document is specified at block 302. In a related embodiment, an
information value is specified at block 302 that is associated with at least two particular
sections of a document, and the order specified at block 304 includes an order in which to
execute sub-queries associated with the information value with respect to the at least two
particular sections of the document, as previously described.

PROCESS FOR MANAGING AN INFORMATION SEARCH-SERVER
[0059] FIG. 4 is a flow diagram that illustrates a process for managing an information
search. According to one aspect of the invention, this process is performed by a database
server.
[0060] At block 402, a query is received that includes a sequence of sub-queries and
specification of a progression in which to execute the sub-queries. For example, a query
that includes a CONTAINS clause for querying text is received, which includes a query
template as an argument of the clause, as described above. In one embodiment, the query
is received in the form of an XML document. The query that is received may, for
example, include sub-queries that are associated with a particular section of a document.
[0061] At block 404, one or more of the sub-queries are executed in an order
specified by the progression. In one embodiment, the process includes receiving a
request for a particular number of results from the search, for example, along with the
query received at block 402 or iteratively from an end user or otherwise. For an example
of the iterative process, an end user may request, with successive communications, query
results ten at a time. Thus, a database server or search mechanism successively provides
the results in response to those requests, i.e., ten at a time. Furthermore, each sub-query
of the one or more sub-queries is executed only if previously executed sub-queries have
not produced results sufficient to fulfill the request for the specified number of results.
For example, if the first executed sub-query produces thirty results and the user is
iteratively requesting hits ten at a time, then the second sub-query is not executed until the
user requests the thirty-first through fortieth hits.
[0062] In contrast to prior approaches using concatenated queries, all of the sub-
queries are not necessarily executed. Further, in contrast with prior approaches in which
each sub-query is submitted to the database server in series, the present embodiment can
submit multiple sub-queries to the database server using only one complete round-trip
communication between an end user that is requesting the search, such as at client 102
(FIG. 1) and a server that is searching for the information, such as database server 106
(FIG. 1). Control over which sub-queries are actually executed, and in what order, is
provided by the end user rather than assumed by a software application and/or search
engine or function, and is applied by the database server.
[0063] In one embodiment, all of the sub-queries select data from a database based on
a respective particular set of values, i.e., search criteria, and the progression corresponds
to a relaxation of the search criteria. For example, a sub-query is constructed for each of
the original search term and variations of the search term "cheap pen", using a particular

set of values such as (1) "cheap" and "pen", (2) a stem of "cheap" and "pen", and (3)
"cheap" or pen". Therefore, the search criteria "cheap pen" is progressively relaxed by
executing the associated sub-queries, which expand the search, according to the specified
progression.
IMPLEMENTATION MECHANISM-HARDWARE OVERVIEW
[0064] FIG. 5 is a block diagram that illustrates a computer system 500 upon which
an embodiment of the invention may be implemented. Computer system 500 includes a
bus 502 or other communication mechanism for communicating information, and a
processor 504 coupled with bus 502 for processing information. Computer system 500
also includes a main memory 506, such as a random access memory (RAM) or other
dynamic storage device, coupled to bus 502 for storing information and instructions to be
executed by processor 504. Main memory 506 also may be used for storing temporary
variables or other intermediate information during execution of instructions to be
executed by processor 504. Computer system 500 further includes a read only memory
(ROM) 508 or other static storage device coupled to bus 502 for storing static information
and instructions for processor 504. A storage device 510, such as a magnetic disk, optical
disk, or magneto-optical disk, is provided and coupled to bus 502 for storing information
and instructions.
[0065] Computer system 500 may be coupled via bus 502 to a display 512, such as a
cathode ray tube (CRT) or a liquid crystal display (LCD), for displaying information to a
computer user. An input device 514, including alphanumeric and other keys, is coupled
to bus 502 for communicating information and command selections to processor 504.
Another type of user input device is cursor control 516, such as a mouse, a trackball, or
cursor direction keys for communicating direction information and command selections
to processor 504 and for controlling cursor movement on display 512. This input device
typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis
(e.g., y), that allows the device to specify positions in a plane.
[0066] The invention is related to the use of computer system 500 for implementing
the techniques described herein. According to one embodiment of the invention, those
techniques are performed by computer system 500 in response to processor 504 executing
one or more sequences of one or more instructions contained in main memory 506. Such
instructions may be read into main memory 506 from another computer-readable
medium, such as storage device 510. Execution of the sequences of instructions
contained in main memory 506 causes processor 504 to perform the process steps

described herein. In alternative embodiments, hard-wired circuitry may be used in place
of or in combination with software instructions to implement the invention. Thus,
embodiments of the invention are not limited to any specific combination of hardware
circuitry and software.
[0067] The term "computer-readable medium" as used herein refers to any medium
that participates in providing instructions to processor 504 for execution. Such a medium
may take many forms, including but not limited to, non-volatile media, volatile media,
and transmission media. Non-volatile media includes, for example, optical, magnetic, or
magneto-optical disks, such as storage device 510. Volatile media includes dynamic
memory, such as main memory 506. Transmission media includes coaxial cables, copper
wire and fiber optics, including the wires that comprise bus 502. Transmission media can
also take the form of acoustic or light waves, such as those generated during radio-wave
and infra-red data communications.
[0068] Common forms of computer-readable media include, for example, a floppy
disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-
ROM, DVD, any other optical or magneto-optical medium, punchcards, papertape, any
other physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-
EPROM, any other memory chip or cartridge, a carrier wave as described hereinafter, or
any other medium from which a computer can read.
[0069] Various forms of computer readable media may be involved in carrying one or
more sequences of one or more instructions to processor 504 for execution. For example,
the instructions may initially be carried on a magnetic disk of a remote computer. The
remote computer can load the instructions into its dynamic memory and send the
instructions over a telephone line using a modem. A modem local to computer system
500 can receive the data on the telephone line and use an infra-red transmitter to convert
the data to an infra-red signal. An infra-red detector can receive the data carried in the
infra-red signal and appropriate circuitry can place the data on bus 502. Bus 502 carries
the data to main memory 506, from which processor 504 retrieves and executes the
instructions. The instructions received by main memory 506 may optionally be stored on
storage device 510 either before or after execution by processor 504.
[0070] Computer system 500 also includes a communication interface 518 coupled to
bus 502. Communication interface 518 provides a two-way data communication coupling
to a network link 520 that is connected to a local network 522. For example,
communication interface 518 may be an integrated services digital network (ISDN) card
or a modem to provide a data communication connection to a corresponding type of

telephone line. As another example, communication interface 518 may be a local area
network (LAN) card to provide a data communication connection to a compatible LAN.
Wireless links may also be implemented. In any such implementation, communication
interface 518 sends and receives electrical, electromagnetic or optical signals that carry
digital data streams representing various types of information.
[0071] Network link 520 typically provides data communication through one or more
networks to other data devices. For example, network link 520 may provide a connection
through local network 522 to a host computer 524 or to data equipment operated by an
Internet Service Provider (ISP) 526. ISP 526 in turn provides data communication
services through the world wide packet data communication network now commonly
referred to as the "Internet" 528. Local network 522 and Internet 528 both use electrical,
electromagnetic or optical signals that carry digital data streams. The signals through the
various networks and the signals on network link 520 and through communication
interface 518, which carry the digital data to and from computer system 500, are
exemplary forms of carrier waves transporting the information.
[0072] Computer system 500 can send messages and receive data, including program
code, through the network(s), network link 520 and communication interface 518. In the
Internet example, a server 530 might transmit a requested code for an application program
through Internet 528, ISP 526, local network 522 and communication interface 518.
[0073] The received code may be executed by processor 504 as it is received, and/or
stored in storage device 510, or other non-volatile storage for later execution. In this
manner, computer system 500 may obtain application code in the form of a carrier wave.
EXTENSIONS AND ALTERNATIVES
[0074] Alternative embodiments of the invention are described throughout the
foregoing description, and in locations that best facilitate understanding the context of the
embodiments. Furthermore, the invention has been described with reference to specific
embodiments thereof. It will, however, be evident that various modifications and changes
may be made thereto without departing from the broader spirit and scope of the invention.
For example, embodiments are described in the context of database queries; however, the
techniques described are applicable to electronic searches for information, in general. For
another example, embodiments are described in the context of text searches; however, the
techniques are applicable to searches for types of information other than text. Therefore,
the specification and drawings are, accordingly, to be regarded in an illustrative rather
than a restrictive sense.

[0075] In addition, in this description certain process steps are set forth in a particular
order, and alphabetic and alphanumeric labels may be used to identify certain steps.
Unless specifically stated in the description, embodiments of the invention are not
necessarily limited to any particular order of carrying out such steps. In particular, the
labels are used merely for convenient identification of steps, and are not intended to
specify or require a particular order of carrying out such steps.

WE CLAIM :
1. A method for an information search comprising the steps of:
receiving a query (402) at a database server (106), the query comprising a sequence of
sub-queries and a specification of a progression in which to execute the sub-queries; and
executing (404) one or more of the sub-queries in an order specified by the
progression, characterized in that each sub-query is executed only if the results of the
previously executed sub-queries are not sufficient to fulfill a search request.
2. The method as claimed in Claim 1, comprising the steps of:
receiving (402) a request for a particular number of results from the search; and
wherein each sub-query of the one or more sub-queries is executed (404) only if one
or more previously executed sub-queries do not fulfill the request for the particular number of
results.
3. The method as claimed in Claim 1, wherein all the sub-queries select data from a
database (108) based on respective search criteria, and wherein the progression corresponds
to a relaxation of original search criteria associated with the information search.
4. The method as claimed in Claim 1, wherein the progression is specified by an end
user (102) that is requesting the information search.
5. The method as claimed in Claim 3, wherein the progression is specified through
interaction with a user interface (104).
6. The method as claimed in Claim 1, wherein the step of executing (404) one or more,
sub-queries comprises the steps of:
executing a first sub-query, according to the progression, of the sequence of sub-
queries;
providing results from the execution of the first sub-query;
receiving a request to provide more results;
in response to receiving the request, executing a second sub-query (404), according to
the progression, of the sequence of sub-queries; and
providing results from the execution of the second sub-query.

7. The method as claimed in Claim 1, wherein the step of receiving comprises
receiving search criteria associated with the sequence of sub-queries and the progression in
the form of an XML data.
8. The method as claimed in Claim 1, wherein the step of receiving comprises
receiving a query from an electronic search engine (104) that constructs the query based on a
set of one or more rules, wherein the set is specified by an end user that is requesting the
information.
9. The method as claimed in Claim 1, wherein the step of receiving comprises the step
of receiving a set of search criteria specified by an end user (102) and a request (306) to
construct a database query based on the search criteria; and wherein the method comprises
the step of:
constructing (206) a database query based on the search criteria.
10. The method as claimed in Claim 1, wherein the step of receiving a query
comprises receiving a query that specifies search criteria in association with one or more
sections of a document, and receiving a progression that specifies an order in which to
execute sub-queries associated with the search criteria against the one or more sections of the
document.
11. A method for searching of information comprising the steps of:
specifying (302) search criteria related to the information being searched for;
specifying (304) an order in which to execute sub-queries associated with the search
criteria; and
transmitting (306) a query including the search criteria and the order, for execution of
one or more of the sub-queries by a database server in the order specified, wherein each sub-
query is executed only if the results of the previously executed sub-queries are not sufficient
to fulfill a search request.
12. The method as claimed in Claim 11, wherein the steps of specifying (302, 304) are
initiated by an end user (102) that is requesting the search for information.
13. The method as claimed in Claim 11, wherein the step of specifying (302) search
criteria comprises specifying (304) a single set of values, and wherein the step of specifying

an order comprises specifying an ordered set of one or more rules associated with the set of
values.
14. The method as claimed in Claim 13, wherein the ordered set of one or more rules
comprises a logical operator.
15. The method as claimed in Claim 11, wherein the steps of specifying comprise
specifying in the form of an XML data.
16. The method as claimed in Claim 11, wherein the step of specifying search criteria
comprises specifying search criteria (302) in association with one or more sections of a
document, and wherein the step of specifying an order (304) comprises specifying an order in
which to execute sub-queries associated with the search criteria (302) against the one or more
sections of the document.
17. The method as claimed in Claim 16, wherein the search criteria is specified (304)
in association with at least two particular sections of a document, and wherein the order
specifies to execute (404) sub-queries associated with the search criteria against the at least
two particular sections of the document in a particular order.
18. A computer apparatus for implementing the method for information search, as
claimed in Claims 1-17, comprising:
a database server (106) for receiving a query, the query including a sequence of sub-
queries and a specification of a progression in which to execute the sub-queries; and
means (210) for executing one or more of the sub-queries in an order specified by the
progression;
wherein each sub-query is executed only if the results of the previously executed sub-
queries are not sufficient to fulfill a search request.



ABSTRACT

TITLE: PROGRESSIVE RELAXATION OF SEARCH CRITERIA.
A method for managing an information search, the method comprising the
steps of:
receiving a query (402) at a database server (106), the query comprising
a sequence of sub-queries and a specification of a progression in which to
execute the sub-queries; and
executing (404) one or more of the sub-queries in an order specified by the
progression,
characterized in that each sub-query is executed only if the results of the
previously executed sub-queries are not sufficient to fulfill a search
request.

Documents:

02065-kolnp-2005-abstract.pdf

02065-kolnp-2005-claims.pdf

02065-kolnp-2005-description complete.pdf

02065-kolnp-2005-drawings.pdf

02065-kolnp-2005-form 1.pdf

02065-kolnp-2005-form 2.pdf

02065-kolnp-2005-form 3.pdf

02065-kolnp-2005-form 5.pdf

02065-kolnp-2005-international publication.pdf

2065-KOLNP-2005-(11-04-2012)-CORRESPONDENCE.pdf

2065-KOLNP-2005-(11-04-2012)-FORM-1.pdf

2065-KOLNP-2005-ABSTRACT 1.1.pdf

2065-KOLNP-2005-AMANDED CLAIMS.pdf

2065-KOLNP-2005-ASSIGNMENT.pdf

2065-KOLNP-2005-CANCELLED PAGES.pdf

2065-KOLNP-2005-CLAIMS 1.1.pdf

2065-KOLNP-2005-CORRESPONDENCE 1.2.pdf

2065-KOLNP-2005-CORRESPONDENCE-1.1.pdf

2065-KOLNP-2005-CORRESPONDENCE-1.3.pdf

2065-KOLNP-2005-CORRESPONDENCE.pdf

2065-KOLNP-2005-DESCRIPTION (COMPLETE) 1.1.pdf

2065-KOLNP-2005-DRAWINGS 1.1.pdf

2065-KOLNP-2005-EXAMINATION REPORT.pdf

2065-KOLNP-2005-FORM 1-1.2.pdf

2065-KOLNP-2005-FORM 1.1.1.pdf

2065-KOLNP-2005-FORM 13.pdf

2065-KOLNP-2005-FORM 18.pdf

2065-KOLNP-2005-FORM 2.1.1.pdf

2065-KOLNP-2005-FORM 26.pdf

2065-KOLNP-2005-FORM 3.1.1.pdf

2065-KOLNP-2005-FORM 3.pdf

2065-KOLNP-2005-GPA.pdf

2065-KOLNP-2005-GRANTED-ABSTRACT.pdf

2065-KOLNP-2005-GRANTED-CLAIMS.pdf

2065-KOLNP-2005-GRANTED-DESCRIPTION (COMPLETE).pdf

2065-KOLNP-2005-GRANTED-DRAWINGS.pdf

2065-KOLNP-2005-GRANTED-FORM 1.pdf

2065-KOLNP-2005-GRANTED-FORM 2.pdf

2065-KOLNP-2005-GRANTED-SPECIFICATION.pdf

2065-KOLNP-2005-OTHERS.pdf

2065-KOLNP-2005-OTHERS1.1.pdf

2065-KOLNP-2005-PETITION UNDER RULE 137.pdf

2065-KOLNP-2005-REPLY TO EXAMINATION REPORT.pdf

abstract-02065-kolnp-2005.jpg


Patent Number 253161
Indian Patent Application Number 2065/KOLNP/2005
PG Journal Number 27/2012
Publication Date 06-Jul-2012
Grant Date 28-Jun-2012
Date of Filing 20-Oct-2005
Name of Patentee ORACLE INTERNATIONAL CORPORATION
Applicant Address 500 ORACLE PARKWAY, REDWOOD SHORES, CA 94065 UNITED STATES OF AMERICA
Inventors:
# Inventor's Name Inventor's Address
1 DIXON, PAUL 302 ORCHARD AVENUE REDWOOD CITY, CA 94065, UNITED STATES OF AMERICA
2 ALPHA, SHAMIM OLD-FERRIGHAT ROAD PABNA 6600, BANGLADESH
PCT International Classification Number G06F 17/30
PCT International Application Number PCT/US2004/010020
PCT International Filing date 2004-03-31
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 10/434, 845 2003-05-08 U.S.A.