Exhaustivity
According to WordNet 2 the adjective "exhaustive" means
through or thoroughgoing (very thorough; exhaustively complete) "an exhaustive
study"; "made a thorough search"; "thoroughgoing research"
In
classification it is often demanded that classes should be collectively
exhaustive (sometimes thought of as covering the whole
universe of knowledge) and mutually exclusive, meaning that any item should have a place in
the system and that there is no overlapping between classes. This principle is
also known from probability theory:
"In probability theory, a set of events is collectively
exhaustive if at least one of the events must occur. For example,
when rolling a six-sided die, the outcomes 1, 2, 3, 4, 5, and 6
are collectively exhaustive, because they encompass the entire
range of possible outcomes.
Compare this to the concept of a set of outcomes which are
mutually exclusive, which means that at most one of the events
must occur. The set of all possible die rolls is both
collectively exhaustive and mutually exclusive." (Wikipedia,
2005).
Exhaustivity is
also known in classification and indexing theory and in information retrieval.
Major and minor
descriptors are examples on how some databases applies different levels of
indexing exhaustivity.
"Traditionally
the two most important factors governing the effectiveness of an index language
have been thought to be the exhaustivity of indexing and the
specificity of the index language. There has been much debate about the
exact meaning of these two terms. Not wishing to enter into this controversy I
shall follow Keen and Digger[1972] in giving a working definition of each.
For any
document, indexing exhaustivity is defined as the number of different
topics indexed, and the index language specificity is the ability of the
index language to describe topics precisely. Keen and Digger further define
indexing specificity as the level of precision with which a document is
actually indexed. It is very difficult to quantify these factors. Human
indexers are able to rank their indexing approximately in order of increasing
exhaustivity or specificity. However, the same is not easily done for
automatic indexing.
It is
of some importance to be able to quantify the notions of indexing exhaustivity
and specificity because of the predictable effect they have on retrieval
effectiveness. It has been recognized (Lancaster[1968]) that a high level of
exhaustivity of indexing leads to high recall* and low precision*. Conversely, a
low level of exhaustivity leads to low recall and high precision. The converse
is true for levels of indexing specificity, high specificity leads to high
precision and low recall, etc. It would seem, therefore, that there is an
optimum level of indexing exhaustivity and specificity for a given user
population.
Quite a
few people (Sparck Jones[1972, 1973], Salton and Yang[1973]), have attempted to
relate these two factors to document collection statistics. For example,
exhaustivity can be assumed to be related to the number of index terms assigned
to a given document, and specificity related to the number of documents to which
a given term is assigned in a given collection" (van Rijsbergen, 1979, p. 24-25)
Literature:
Keen, E. M. &
Digger, J. A. (1972). Report of an Information Science Index Languages Test.
Aberystwyth College of Librarianship, Wales.
Lancaster, F. W.
(1968). Information Retrieval Systems: Characteristics, Testing and
Evaluation. New York: Wiley.
Salton, G. &
Yang, C. S. (1973). On the specification of term values in automatic indexing.
Journal of Documentation, 29, 351-372.
Sparck Jones, K. (1972).
A statistical interpretation of term specificity and its application in
retrieval, Journal of Documentation, 28, 111-21.
Sparck Jones, K. (1973). Does Indexing Exhaustivity Matter? Journal of the
American Society for Information Science, 24(5), 313-316.
van Rijsbergen,
C. J. (1979). Information Retrieval. 2. ed. London: Butterworths. Online
edition 1999.
http://www.dcs.gla.ac.uk/~iain/keith/
Wellish, H. H. (1995). Indexing from A to Z.
2nd edition. New York: H. W. Wilson. (Exhaustivity pp.
175-180).
Wikipedia.
The free encyclopedia. (2005). Collectively exhaustive.
http://en.wikipedia.org/wiki/Collectively_exhaustive
Birger Hjørland
Last edited: 01-08-2006
HOME