<!DOCTYPE TEI.2 PUBLIC '-//TEI//DTD TEI Lite 1.0//EN'
    "file:///d:/SGML/Public/TEI/Derived/teixlite.dtd" [
<!--*
    "http://www.hcu.ox.ac.uk/TEI/Lite/DTD/teixlite.dtd" [
*-->
<!ENTITY amp "&#38;">
]>
<!--* 
.* TEI Document No:  ED P3
.* Title:  Resolution of Theory Conflicts
.* Drafted:  extracted from EDP1 16 Feb 89
.* Revised:  touched May 22 (reason not recorded)
.*           document date corrected, 18 July 89
.*
*-->
<TEI.2>
<teiHeader>
<fileDesc>
<titleStmt>
<title>TEI ED P3: Resolution of Theory Conflicts</title>
</titleStmt>
<publicationStmt>
<publisher>Text Encoding Initiative</publisher>
<date>1989</date>
</publicationStmt>
<sourceDesc>
<p>Created in electronic form.
Retagged 19 July 2001 from Waterloo Script source.</p>
</sourceDesc>
</fileDesc>
<revisionDesc>
<list>
<item>2001-07-19 : CMSMcQ : retagged from Waterloo Script to TEI Lite.</item>
<item>1989-05-22 : CMSMcQ : touched for some unknown reason.</item>
<item>1989-02-16 : CMSMcQ : extracted this from ED P1.</item>
</list>
</revisionDesc>
</teiHeader>
<text>
<front>
<titlePage>
<docTitle n='TEI EDP3'>
<titlePart>Theoretical Stance</titlePart>
<titlePart>And Resolution of Theory Conflict</titlePart>
</docTitle>
<titlePart>TEI EDP3</titlePart>
<docDate>22 May 1989</docDate>
</titlePage>

<div1 type="abstract">
<p>
All markup schemes implicitly rely upon a theory of textual objects.
This document discusses six approaches to the problem of developing
a useful conceptual basis for markup in fields marked by diversity
of theoretical approach.  They are:
<list type="ordered">
<item>reliance upon a single theory</item>
<item>pluralism (informal description of several theories)</item>
<item>formal pluralism (formal descriptions of several theories)</item>
<item>eclecticism</item>
<item>controlled semantics</item>
<item>polytheoretical consensus</item>
</list>
</p>
</div1>
</front>
<body>
<div1 type="section"><head>The Problem of Theoretical Diversity</head>
<p>
Textual features, their attributes, and their structural relations
cannot be postulated in a conceptual vacuum.  They require some
theoretical basis.  In many fields (e.g. classical metrics) scholars
agree on many or all of the pertinent features and their
characteristics; in others, several divergent theories posit different
sets of textual features; in still others, scholars disagree without the
theoretical bases of the disagreement becoming visible.
</p>
<p>
In the extreme cases, theoretical diversity poses no problems for the
working committees:  on the one hand, a clear scholarly consensus can
readily be translated into a single list of textual features, while on
the other a lack of theoretical clarity will make it virtually
impossible to elicit any consensus as to the textual features at stake,
and no tag set can be developed at all.
</p>
<p>
The middle case, however, presents the working committees with a
delicate problem.
</p>
</div1>

<div1><head>Harmonization of Theoretical Conflict</head>
<p>
At one extreme, tags might be provided only to represent the features
tagged by a given theory, without any consideration given to their
relation to similar features used by other theories.  At the other
extreme, a tag set might express a consensus among representatives of
various theories and provide a <q>theory neutral</q> or
<q>poly-theoretical</q> notation for the expression of analytic results.
</p>
<p>
Such harmonization or resolution of theoretical diversity takes place
over a set of <q>systems.</q> The universe of systems to be considered
comprises:
<list>
<item>different theories current in the field</item>
<item>different practices current in the field</item>
<item>schemes used in the various corpora relevant to the field</item>
</list>
</p>
<p>
Six levels of theoretical harmonization can be specified; work may, but
need not, progress through the six levels sequentially.  In developing a
tag set to encode the analytic results of a field, a number of
possibilites exist:
<list type="ordered">
<item><p>Choice of a single theory:
provide tags for a single system, ignoring the others in the field.</p></item>
<item><p>Pluralism (informal):
elicit, for each system, a full description of the system (as it applies
to text encoding), including
<list type="ordered"><item><p>list of features</p></item>
<item><p>examples of each feature</p></item>
<item><p>structural properties of each feature (especially combinations
with other features and the like)</p></item>
<item><p>test criteria for the recognition of the feature</p></item>
<item><p>formalizations available for the system</p></item>
</list>
This will be possible for different theories in different measure.</p></item>
<item><p>Pluralism (formal):
generate an SGML formalization for the feature set of each system.
If SGML syntax does not suffice, the metalanguage committee must be
asked to consider or develop extensions to handle the recalcitrant
features.
</p>
<p> 
Note that at this low level of formalization, different theories will
have separate and incongruent tag sets.  The meaning of any tag will
be defined only in natural language, and users of different theoretical
orientations will be responsible for any translation into their own
terms.  The same generic identifier might be used for textual features
postulated by different theories, and thus be ambiguous when viewed in
isolation.  The two following approaches handle this ambiguity
differently.</p></item>
<item><p>Eclecticism:
define a single tag set created by the union of all system-specific
tag sets, eliminating ambiguity by giving each theoretically distinct
textual feature a unique generic identifer (<q>tag</q>).  Users of the
scheme will be expected to tag some subset of the features in the set,
mixing and matching as they wish.</p></item>
<item><p>Controlled semantics:
eliminate ambiguity of generic identifiers by the explicit formal
definition of the linguistic and computational meaning of the tags.
Different usages of the same term must at this level be reduced to a
finite list of questions with enumerable sets of possible answers.
Ultimately, of course, the meaning of these questions and their answers
will be expressed in natural languages, so that this cannot amount to a
full specification of meaning.  The formal system, however, will be
constrained more fully by these formal definitions than at the lower
levels of conflict resolution.
.*
<note place="foot"><p>
As an example of formalization at this level, let us consider the
preparation of lemmatized frequency lists.  Lemmatizing practice varies
on issues such as the forms of the definite article (are French
<hi>le,</hi> <hi>la,</hi> <hi>l',</hi> <hi>les,</hi> one lemma,
two, three, or four?  do the tokens <hi>au, aux</hi> contain one lemma
each or two?), but range over a finite number of possible solutions to a
finite number of standard problems.  The lemmatization practice of
existing lists may thus be fully defined by the answers to a finite
number of questions (e.g.  <q>Are articles of different gender reduced
to separate lemmata, or the same lemma?</q> or <q>Are words which
combine a preposition and an article reduced to one lemma or two?</q>),
such that each answer is drawn from a well-defined range of possible
answers.  In practice, these questions and answers provide a sufficient
definition of the meaning of the tag <code>&amp;lit tag='lemma'></code>.
</p></note>
</p>
<p>
Now, for any given finite list of questions, with well-defined ranges
for each answer, a set of SGML tags and attributes can be created which
specifies answers to each question.  An SGML processor could then
ascertain that answers are specified for each question (although it
could not necessarily check the consistency of the answers with the
actual practice in the encoding).</p></item>
<item><p>Polytheoretical consensus:
provide the smallest possible single set of features which includes as a
subset each set of features used by an existing system, or else define
explicit mappings from the tag sets provided for one system into the tag
sets provided for another.  Unlike an eclectic tag/feature set, a
polytheoretical set avoids all redundancy and is expected to be used
as a unit, with all features tagged, so that a text tagged with such a
set is useful to researchers of widely varying theoretical persuasions.
</p></item>
</list>
Each working committee must decide on the basis of its own knowledge how
much theoretical harmonization is possible in a given area.  The working
committees should strive for the highest level of harmonization they
believe feasible, given the theoretical climate and the resources at
hand.
</p>
</div1>
</body>
</text>
</TEI.2>
<!-- Keep this comment at the end of the file
Local variables:
mode: xml
sgml-default-dtd-file:(concat sgmlvol "/SGML/Public/Emacs/teilite.ced")
sgml-omittag:t
sgml-shorttag:t
End:
-->
