|
Server : Apache/2.4.62 System : FreeBSD fbsdweb2.web.rcn.net 14.1-RELEASE FreeBSD 14.1-RELEASE releng/14.1-n267679-10e31f0946d8 GENERIC amd64 User : www ( 80) PHP Version : 8.3.8 Disable Function : NONE Directory : /domains/markrose/ |
Upload File : |
<HTML>
<HEAD> <TITLE>The sci.lang FAQ: 21 - 29</TITLE></HEAD>
<BODY BGCOLOR="#DDDDFF" TEXT="#000000">
<H2>The sci.lang FAQ: 21 - 29</H2>
<P><HR>
<STRONG><A NAME="21">21</A> <IMG Align=Top SRC="redball.gif"> How do you look up a word in a Chinese or Japanese dictionary?</STRONG>
<P><A HREF="lang18.html#20">[Previous]</A> <A HREF="#22">[Next]</A> <A HREF="langfaq.html">[Index]</A>
<P>[--markrose]
<P>The vast majority of Chinese characters can be divided into two parts, the radical and the phonetic. Each part is another, simpler character. The <B>radical</B> gives an idea of the meaning-- rather a vague idea, since traditionally there were only 214 different radicals. The <B>phonetic</B> identifies the sound, with a bit more precision: generally, all the characters that share a phonetic rhymed 2000 years ago in Archaic Chinese.
<P><IMG align=Top width=315 height=216 SRC="chinese.gif">
<P>The radical (shown in the above characters in red) is used only for its meaning; its pronunciation is irrelevant. The phonetic (shown in blue) is used only for its sound; its meaning is irrelevant. Note that a single character, such as <I>nü<sup><font size=1>3</font></sup></I> 'woman' or <I>kôu</I> 'mouth' above, can be a radical in one character and a phonetic in another. The case of <I>gu</I> 'aunt', itself built out of radical + phonetic, but used for its own phonetic value in <I>gu</I> 'type of mushroom', is also fairly common.
<P>Characters are arranged in most Chinese dictionaries by radical. To find an unknown character, then, you identify the radical, and look up its section in the dictionary. The radicals are arranged in order of increasing complexity. Each radical's section is ordered by the number of strokes in the character. Several characters may have the same number of strokes; these must simply be scanned till the right one is found.
<P>Sometimes it isn't easy to identify the radical-- it's in an odd position (e.g. on the bottom or the right rather than the top or left side-- cf. <I>rú</I> 'like' above); or it's drawn in an abbreviated form; or it's not clear which of several similar radicals the character is listed under. It's also important to know the proper method for counting strokes (e.g. <I>nü<sup><font size=1>3</font></sup></I> 'woman', <I>kôu</I> 'mouth', and <I>ma</I> 'horse' all count as three strokes).
<P>If a character isn't composed of a radical + phonetic, it's usually treated as one, graphically, for the purposes of dictionary lookup. For instance, the character for <I>hâo</I> 'good' is composed of the characters for 'woman' and 'child'-- a <I>semantic</I> compound. It's simply listed under the <I>nü<sup><font size=1>3</font></sup></I> 'woman' radical, although <I>zî</I> 'child' is not a phonetic.
<P>The People's Republic simplified a number of characters and radicals, and this changed the number of radicals-- there's 224 in my dictionary, for instance. The Japanese have made their own separate simplification.
<ul>
<li><A HREF="yingzi/yingzi.htm">More on how Chinese characters work</a>
<li><A HREF="http://zhongwen.com">A gloriously interactive Chinese dictionary</a>
</ul>
<P><HR>
<STRONG><A NAME="22">22</A> <IMG Align=Top SRC="redball.gif"> What about Nostratic and Proto-World?</STRONG>
<P><A HREF="#21">[Previous]</A> <A HREF="#23">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<P>[--markrose]
<P>In recent years some some linguists have attempted to reconstruct languages far older than Indo-European.
<P><B>Nostratic</B>, said to underlie the Indo-European, Kartvelian (South Caucasian), Afro-Asiatic, Dravidian, Uralic, Altaic, Chukchi-Kamchatkan, and Eskimo-Aleut families, was first proposed by Holger Pedersen in 1903. More recently the greater part of work on Nostratic has been done by Soviet linguists led by Vladislav Illich-Svitych, Aaron Dolgopolsky, and Vitaly Shevoroshkin.
<P>The methodology is the traditional comparative method, and over 600 roots have been proposed. Most linguists remain skeptical, believing that chance processes will have obscured any relationship at this level beyond reconstruction, or question the accuracy of the derivations (a charge which makes Nostraticists bristle). Others simply suspend judgment, especially since much of the supporting material for Nostratic is available only in Russian.
<P>A good overview on Nostratic is Kaiser and Shevoroshkin, "Nostratic", in the <CITE>Annual Review of Anthropology</CITE>, 17:309. Illich-Svitych's original Russian article (from <CITE>Etymologia</CITE>, 1965) has been translated in Shevoroshkin, ed., <CITE>Reconstructing Languages and Cultures</CITE> (1989).
<P>Joseph Greenberg has proposed a grouping which covers much the same language areas (omitting Afro-Asiastic and Dravidian, but adding Ainu and Gilyak), called <B>Eurasiatic</B>. Greenberg's method of <B>mass comparison</B> (which he has also used to group together almost all Native American languages into one superfamily, Amerind) basically consists of assembling huge lists of common words and doing eyeball comparisons.
<P>This methodology has been severely criticized by many historical linguists. If 'mass comparison' were applied to the Indo-European languages, it would be bedevilled by false positives (caused by borrowing or chance) and by specious phonetic or semantic similarites. Greenberg's methods seem to linguists to abandon the very methodological severity which has put Indo-European linguistics on a scientific footing, and distinguished it from the work of cranks. Relax the rules enough, and you can derive any language from any other.
<P>Greenberg replies that the patterns he has found are compelling enough to justify his methods, and that he is merely following in the footsteps of the originators of the comparative method: linguists had to decide that the Indo-European languages were related before attempting reconstructions.
<P>The ultimate areal comparison would be <B>Proto-World</B>, the hypothetical ancestor of all human languages. Greenberg has mentioned Proto-World, but since he is not much interested in reconstruction, his proposal is not much more than a statement of the monogenetic theory (a single origin for all languages). Most linguists are skeptical that anything could be reconstructed at this hypothetical time depth.
<P>Greenberg's work on Amerind can be found in </CITE>Language in the Americas</CITE> (1987); on Eurasiatic, in the forthcoming <CITE>Indo-European and Its Closest Relatives: The Eurasiatic Language Family</CITE>. Introductions to the Nostratic and Proto-World controversies were published in both <CITE>The Atlantic</CITE> and <CITE> Scientific American</CITE> in April 1991. The essays in Lamb and Mitchell, eds., <CITE>Sprung From Some Common Source</CITE> (1991), are also relevant.
<P>Loren Petrich maintains an <A HREF="http://www.webcom.com/petrich/writings/NostraticRefs.txt">annotated bibliography</a> on Indo-European, Nostratic, and Proto-World. I am also indebted to Peter Michalove for citations used in this entry.
<P><HR>
<STRONG><A NAME="23">23</A> <IMG Align=Top SRC="redball.gif"> What are phonemes and why's it so hard to lose a foreign accent?</STRONG>
<P><A HREF="#22">[Previous]</A> <A HREF="#24">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<P>[--markrose]
<p>The sounds (<b>phones</b>) humans can make are infinite; there's (almost always) a continuum of phones between any two phones.
<p>In any one language, however, phones are grouped into 20 to 60 or so discrete groups of sounds called <b>phonemes</b>. The range of variation for each phoneme is discounted by speakers and hearers of the language, who perceive the entire range as "the same sound."
<p>The diversity of phones, and their grouping into phonemes, can be clearly seen on this chart from William Labov's <i>Principles of Linguistic Change</i> (1994). The chart is a graph of formant frequencies F1 against F2 for the main vowels of fifty words as spoken by a single person-- in effect, a plot of fifty actual phones. (The words on the chart-- beat, bait, etc.-- are not the words being spoken, but just examples of words with those vowel sounds.)
<br><IMG Align=Top SRC="labov.gif">
<p>(Most of the sounds plotted are <b>diphthongs</b>, which are glides between two sounds; this accounts for some of the overlaps on the diagram (and for the little arrows on the symbols). For instance, the sounds Labov calls <b>ay</b> and <b>aw</b> start in about the same place, but ay heads 'northwest' toward [i] and aw heads 'northeast' toward [u].)
<p>The English phoneme /p/ has two phonetic realizations or <b>allophones</b>: aspirated [p<sup><font size=-1>h</font></sup>] beginning a word and non-aspirated [p] elsewhere. But since the two types of /p/ never distinguish one word from another, speakers of English generally don't even perceive the difference. (Linguists write phonemic transcriptions between /slashes/, and phonetic transcriptions in [brackets].)
<p>If we can find two words with different meaning but only one difference in sound between them-- a <b>minimal pair</b>-- then we've found distinct phonemes; e.g. /p/ and /b/ in English 'pit' and 'bit'. If two sounds never occur in the same phonetic environment (e.g. English [p] and [p<sup><font size=-1>h</font></sup>])-- if they're in <b>complementary distribution</b>-- then they're probably allophones of a single phoneme.
<p>Other languages do not divide up the phonetic space in the same way.
For instance, /p/ and /p<sup><font size=-1>h</font></sup>/ <i>are</i> separate phonemes in Mandarin Chinese (as in /pa<sup><font size=1>1</font></sup>/ 'eight' and /p<sup><font size=-1>h</font></sup>a<sup><font size=1>1</font></sup>/ 'flower'). And the vowels of <i>late</i> and <i>let</i>, phonemes in English, are allophones of a single phoneme /e/ in Spanish.
<p>We're trained from childhood to make the phonetic distinctions our language uses to keep its phonemes apart, and to <i>ignore</i> those that lie within phonemes. Learning to make different distinctions in a foreign language is quite difficult-- usually <i>harder</i> than making new sounds our native language lacks entirely. We'll continue to have an accent in the new language so long as we hear its sounds through our native language's phonemic filter.
<P><HR>
<STRONG><A NAME="24">24</A> <IMG Align=Top SRC="redball.gif"> How likely are chance resemblances between languages?</STRONG>
<P><A HREF="#23">[Previous]</A> <a href="#25">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<p>[--markrose]
<p>It depends-- to an astonishing degree-- on the amount of phonetic and
semantic leeway you allow for a match. But in general the answer is
"Quite likely."
<p>For the sort of comparisons that are often posted to sci.lang, where
perhaps just two consonants match, or nearly match, and the semantic
matchups are quirky, one can expect literally hundreds of random matches.
<ul>
<li><a href="chance.htm">Detailed discussion</a>
</ul>
<P><HR>
<STRONG><A NAME="25">25</A> <IMG Align=Top SRC="redball.gif"> How are tone languages sung?</STRONG>
<P><A HREF="#24">[Previous]</A> <A HREF="#26">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<P>[--markrose]
It varies. Tones are basically ignored in Mandarin Chinese songs,
for instance. (Does this make them hard to understand? Often, yes.)
However, Cantonese songs are generally written in such a way as to
preserve the relative pitch of successive syllables. E.g. a low tone
following a high tone will be on a lower note.
For more, see Marjorie Chan's paper on <a href="http://deall.ohio-state.edu/chan.9/articles/bls13.htm">Tone and Melody in Cantonese</a>.
<P><HR>
<STRONG><A NAME="26">26</A> <IMG Align=Top SRC="redball.gif"> Why are there so many words for Germany?</STRONG>
<P><A HREF="#25">[Previous]</A> <A HREF="#27">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<p>Basically, because there were Germans before there was a Germany.
Each of the Germans' neighbors came up with their own name for
them, long before there was a German state that people might want
to refer to uniformly.
<p><b>German</b> is a relatively recent borrowing from Latin <i>Germanus</i>, whose origins
are uncertain. It's been referred to Latin <i>germanus</i> 'brotherly', Germanic <i>*geromann-</i> 'spear-man', Old Irish <i>gair</i> 'neighbour', etc.
<p><b>Deutsch</b> comes from Proto-Germanic <i>*theudisko-z</i> 'of the people', from <i>*theudâ</i> 'people, nation';
originally it was used to distinguish the speech of the people from Latin, the language of scholarship. The English word 'Dutch' is a derivative, and used to be used for any northern Germanic people, later narrowed down to those closest to England; the older usage is preserved in 'Pennsylvania Dutch'.
<p>The word <i>*theudâ</i> survived into Middle English as <i>thede</i>, but was supplanted by Romance borrowings such as 'people' and 'nation'.
Non-Germanic cognates include Oscan <i>touto</i>, Irish <i>tu:ath</i>, and Lithuanian <i>tauta</i>, all meaning 'people'.
<p><p>Italian <b>tedesco</b> is another derivative of <i>*theudisko-z</i>.
<p><b>Teutonic</b> derives from a name of an ancient tribe in Jutland, the <i>Teutones</i>; if these were a German tribe their name is presumably another derivative of <i>*theudâ</i>.
<p>French <b>allemand</b> (and Spanish <i>alemán</i>, etc., as well as older English <i>Almain</i>) derive from
a particular tribe of Germans, the <i>Alemanni</i> ('all the men').
<p>Finnish <b>saksa</b> derives from the name of another tribe, the Saxons.
<p>Russian <b>nemets</b> is related to <i>nemoj</i> 'dumb, mute'; to the ancient Slavs, not speaking in an understandable language was as good as not speaking at all. Hungarian <i>német</i> is borrowed from Slavic.
<p>Latvian <b>Va:cija</b> may derive from a word meaning 'west'.
<P><HR>
<STRONG><A NAME="27">27</A> <IMG Align=Top SRC="redball.gif"> Why do both English and French have plurals in <b>-s</b>?</STRONG>
<P><A HREF="#26">[Previous]</A> <A HREF="#28">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<P>[--Miguel Carrasquer Vidal (adapted by markrose)]
<p>Despite what one might think, these are independent developments.
<p>The <b>English</b> s-plural comes
from the PIE o-stem nominative plural ending <i>*-o:s</i>,
apparently extended in Germanic to <i>*-o:s-es</i> by addition of the PIE
plural suffix <i>*-es</i> (<i>*-o:s</i> itself comes from <i>*-o-es</i>). This <i>*-o:ses</i>
became Proto-Germanic <i>*-o:ziz</i> or <i>*-o:siz</i>, depending on the accent,
which gave the attested forms-- Gothic <i>-o:s</i>, Old English <i>-as</i>, Old
Saxon <i>-os</i>, and Old Norse <i>-ar</i> (with the change *z --> r).
Already in Old English there was a tendency to extend this plural in -s to words that were not a-stems, a tendency which has since become nearly universal.
<p>The n-plural of <b>German</b> is generalized from the PIE n-stems (<i>*-on-es -->
-en</i>). It was still present in Old English n-stems, and survives
today in a few words like 'oxen'.
<p>The <b>Romance</b> s-plurals (<i>-as, -os, -es</i>) are derived from the
accusative (PIE <i>*-a:ns, *-ons, *-ens</i>). Old French still had separate
nominative and oblique (accusative/ablative) forms, but in the end,
grammatical cases were dropped completely, and usually only the
oblique forms were retained.
<p>In <b>Italian</b> and
Romanian, final -s was phonetically lost, and the plurals
are based on the nominative. The Latin nominative plural, at
least in the o- and a:-stems, was based on PIE <i>*-i</i>, of pronominal
origin, not <i>*-es</i> as in most other IE languages.
<P><HR>
<STRONG><A NAME="28">28</A> <IMG Align=Top SRC="redball.gif"> How did genders and cases develop in IE?</STRONG>
<P><A HREF="#27">[Previous]</A> <A HREF="#29">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<P>[--Mikael Thompson]
<p>Early stages of proto-Indo-European (PIE)
didn't have feminine gender. This is attested in Hittite, the oldest
recorded IE language; it had only <b>masculine</b> and
<b>neuter</b> genders, divided basically between
animate and inanimate objects. For most noun classes the PIE endings can be
reconstructed as follows:
<blockquote><table>
<tr><td> <td>Animate <td>Inanimate
<tr><td>Subject <td>*-s <td>*-0
<tr><td>Object <td>*-m <td>*-0
</table></blockquote>
<p>For animate nouns, <i>*-s</i> indicated the source of action,
<i>*-m</i> the thing acted upon; the zero ending indicates no
syntactic role. The basic idea is that only living
things can act upon other things, so only animate nouns could
take the <i>*-s</i>.
<p>Such a system is characteristic of <b>active/stative</b>
languages. Other features of PIE fit in with this observation; for instance, in PIE objects like fire and water which are
inanimate but move seemingly of their own will have two separate
names. In many languages with an
active-stative distinction there are such pairs of words.
As this distinction was lost in IE, different branches retained
just one of the words: e.g. English <i>water</i>, Greek <i>hydor</i>, Hittite <i>watar</i> form one group (from PIE <i>*wed-</i>),
while Latin <i>aqua</i> is from PIE <i>*akwa:-.</i>
<p>The animate nouns are the historical source for the <b>masculine</b>
gender, and the inanimate nouns for the
<b>neuter</b>. This is why in all the classic IE languages the neuter nominative and
accusative have identical forms, and the only basic difference between masculine and neuter nouns is in the
accusative.
<p>Earlier historical linguists cheerfully reconstructed eight cases
for PIE, on the model of Sanskrit; but the IE languages with many cases
are now considered to be innovative, not conservative. The <b>other
cases</b> developed from postpositions or derivational suffixes.
Luwian, a sister language of Hittite, for instance, has no genitive,
but has an adjective-forming suffix <i>-assi</i>, as in <i>harmah-assi-s</i> 'of the head'. (This is an adjective, not a genitive, because it can be
declined.) Genitives in other languages often seem to be developments
of cognates to this suffix.
<p>PIE didn't bother much with specifying <b>plurals</b>, but when it did, it
added an <i>*-s</i> or other endings. The neuter plural in all IE languages is
not descended from this, however-- active/stative languages typically
don't mark plurals for inanimate nouns-- but is instead a collective noun,
treated grammatically as a singular. This collective noun
ended in <i>*-a</i> in the nominative and accusative, and eventually it
developed into the <b>feminine</b>, which in all the old IE languages has the same
form in the nominative singular as does the neuter plural nominative-
accusative. It is also why the Greek neuter plural took a singular
verb.
<p>The reason it is called the feminine, of course, is that nouns
indicating females fell in this gender most of the time. This is
puzzling, and probably we must accept it as a fact whose explanation
can't be recovered from the depths of time.
<P><HR>
<STRONG><A NAME="29">29</A> <IMG Align=Top SRC="redball.gif"> What is the Sapir-Whorf hypothesis?</STRONG>
<P><A HREF="#28">[Previous]</A> <A HREF="lang30.html#30">[Next]</a> <A HREF="langfaq.html">[Index]</A>
<P>[--markrose]
<p>According to the <b>Sapir-Whorf hypothesis</b>, language determines the categories and much of the content of thought. "We dissect nature along lines laid down by our native languages... We cannot talk at all except by subscribing to the organization and classification of data which the [speech community] decrees," said Whorf, in <i>Language, Thought, and Reality</i> (1956). "The fact of the matter is that the 'real world' is to a large extent unconsciously built up on the language habits of the group," said Sapir.
<p>Both were students of Amerindian languages, and were drawn to this conclusion by analysis of the grammatical categories and semantic distinctions found in these languages, fascinatingly different from those found in European ones. (Neither linguist used the term 'Sapir-Whorf hypothesis', however; Whorf referred to the 'linguistic relativity principle'. Moreover, the principle
was almost entirely elaborated by Whorf alone.)
<p>The idea enjoyed a certain vogue midcentury, not only among linguists but among anthropologists, psychologists, and science fiction writers.
<p>However, the <i>strong form</i> of the hypothesis is not now widely believed. The conceptual systems of one language, after all, can be explained and understood by speakers of another. And grammatical categories do not really explain cultural systems very well. Indo-European languages make gender a grammatical category, and their speakers may be sexist-- but speakers of Turkish or Chinese, languages without grammatical gender, are not notably less sexist.
<p>Whorf's analysis of what he called "Standard Average European" languages is also questionable. E.g. he claims that "the three-tense system of SAE verbs colors all our thinking about time." Only English doesn't have three tenses; it has two, past and present; future events are expressed by the present ("I see him tomorrow"), or by a modal expression, merely one of a large class of such synthetic expressions. And for that matter, English distinguishes more like six than three times ("I had gone, I went, I just arrived, I'm going, I'm about to go, I'll go").
<p>To prove his point, Whorf collected stories of confusions brought about by language. For instance, a man threw a spent match into what looked like a pool of water; only there was decomposing waste in the water, and escaping gas was ignited by the spark-- boom! But it's not clear that any <i>linguistic</i> act is involved here. The man could think the pool looked like water without thinking of the word 'water'; and he could fail to notice the flammable vapors without doing any thinking at all.
<p>A <i>weak form</i> of the Sapir-Whorf hypothesis-- that language <i>influences</i> without <i>determining</i> our categories of thought-- still seems reasonable, and is even backed up by some psychological experiments-- e.g.
Kay & Kempton's finding that, in distinguishing color triads, a pair distinguished by color names can seem more distinct than a pair with the 'same' name which are actually more divergent optically (<i>American Anthropologist</i>, March 1984).
<p>It should be emphasized that, in their willingness to consider the idea that non-Western people have languages and worldviews that match the European's in precision and elegance, Sapir and Whorf were far ahead of their time.
<HR>
<P><A HREF="lang18.html">[Previous file]</A> <A HREF="lang30.html">[Next file ]</a> <A HREF="langfaq.html">[Index]</A>
</BODY>
</HTML>