Cynthia L. Allen
Middle English is the name given to the English of the period from approximately 1100 to approximately 1450. This period is marked by substantial developments in all areas of English grammar. It is also the period of English when different dialects are the most fully attested in the texts. At the beginning of the Middle English period, the sociolinguistic status of English was low due to the Norman Invasion, and although religious texts of Old English composition continued to be copied and updated, few original compositions are extant. By the end of the period, English had regained its status as the language of government, law, and literature generally.
Although some notable changes to the phonemic inventory of consonants date from the Middle English period, the most dramatic phonological developments of the period involve vowels. The reduction of the vowels of unstressed syllables, one of the changes that marks the beginning of the Middle English period, is a phonological change with substantial morphological effects, as it substantially reduced the number of distinctive inflectional forms. Constituent order replaced case marking as the primary means of signaling grammatical relations. By the end of the Middle English period, subject-verb-object order had become established as the norm.
The lexicon of English was transformed in this period by an enormous influx of French words. The role of derivational morphology declined as its functions were to some extent replaced by the adoption of French words. Most Scandinavian loans in English first appear in the texts of this period. The Scandinavian loans are typically everyday words, while the words adopted from French are more often in areas of government, law, and higher culture, reflecting the nature of the contact between English speakers and the speakers of these languages.
The density of the Scandinavian population in the northern part of England is generally held to be responsible for the earlier appearance of changes in the north than in the south. The replacement of the third person plural personal pronoun hie by the Scandinavian they is an example of a development which is apparent only in the north early in Middle English but became general in English by the end of this period.
An important phonological development of later Middle English is the beginning of the Great Vowel Shift, which affected long vowels and involved successive changes and was implemented differently in different dialects, the north-south divide being the most evident.
Early Middle English is a language that cannot be understood by Modern English readers without special study, while the language of the late Middle English period, especially that coming from the London area, can be understood with the heavy use of explanatory notes.
Mixed languages are a rare category of contact language which has gone from being an oddity of contact linguistics to the subject of media excitement, at least for one mixed language—Light Warlpiri. They show considerable diversity in structure, social function, and historical origins; nonetheless, they all emerged in situations of bilingualism where a common language is already present. In this respect, they do not serve a communicative function, but rather are markers of an in-group identity. Mixed languages provide a unique opportunity to study the often observable birth, life, and death of languages both in terms of the sociohistorical context of language genesis and the structural evolution of language.
Gemma Rigau and Manuel Pérez Saldanya
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
Catalan is a Romance language closely related to the Gallo-Romance languages. However, from the 15th century onward, it has adopted some linguistic solutions that have brought it closer to the Ibero-Romance languages, due to close contact with Spanish.
Catalan exhibits five main dialects: Central, Northern, and Balearic, which are ascribed to the Eastern dialectal branch; and Northwestern and Valencian, which belong to the Western one. Central, Northern, and Northwestern Catalan are historical dialects that derived directly from the evolution of the Latin spoken in Old Catalonia (the Catalan-speaking territory located on both sides of the Pyrenees). Conversely, Valencian and Balearic are dialects resulting from the territorial expansion of the old Crown of Aragon in the Middle Ages.
As a Gallo-Romance language, Catalan lost all final unstressed vowels different from a (
Some of the most distinctive morphosyntactic features of Catalan are the following:
(1) Catalan is the only Romance language that exhibits a periphrastic past tense expressed by means of the verb anar “go” + infinitive (Ahir vas cantar “Yesterday you sang”). The periphrastic past coexists with a simple past (Ahir cantares “Yesterday you sang”). Conversely, Catalan does not have a periphrastic future with the movement verb go.
(2) Depending on the dialect, proper names may take the definite article (el, la) or a specific personal article (en, na from the vocative Latin forms
(3) Demonstratives show a two-term system in most Catalan dialects: aquí “here” (proximal) / allà or allí “there” (distal); but in Valencian and some Northwestern dialects there is a three-term system. In contrast with other languages with a two-term system, Catalan expresses proximity both to the speaker and to the addressee with the proximal demonstrative (Aquí on jo sóc “Here where I am”; Aquí on tu ets “There where you are”). The demonstrative systems show the same deictic properties as the movement verbs anar “go” and venir “come” in Catalan dialects.
(4) To express possession by means of a pronoun or a determiner, Catalan may use the genitive clitic en (En conec l’autor “I know its autor”), the genitive personal pronoun (el nostre fill “our son”), the dative clitic (Li rento la cara “I wash his/her face”) or the definite article (Tancaré els ulls “I will close my eyes”).
(5) Existential constructions may contain the predicate haver-hi “there be,” consisting of the locative clitic hi and the verb haver “have” (Hi ha tres estudiants “There are three students”), the copulative verb ser “be” (Tres estudiants ja són aquí “Three students are already here”) or other verbs, whose behavior can be close to an unaccusative verb when preceded by the clitic hi (Aquí hi treballen forners “There are some bakers working here”).
(6) The negative polarity adverb no “not” may be reinforced by the adverbs pas or cap, in some dialects, and it can co-occur with negative polarity items (ningú “anybody/nobody,” res “anything/nothing,” mai “ever/never,” etc.). These polarity items exhibit negative agreement (No hi ha mai ningú “Nobody is ever here”). However, negative polarity items may express positive meaning in some non-declarative syntactic contexts (Si mai vens, truca’m “If you ever come, call me”).
(7) Catalan dialects are rich in yes-no interrogative and confirmative particles (que, o, oi, no, eh, etc.: (Que) plou? “Is it raining?,” Oi que plou? “It’s raining, isn’t it?”
Matthew J. Carroll
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
The Yam Languages are a primary language family that is spoken in Southern New Guinea across an area spanning around 180 km west to east across both the Indonesian province of Papua and Papua New Guinea.
The Yam languages are morphologically remarkable for their complex verbal inflection characterized by a tendency to distributed inflectional exponence across multiple sites. Under this pattern of distributed exponence, segmental formatives, that is, affixes, are identifiable but assigning any coherent semantics to these elements is often difficult and instead the inflectional meanings can only be determined once multiple formatives have been combined. This raises interesting theoretical and typological questions about monotonic notions of morpheme and the isomorphic alignment of meaning and form. Yam languages are known for their complex inflectional morphology but display comparatively impoverished word formation or derivational morphology.
Nominal inflection is characterized by moderately large inventories of cases, the largest displaying 16 cases. Nouns may also be marked for number but this is typically restricted to certain case values. Verbal paradigms are also large; verbs mark agreement with up to two arguments in person, number, and at times natural gender. Additionally, languages display numerous tense, aspect, and mood values; this typically involves at least two aspect values, multiple past tense values, and some level of grammatical mood marking. Verbs may also be marked for diathesis, direction, and/or pluractionality.
Architecturally, nominal inflection is rather straightforward with nominal taking case suffixes or clitics with little to no inflectional classes. The true complexity lies in the organization of the verbal inflectional system and the prevalence of distributed exponence. While each language exploits distributed exponence in a unique manner, there are a number of architectural generalizations that can be made across the family. The languages display a remarkably similar inflectional template for verbs and inflectional classes are organized along similar lines. The primary inflectional class divide is between prefixing and ambifixing verbs. Prefixing verbs mark their agreement with a prefix only while ambifixing verbs mark agreement with the suffix, for monovalent clauses, or with both a prefix and a suffix for bivalent verbs. The verbal template involves these agreement prefixes and suffixes that also mark tense, aspect, and mood. The most prominent of those are a set of agreement prefixes known as undergoer prefixes, which mark tense, aspect, and mood in a non-transparent or morphomic manner.
The Dravidian languages, spoken mainly in southern India and south Asia, were identified as a separate language family between 1816 and 1856. Four of the 26 Dravidian languages, namely Tamil, Telugu, Kannada, and Malayalam, have long literary traditions, the earliest dating back to the 1st century
A typical characteristic of Dravidian, which is also an areal characteristic of south Asian languages, is that experiencers and inalienable possessors are case-marked dative. Another is the serialization of verbs by the use of participles, and the use of light verbs to indicate aspectual meaning such as completion, self- or nonself-benefaction, and reflexivization. Subjects, and arguments in general (e.g., direct and indirect objects), may be nonovert. So is the copula, except in Malayalam.
A number of properties of Dravidian are of interest from a universalist perspective, beginning with the observation that not all syntactic categories N, V, A, and P are primitive. Dravidian postpositions are nominal or verbal in origin. A mere 30 Proto-Dravidian roots have been identified as adjectival; the adjectival function is performed by inflected verbs (participles) and nouns. The nominal encoding of experiences (e.g., as fear rather than afraid/afeared) and the absence of the verb have arguably correlate with the appearance of dative case on experiencers. “Possessed” or genitive-marked N may fulfill the adjectival function, as noticed for languages like Ulwa (a less exotic parallel is the English of-possessive construction: circles of light, cloth of gold). More uniquely perhaps, Kannada instantiates dative-marked N as predicative adjectives. A recent argument that Malayalam verbs originate as dative-marked N suggests both that N is the only primitive syntactic category, and the seminal role of the dative case.
Other important aspects of Dravidian morphosyntax to receive attention are anaphors and pronouns (not discussed here; see separate article, anaphora in Dravidian), in particular the long-distance anaphor taan and the verbal reflexive morpheme; question (wh-) words and the question/disjunction morphemes, which combine in a semantically transparent way to form quantifier words like someone; the use of reduplication for distributive quantification; and the occurrence of ‘monstrous agreement’ (first-person agreement in clauses embedded under a speech predicate, triggered by matrix third-person antecedents).
Traditionally, agreement has been considered the finiteness marker in Dravidian. Modals, and a finite form of negation, also serve to mark finiteness. The nonfinite verbal complement to the finite negative may give the negative clause a tense interpretation. Dravidian thus attests matrix nonfinite verbs in finite clauses, challenging the equation of finiteness with tense.
The Dravidian languages are considered wh-in situ languages. However, wh-words in Malayalam appear in a pre-verbal position in the unmarked word order. The apparently rightward movement of some wh-arguments could be explained by assuming a universal VO order, and wh-movement to a preverbal focus phrase. An alternative analysis is that the verb undergoes V-to-C movement.
George van Driem
Several language families and a few language isolates are represented in the Himalayas, the world’s greatest massif, running a length of over 3,600 km. The most well-represented language family in this region happens to be the Trans-Himalayan language family, whose very centre of gravity and phylogenetic diversity is situated within the Eastern Himalaya. This most populous language family on our planet in terms of numbers of speakers used to be known as Tibeto-Burman but, in some circles, the family formerly also went by the names “Indo-Chinese” or “Sino-Tibetan”, the latter two labels actually designating empirically unsupported and now obsolete models of language relationship. The study of Trans-Himalayan historical grammar began with Brian Houghton Hodgson in the 1830s, who during this time served at Kathmandu as the British Resident to the Kingdom of Nepal. Periodically, minor studies devoted attention to several of the more salient morphosyntactic phenomena of Trans-Himalayan historical grammar, but Stuart Wolfenden contributed the first major monograph to the subject in the 1920s. Finally, the historical morphosyntax of the Trans-Himalayan language family came to be the focus of numerous linguistic studies from the 1970s onward, and since that time our understanding of the historical grammar of the language family has changed drastically.
As ever more languages out of the hundreds of previously undocumented Trans-Himalayan tongues came to be described and analysed in great detail, it came to be understood that the flamboyant verbal agreement morphology observed in languages such as the Kiranti languages of eastern Nepal and the rGyalrongic languages of southwestern China were neither grammatically innovative nor represented typological flukes, but instead represented the most grammatically conservative languages within the entire language family. Subsequently, cognate inflectional systems or vestiges of cognate conjugational morphology were discovered in most other branches of the language family as well. The geographical centre, as well as the centre of phylogenetic diversity of the Trans-Himalayan language family, was identified as the highland arc of the Eastern Himalaya. Sinitic languages, although representing by far the most populous single branch of the Trans-Himalayan family, were now understood as constituting just one out of many subgroups, not more divergent from other branches than any one of the four dozen other subgroups making up the language family. The various types of epistemic marking systems observed sporadically throughout the region were shown to be secondary innovations, reflecting a great variety of semantically distinct language-specific grammatical categories. Particularly, languages showing the typology of the Loloish or Sinitic type were shown to be innovative in their grammar, having lost much of the original Trans-Himalayan morphosyntax.
Pierpaolo Di Carlo, Jeff Good, and Rachel Ojong Diba
The pervasiveness of multilingualism throughout the African continent has led it to be viewed as Africa’s “lingua franca.” Nevertheless, sociolinguistic research on this topic has concentrated mostly on urbanized areas, even though the majority of Africans still live in rural regions, and rural multilingualism is clearly of much older provenance than its urban counterpart. In urban domains, individual language repertoires are dominated by the interplay between European ex-colonial languages, African lingua francas, and local languages, and language ideologies emphasize the ordering of languages in a hierarchy that is tied to social status. The situation in rural areas is clearly distinct, though it has yet to be thoroughly investigated.
Early work on language use in rural Africa tended to background the presence of multilingualism and was dominated by an approach that viewed each community (or “tribe”) as having its own language. Thanks to the progressive adoption of ethnographic methods of inquiry, facilitated by language documentation research especially since the beginning of the 21st century, it has been possible to more effectively study areas of high linguistic diversity in West and Central Africa which demonstrate that multilingualism plays an integral role in structuring social relations. Available case studies document the presence of individuals with linguistic repertoires that are primarily oriented around local languages, ideologies, and practices and that do not clearly fit with what is known from urban environments. The most important theme that emerges from this work is the extent to which rural multilingualism is linked to the specific dynamics holding among communities that are near to each other rather than being a reflection of a more general, externally imposed value system.
While this result makes it difficult to characterize rural multilingualism as a single, coherent phenomenon, it does point to the utility of a shared toolkit of research strategies for exploring it in more detail. In particular, ethnographic methods are required in order to ascertain the major local social divisions which language choice both reflects and constructs in these areas, and it is additionally important to focus on how individual repertoires are tied to specific life histories rather than to assume that groupings that are salient to the outside researcher (e.g., “villages” or “compounds”) are the relevant units of analysis.
Finally, investigation of multilingualism in rural Africa is not only valuable for what it reveals about social dynamics on the continent, but it also seems likely to yield important insights for the study of sociolinguistics more broadly.
Gregory D. S. Anderson
The Munda language family constitutes the westernmost branch of the widespread Austroasiatic language family. Munda formerly was considered sister to the rest of the phylum, then known as Mon-Khmer, but this has been revised, and Munda is considered as Austroasiatic as any other branch. The internal classification of the Munda languages is still disputed, but a clear North Munda group exists and is uncontroversial. Other higher-order internal divisions remain disputed, although low-level groups like Sora-Gorum or Gutob-Remo are clear and accepted by almost all researchers today.
Phonologically speaking, Munda languages make extensive use of glottal stop and pre-glottalized stops, nasal vowels, and retroflexion. Word level prosody shows Austroasiatic features with an overlay of South Asian areal features on the phrase level. Register and tone have been reported for individual languages such as creaky voice in Gorum and a low tone in Korku.
Nouns in Munda languages may encode a range of grammatical and local cases, person and number of possessors, and covert distinctions of animacy in agreement and other morphosyntactic features. Verbs in Munda languages can be quite complex, with subject and object as well as TAM encoding, transitivity, finiteness, etc. Kherwarian languages stand out in this regard as well as for the distributional facts of the subject clitics, where the preferred locus is enclitic to the word immediately preceding the verb. Systems of negation can be very complicated and show unexpected interactions with TAM marking in languages like Gutob.
Syntactically, Munda languages show many typical South Asian features such as verb-final structure, as well as non-finite structures, and in some cases switch reference systems or noun incorporation.
The current sociolinguistic and demographic contexts of the different Munda languages range from expanding and healthy with official status in the case of Santali to seriously endangered in the case of Gorum.
Jack B. Martin
The Muskogean languages are a family of languages indigenous to the southeastern United States. Members of the family include Chickasaw, Choctaw, Alabama, Koasati, Apalachee, Hitchiti-Mikasuki, and Muskogee (Creek). The trade language Mobilian Jargon is based on Muskogean vocabulary and grammar. The Muskogean languages all have SOV word order. Noun phrases are marked for subject or non-subject case. Alienable and inalienable possession is marked on possessed nouns. Agreement on verbs for subjects and objects is sensitive to agency. The languages have grammatical tone (used to indicate verbal aspect) and switch reference. Several of the languages have measured tense systems (indicating several degrees of distance in the past).
The noun-modifying clause construction (NMCC) in Japanese is a complex noun phrase in which a prenominal clause is dependent on the head noun. Naturally occurring instances of the construction demonstrate that a single structure, schematized as [[… predicate (finite/adnominal)] Noun], represents a wide range of semantic relations between the head noun and the dependent clause, encompassing some that would be expressed by structurally distinct constructions such as relative clauses, noun complement clauses, and other types of complex noun phrases in other languages, such as English. In that way, the Japanese NMCC demonstrates a clear case of the general noun-modifying construction (GNMCC), that is, an NMCC that has structural uniformity across interpretations that extend beyond the range of relative clauses.
One of the notable properties of the Japanese NMCC is that the modifying clause may consist only of the predicate, reflecting the fact that referential density is moderate in Japanese—arguments of a predicate are not required to be overtly expressed either in the main clause or in the modifying clause. Another property of the Japanese NMCC is that there is no explicit marking in the construction that indicates the grammatical or semantic relation between the head noun and the modifying clause. The two major constituents are simply juxtaposed to each other.
Successful construal of the intended interpretations of instances of such a construction, in the absence of explicit markings, likely relies on an aggregate of structural, semantic, and pragmatic factors, including the semantic content of the linguistic elements, verb valence information, and the interpreter’s real-world knowledge, in addition to the basic structural information.
Researchers with different theoretical approaches have studied Japanese NMCCs or subsets thereof. Syntactic approaches, inspired by generative grammar, have focused mostly on relative clauses and aimed to identify universally recognized syntactic principles. Studies that take the descriptive approach have focused on detailed descriptions and the classification of a wide spectrum of naturally occurring instances of the construction in Japanese. The third and most recent group of studies has emphasized the importance of semantics and pragmatics in accounting for a wide variety of naturally occurring instances.
The examination of Japanese NMCCs provides information about the nature of clausal noun modification and affords insights into languages beyond Japanese, as similar phenomena have reportedly been observed crosslinguistically to varying degrees.
Within the Ryukyuan branch of the Japonic family of languages, present-day Okinawan retains numerous regional variants which have evolved for over a thousand years in the Ryukyuan Archipelago. Okinawan is one of the six Ryukyuan languages that UNESCO identified as endangered. One of the theoretically fascinating features is that there is substantial evidence for establishing a high central phonemic vowel in Okinawan although there is currently no overt surface [ï]. Moreover, the word-initial glottal stop [ʔ] in Okinawan is more salient than that in Japanese when followed by vowels, enabling recognition that all Okinawan words are consonant-initial. Except for a few particles, all Okinawan words are composed of two or more morae. Suffixation or vowel lengthening (on nouns, verbs, and adjectives) provides the means for signifying persons as well as things related to human consumption or production. Every finite verb in Okinawan terminates with a mood element. Okinawan exhibits a complex interplay of mood or negative elements and focusing particles. Evidentiality is also realized as an obligatory verbal suffix.
The Northeast Asia is one of the unique points on the globe where there are many language isolates and portmanteau families. From a conservative point of view, the Japanese language is a member of such a portmanteau family that has recently and increasingly been called Japonic in the Western literature. While Japanese is unquestionably a member of this Japonic language family, which consists of two Japanese languages (Japanese itself and the moribund Hachijō language) and four or five relatively closely related Ryūkyūan languages (Amami, Okinawan, Miyako, Yaeyama, and possibly Yonaguni), attempts have also been made to establish a genetic relationship between Japanese and various other language families. Most of these attempts have been amateurish, a major exception being the Koreo-Japonic hypothesis, which still remains unproven as well. It is also quite likely that the Japonic language family (or, more precisely, Insular Japonic) is the only linguistic grouping whose genetic relationship can be established beyond any doubt. A genetic relationship is also likely to exist between Japonic and a number of fragmentarily attested languages that once flourished in the south and center of the Korean Peninsula, but that died out no later than 9th century A.D. The paucity of material available does not allow one to establish solid predictive-productive regular correspondences in many cases, but intuitively the genetic relationship seems to be a matter of fact. Anything beyond intuition, however, lies in the realm of conjecture and speculation. The alleged Koreo-Japonic relationship is best explained by a centuries-long contact relationship rather than by common origin, given such factors as the virtual absence of any kind of shared paradigmatic morphology, as well as by multiple problems in establishing the real (and not imaginable or made-to-fit) regular correspondences. The Japanese-“Altaic” hypothesis is even more speculative and far-fetched. Consequently, the conclusion is that the Japanese language or the Japonic language family has no demonstrable relationship with any other language family or language isolate on the planet.
Anthony P. Grant
The Penutian language family, Penutian phylum, or better still, Penutian hypothesis is one of the largest genealogical linguistic groupings to have been proposed for western North America. It involves 16 families or isolates. Only a few of these families are demonstrably relatable to one another according to current knowledge and diachronic techniques. Sometimes Penutian is split by observers into groups of languages assumed to be interrelated, and this is done without assumptions that the groups themselves are interrelated.
This article focuses on the Canadian and US languages in “Sapir’s Penutian,” the most commonly accepted version; the most southerly family within Penutian is thus held as Yokutsan of California’s Sierra Nevada. It discusses the subclassification of the so-called Penutian languages into families and smaller units; aspects of their phonology, morphosyntax, and contact histories; and issues in their revitalization and the potential reconstruction of Proto-Penutian.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
As far as personal nouns are concerned, an agent noun is a derived noun whose general meaning is “person who usually/typically does . . .”. It is thus characterized by the feature [+ Human], irrespective of the fact that the person involved actually performs an action (e.g., French nageur “swimmer,” i.e., “a person who swims”), carries on a profession (e.g., Spanish carpintero “carpenter,” i.e., “a person who builds or repairs wooden structures”), adheres to a certain ideology or group (e.g., Italian femminista “feminist,” i.e., “a person who supports or follows the feminist movement”), etc.
Agent nouns can be both denominal and deverbal. Latin denominal agent nouns were mainly formed with –ARIUS, though the Latin agentive suffix par excellence was –TOR, which derived nouns from verbs. The reflexes of both –ARIUS and –TOR are widespread and highly productive in Romance: For the former, see Portuguese –eiro / –ario, Spanish –ero / –ario, Catalan –er, French –ier / –aire, Italian –aio / –aro / –ario, Romanian –ar, etc.; for the latter, Portuguese, Spanish, Catalan –dor, French –eur, Italian –tore, Romanian –tor, etc. At any rate, the distinction between denominal and deverbal agent nouns is not always straightforward, as is shown by the Romance forms connected with Latin present particle –NTE: Whereas most of them display a verbal base (see, e.g., Italian cantante “singer” ← cantare “to sing”), some others do not (see, e.g., Italian bracciante “hired hand” ← braccio “arm”), allowing them to be regarded as denominal derivations.
The distribution of agentive suffixes in Romance is sometimes conditioned by borrowing phenomena. For example, Italian –iere is not a direct outcome of Latin –ARIUS, but rather a borrowing from French –ier; Sardinian –eri is of Spanish/Catalan origin, whereas Piedmontese –aire and –eur are loans from Occitan and French, respectively.
A theoretical issue that is related to agent nouns in Romance is the so-called agent-instrument-locative polysemy. While traditionally attributed to the result of metaphorical and/or semantic extensions, this polysemy is better explained by such factors as homonymization, ellipsis, analogy, borrowing/calque, though, at least in some cases, we cannot exclude that a combination of both explanation types has taken place.
Pidgin languages sometimes form in contact situations where a means of communication is urgently needed between groups lacking a common code. They are typically less elaborate than any of the languages involved in their formation, and in comparison to those, reduction characterizes all linguistic levels.
The process is relatively uncommon, and the life span of pidgins is usually short – most disappear when the contact situation changes, or when another medium of intergroup communication becomes available. In some rare cases, however, they expand (both socially and structurally), and may even nativize, i. e. become mother tongues to their speakers (when they may be re-labelled “creoles”).
Pidgins are severely understudied, and while they are often mentioned as precursors to creoles, few linguists have shown a serious interest in them. As a result, many generalizations have been based on extremely limited amounts of data or even on intuition. Some frequently occurring ones is that pidginization is a case of second language acquisition, that power and prestige are important factors, and that most structures are derived from the input languages. My work with pidgins has led me to believe the opposite to be true in these cases: pidgins form through a trial-and-error process, where anything that is understood by the other party is sanctioned, this process is one of collaborative language creation (rather than one involving one group of teachers and one group of learners), and much of what finds its way in the resultant contact language do so independently of what the creators spoke prior to their encounter.
As for theoretical implications, pidgins may shed light on which features in traditional languages are necessary for communication, and which are superfluous from the point of view of pure information transmission.
Creole languages have mostly resulted from interactions between Europeans and subordinated peoples amid colonization, trade, and imperialism. Given that the creation of these languages was usually driven as much by adults as children, second-language acquisition has a larger effect upon creole language structures than it does under most other conditions of language change and contact. Namely, it has traditionally been supposed that creole languages begin as makeshift pidgin varieties, expanded from this into full languages. However, various creolists have proposed that most creoles did not in fact emerge in this way; some argue that creoles are relexifications of indigenous languages, while others argue that nothing distinguishes creole genesis from language contact more generally.
The Playful Lexicon in the Romance Languages: Onomatopoeia, Reduplication, Prosodic Templates, Blending
We describe a lexical item as “playful” or “ludic” when it shows evidence of manipulation of the relation that inheres between its form (signifier) and its meaning (signified). The playful lexicon of any given language, therefore, is the sum total of its lexical items that show signs of such manipulation.
Linguists have long recognized that the only necessary link between a word’s form and its meaning is the arbitrary social convention that binds them. However, nothing prevents speakers from creating additional, unnecessary, and therefore essentially “playful” links, associating forms with meanings in a symbolic, hence nonarbitrary way. This semantic effect is most evident in the case of onomatopoeia, through which the phonetic form of words that designate sounds is designed to be conventionally imitative of the sound. A second group of playful words combines repeated sequences of sounds with meanings that are themselves suggestive of repetition or related concepts such as collectivity, continuity, or actions in sequence, as well as repeated, back-and forth, or uncontrolled movements; or even, more abstractly, intensity and hesitation. The playfulness of truncated forms such as clips and blends is based on a still more abstract connection between forms and meanings. In the case of clipping, the truncation of the full form of a word triggers a corresponding connotative truncation or diminution of the meaning, that is, a suggestion that the referent is small—either endearingly so, as in hypocoristics, or contemptuously so, as in insults. In blending, truncation is often accompanied by overlapping, which symbolically highlights the interrelatedness or juxtaposition of the constituents’ individual meanings. Prosodic templates do not constitute a separate category per se; instead, they may play a part in the formation or alteration of words in all of the categories discussed here.
Polysynthesis is informally understood as the packing of a large number of morphemes into single words, as in (1) from Bininj Gun-wok (Evans, in press).
'I cooked the wrong meat for them again.'
Its status as a distinct typological category into which some of the world’s languages fall, on a par with isolating, agglutinating, or fusional languages, has been controversial from the start. Nevertheless, researchers working with these languages are seldom in doubt as to their status as distinct from these other morphological types. This has been complicated by the fact that the speakers of such languages are largely limited to hunter-gatherers—or were so in the not too distant past—so the temptation is to link the phenomenon directly to way of life. This proves to be oversimplified, although it is certainly true that languages qualifying as polysynthetic are almost everywhere spoken in peripheral regions and are on the decline in the modern world—few children are learning them today.
Perhaps the most pervasive of the traits that give these languages the impression of a “special” status is that of holophrasis, which can be defined as the (possible) expression of what in less synthetic languages would be whole sentences in single complex (usually verbal) words. It turns out, however, that there is much greater variety among polysynthetic languages than is generally thought: there are few other traits that they all share, although distinct subtypes can in fact be distinguished, notably the affixing as opposed to the incorporating type.
These languages have considerable importance for the investigation of the diachronic complexification of languages in general and of language acquisition by children, as well as for theories of language universals. The sociolinguistic factors behind their development have only recently begun to be studied in depth. All polysynthetic languages today are to some degree endangered (they are dying off at an alarming rate), and many have been poorly studied if at all, which makes their investigation before it is too late a prime goal for linguistics.
Timothy J. Vance
The term rendaku, sometimes translated as sequential voicing, denotes a morphophonemic phenomenon in Japanese. In a prototypical case, an alternating morpheme appears with an initial voiceless obstruent as a word on its own or as the initial element (E1) in a compound but with an initial voiced obstruent as the second element (E2) in a two-element compound. For example, the simplex word /take/ ‘bamboo’ and the compound /take+yabu/ ‘bamboo grove’ (cf. /yabu/ ‘grove’) begin with voiceless /t/, but this morpheme meaning ‘bamboo’ begins with voiced /d/ in /sao+dake/ ‘bamboo (made into a) pole’ (cf. /sao/ ‘pole’). Rendaku was already firmly established in 8th-century Old Japanese (OJ), the earliest variety for which extensive written records exist, and subsequent sound changes have made the alternations phonetically heterogeneous. Many OJ compounds with eligible E2s did not undergo rendaku, and the phenomenon remains pervasively irregular in modern Japanese. There are, however, many factors that promote or inhibit rendaku, and some of these appear to influence native-speaker behavior on experimental tasks. The best known phonological factor is Lyman’s Law, according to which rendaku does not apply to E2s that contain a non-initial voiced obstruent. Many theoretical phonologists endorse the idea that Lyman’s Law is a sub-case of the Obligatory Contour Principle, which rules out identical or similar units if they would be adjacent in some domain. Other well-known factors involve vocabulary stratum (e.g., the resistance to rendaku of recently borrowed E2s) or the morphological/semantic relationship between E2 and E1 (e.g., the resistance to rendaku of coordinate compounds). Some morphemes are idiosyncratically immune to rendaku. Other morphemes alternate but undergo rendaku in some compounds while failing to undergo it in others, even though no known factor is relevant. In addition, many individual compounds vary between a form with rendaku and a form without, and this variability is often not reflected in dictionary entries. Despite its irregularity, rendaku is productive in the sense that it often applies to newly created compounds. Many compounds, of course, are stored (with or without rendaku) in a speaker’s lexicon, but fact that native speakers can apply rendaku not just to existing E2s in novel compounds but even to made-up E2s shows that rendaku as an active process is somehow incorporated into the grammar.
The convergence between Basque and Romance is now largely unidirectional, with Basque becoming more like Romance, but shared features suggest that Basque had historically a considerable influence on the emerging Romance varieties in southern France and northern Iberia. Similar phonemic distinctions and phonetic realizations are found in adjacent Basque and Romance varieties, and sometimes beyond. The phoneme inventories of Basque and Castilian Spanish are largely identical. The Romance influence on Basque is most visible in the lexicon, as over half of the words used in everyday speech are of Latin or Romance origin. While the Basque contribution to the Romance lexicon of common nouns has been much more modest, some Basque anthroponyms have become very popular beyond the Basque Country. The integration of Latin verbs into the Basque lexicon triggered and then accelerated the switch to a tense-aspect system modeled on that of Romance. Like Spanish, the Basque varieties in Spain distinguish between two ‘be’-copulas, and two ‘have’-verbs. Certain types of relative clauses and passive constructions replicate Romance models, and a Basque mediopassive can be systematically translated into a Spanish clause with the pronoun se. The default constituent order of Basque is verb-final, but dependent clauses are often found in post-predicate position, matching the order found in Romance. While sharing many features with Romance varieties across southwestern Europe, Basque is closest to Castilian and Gascon, the two languages with which it has a long history of bilingualism and localized language shift.