Japanese Psycholinguistics  

Mineharu Nakayama

Japanese psycholinguistics is moving rapidly in many different directions, as it spans various subfields of linguistics (e.g., phonetics/phonology, syntax, semantics, pragmatics, discourse studies). Naturally, diverse studies have reported intriguing findings that shed light on our language mechanism. This article presents a brief overview of some notable early 21st-century studies, mainly from the perspectives of language acquisition and processing. The topics are divided into sections on the sound system, script forms, reading and writing, morpho-syntactic studies, word and sentential meanings, and pragmatics and discourse studies. Studies on special populations are also mentioned. Studies on the Japanese sound system have advanced our understanding of L1 and L2 (first and second language) acquisition and processing. For instance, more evidence has been provided that infants form an adult-like phonological grammar by 14 months in L1, and a dissociation between prosody and comprehension has been reported in L2. Various cognitive factors, as well as the L1, influence the L2 acquisition process. As Japanese language users employ three script forms (hiragana, katakana, and kanji) in a single sentence, orthographic processing research reveals multiple pathways for processing information and the influence of memory. Adult script decoding and lexical processing have been well studied, and research data from special populations further help us to understand our vision-to-language mapping mechanism. Morpho-syntactic and semantic studies include a long debate between the nativist (generative) and statistical learning approaches to L1 acquisition. In particular, inflectional morphology and quantificational scope interaction in L1 acquisition bring out the pros and cons of relying on either approach alone. Investigating processing mechanisms means studying cognitive/perceptual devices. 
Relative clause processing has been widely discussed for Japanese because Japanese has a different word order (SOV) from English (SVO), allows unpronounced pronouns and pre-verbal word permutations, and has no relative clause marking at the verbal ending (i.e., the ending is morphologically the same as the matrix ending). Behavioral and neurolinguistic data increasingly support incremental processing, as in SVO languages, and an expectancy-driven processor in the L1 brain. L2 processing, however, requires more study to uncover its mechanism, as the literature is scarce on both L2 English by Japanese speakers and L2 Japanese by non-Japanese speakers. Pragmatic and discourse processing is also an area that needs to be explored further. Despite the typological difference between English and Japanese, the studies cited here indicate that our acquisition and processing devices seem to adjust locally while maintaining the universal mechanism.


Generative Grammar  

Knut Tarald Taraldsen

This article presents different types of generative grammar that can be used as models of natural languages, focusing on a small subset of all the systems that have been devised. The central idea behind generative grammar may be rendered in the words of Richard Montague: “I reject the contention that an important theoretical difference exists between formal and natural languages” (“Universal Grammar,” Theoria, 36 [1970], 373–398).
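As a rough illustration of treating a natural language fragment as a formal system, the sketch below uses a context-free grammar, one familiar kind of generative system. The toy rules and vocabulary are hypothetical and are not taken from the article, which may or may not discuss this particular type.

```python
import itertools

# Hypothetical toy grammar (an assumption for illustration only):
# each nonterminal maps to a list of possible right-hand sides.
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["John"], ["the", "N"]],
    "VP": [["V", "NP"], ["slept"]],
    "N": [["cake"]],
    "V": [["ate"]],
}

def generate(symbol):
    """Yield every word sequence derivable from symbol.

    Terminates only because this toy grammar has no recursive rules.
    """
    if symbol not in GRAMMAR:  # a terminal symbol, i.e. a word
        yield [symbol]
        return
    for rhs in GRAMMAR[symbol]:
        # Combine every expansion of each right-hand-side symbol.
        expansions = [list(generate(s)) for s in rhs]
        for parts in itertools.product(*expansions):
            yield [word for part in parts for word in part]

sentences = [" ".join(words) for words in generate("S")]
for sentence in sentences:
    print(sentence)
```

The grammar licenses strings purely by their form, so it also produces sentences such as "the cake slept"; the rules say nothing about meaning.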


Morphology in Arawak Languages  

Alexandra Y. Aikhenvald

The Arawak language family is the largest in South America in terms of its geographical spread, from Central America (Belize, Honduras, Guatemala, and Nicaragua) to as far south as Bolivia (and formerly Argentina and Paraguay). Within South America, Arawak languages are spoken in Lowland Amazonia and adjacent regions, covering Guyana, French Guiana, Surinam, Venezuela, Colombia, Peru, and Brazil, in at least ten locations north of the River Amazon, and at least ten to the south of it. There are over forty extant languages and a few dozen extinct ones. The genetic unity of Arawak languages was first recognized by Father Filippo Salvadore Gilij as early as 1783, based on a comparison of pronominal prefixes in Maipure, an extinct language from the Orinoco Valley, and in Moxo from Bolivia. The limits of the family were established by the early 20th century. Proposals to include Arawak languages in putative macro-groupings such as “Arawakan” or “Macro-Equatorial” have proved spurious and unsubstantiated. The heritage of Arawak languages survives in such common words as hammock, hurricane, barbecue, guava, and tobacco. Arawak languages are synthetic, predominantly head-marking and suffixing, with a closed and historically stable set of prefixes: bound pronouns on verbs, the relativizing prefix ka-, and its negative counterpart ma-. Personal prefixes distinguish first, second, and third person, and also impersonal and indefinite forms. Prefixes mark the subject of a transitive verb and of an intransitive active verb, and the possessor on nouns. In at least two thirds of the languages, personal suffixes or enclitics express the object of a transitive verb (O), and the subject of stative verbs (So) or the subject of non-verbal predicates. A few highly synthetic languages (including those from the Kampa subgroup in Peru) employ suffixes or enclitics to cross-reference the object and also the recipient or an oblique. 
There are typically a number of locative cases which can be stacked in one word. The majority of Arawak languages do not employ cases for marking core grammatical relations. The only exception is Tariana, from the multilingual Vaupés River Basin linguistic area. Here, core cases were developed under the influence of the neighboring Tucanoan languages. Inclusive–exclusive distinctions were developed in Resígaro and Palikur as a result of language contact. Open classes are verbs and nouns; adjectives tend to form an open class, and share some features with nouns, and some with verbs. Verbal roots tend to be exclusively monosyllabic. Noun roots can contain more than one syllable. Derivational processes include affixation, compounding, and various kinds of reduplication. Just a few languages have single-word serial verb constructions. The order of suffixes within a word can be variable, reflecting the scope of the morphemes. Nouns divide into obligatorily, or inalienably, possessed and optionally, or alienably, possessed. Obligatorily possessed nouns are body parts, kinship terms, and a few important possessions, for example ‘name’ and ‘house’. If the possessor is not specified, these nouns take an unpossessed form marked with a suffix, also used as a nominalizer on verbs in many languages. Alienably possessed nouns take a possessive prefix and an additional suffix (chosen based on the meaning of the noun). Most languages distinguish masculine and feminine genders in third person singular personal pronouns, demonstratives, nominalizations, and also as agreement markers on adjectives. More than half the languages have complex systems of classifiers on number words, and also on verbs, in possessive constructions, and on nouns themselves. They categorize the noun in terms of its shape, consistency, and animacy. Singular and plural numbers are fairly uniform across the family; dual has developed in Resígaro, as a consequence of language contact with the unrelated Bora. 
Other nominal grammatical categories include nominal tense, augmentative, diminutive, and approximative. The verb is the most complicated part of the grammar of every Arawak language, and the only obligatory constituent in a clause. Typical verbal categories include tense, aspect, evidentiality, numerous modalities (including a frustrative meaning ‘do in vain’), and valency-changing derivations—passives, reflexives, reciprocals, causatives, and applicatives. Some Kampa languages have up to six applicative derivations, including comitative, benefactive, goal, presential, separative, and instrumental. Highly synthetic languages, such as Kampa and Palikur, have patterns of noun incorporation. Many Arawak languages are located next to speakers of languages from other families. They take on their features, in grammar and sometimes also in lexicon. Tariana, the only Arawak language spoken in the multilingual Vaupés River Basin area surrounded by Tucanoan languages, has a distinct Tucanoan flavor to its grammar. Mawayana, Garifuna, and Palikur, in contact with Carib languages, have acquired a few Carib features. Resígaro has been affected by Bora, and Amuesha bears traces of contact with Quechua and other languages that are hard to identify. The interaction of genetic inheritance, language contact, and independent innovations makes Arawak languages dauntingly diverse.


Ancient Greek Views on Greek and Other Languages  

Toon Van Hal

The Ancient Greeks came into contact with possibilities and problems related to ‘language’ in several respects. The earliest epics contained implicit etymological explanations, and both the pre-Socratic philosophers and the sophists were intrigued by the link between the form of words and the meaning they carried. The adaptation of the Phoenician alphabet was an additional stimulus to start reflecting on language. ‘Letters’ became the smallest unit of inquiry in Greek language thought. Of the other units, the word was seen as the most significant level. Elaborating on the philosophical foundations laid by Plato, Aristotle, and early Stoic thinkers, Alexandrian scholars started shaping a philologically oriented tradition of grammar, which largely concentrated on the study of the eight parts of speech and was directed at young students of Greek literature. Within the frame of grammar, less attention was paid to the level of the sentence, which explains why syntactic issues were not intensively explored. At its inception, Greek lexicography was likewise an ancillary tool for understanding Greek literary texts, directed at an audience of native speakers of Greek. Hence, lexicographical projects were limited to difficult or special words. Only once Romans began to delve into the study of Greek did the composition of general lexicons become more urgent.


Peculiarities of Raeto-Romance Word Formation  

Matthias Grünert

Raeto-Romance (RaeR.) word formation shows considerable differences between the three main varieties: Romansh of Grisons, Dolomitic Ladin, and Friulian. Although numerous processes of word formation are common to these varieties, being inherited from identical bases, their vitality differs. This is due to the detached developments in the individual areas and to different influences from the dominant neighboring languages, German and Italian, leading to numerous replications of patterns in the RaeR. varieties.


Unsettling Imperial Science: Centering Convivial Scholarship in Sociolinguistics  

Finex Ndhlovu

The universalizing posture of claims made by colonial approaches and their regimes of representation continues to inform most mainstream sociolinguistics research agendas and project designs. Such claims reflect an imperial scientific tradition that overlooks and marginalizes other ways of knowing, particularly those from communities of the global South. Decolonizing sociolinguistics entails doing at least three things. First, we must decolonize ourselves through critical reflection on our own practices and how such practices contribute to the continuation of inequalities in knowledge production and in society. Second is the need to develop new narratives, new words, new grammars, and new vocabularies for eliciting empirical data to support the suppositions and arguments we advance in our anti-conventional and anti-colonial theoretical approaches to language and society research. Such alternative trajectories require a decentering of the dominant (colonial/imperial) voice and an increase in other voices speaking from other equally valid approaches that are currently being overlooked. Third, decolonizing sociolinguistics entails developing new models that draw on a rich collection of thought from a broad spectrum of traditions of knowing. This is about promoting convivial scholarship through mobilizing diverse resources to advance collaborative engagements that link our academic pursuits to public interests, including the interests of marginalized, minority, and global Indigenous communities. Convivial scholarship says the paths we follow in doing sociolinguistics research must be those that are committed to re-membering and rehumanizing Indigenous and other Southern peoples subjected to more than 500 years of coloniality. Decolonizing sociolinguistics must, therefore, mean freeing the field from the colonial tradition of knowing by bringing back to the center historically marginalized Indigenous and Southern knowledge systems. 
The premise is that a sociolinguistics that works for all must open pathways and avenues for epistemic access and cognitive justice through valuing diverse founts of knowledges as key contours.


Discourse Analytic Approaches to Language and Identity  

Dorien Van De Mieroop

Rather than thinking of identity as something that defines a person in such a way that it makes them distinguishable from others, researchers using discourse analytical approaches within linguistics—especially in the fields of pragmatics and interactional sociolinguistics—tend to adopt a social constructionist perspective and thus view identity as a multimodally constituted activity or process. From this perspective, identity is not something one is or has, but something that one does or creates by means of various linguistic and paralinguistic resources as well as bodily movements. This performative view of identity has a number of implications. Rather than thinking of identity in the singular, a plural conceptualization of identities is capitalized on. Moreover, these identities should not be regarded as pertaining to only the ‘large’ macro-level sociodemographic categories individuals belong to, such as gender, race, and social class; identities are often described in much more nuanced terms. Such a fine-grained approach is needed to do justice to this performative perspective on identity, as it helps to capture the many dynamic and extremely fleeting ways in which people engage in identity work. Furthermore, all these identity constructions are not necessarily always consistent with one another, and they may sometimes even be contradictory, as people may not always be—or be able to be—equally prone to enacting a particular identity. This may depend on what they are doing and with whom, as identities are also related to the identities other people may construct around them. All these aspects make the analysis of identity quite a complex endeavor, as not only can their plural and fleeting nature make identities quite hard to capture, but it can also be quite a challenge to pin down precisely at which points in an interaction we can actually observe identity work in action.


Orthography and the Sociolinguistics of Writing  

Mirka Honkanen

Orthography is not a neutral tool for representing language in writing. Spelling is a linguistic variable capable of carrying social meaning, and orthographies are technologies embedded in larger societal structures. Spelling plays a role in the construction of national and other social identities, the delimitation of languages, the authentication and stigmatization of speaker groups, standardization, and the written representation of paralinguistic features. In these and further ways, orthography is a topic of high sociolinguistic relevance. After written language had long received less sociolinguistic attention than speech, there is now a growing body of sociolinguistic research into spelling variation and orthography as a socioculturally situated practice. Sociolinguists investigate the social role of orthographies and spelling choices. When orthographies are developed for previously unwritten languages, decisions have to be made not only regarding phonemic representation but also between creating distance from and closeness to related languages. Orthography becomes a highly debated topic also when spelling reforms are proposed; different ideological, aesthetic, financial, educational, and sociopolitical arguments are typically brought forth. Standardized spellings are seen by language users as granting languages and speakers authority. When non-standardized spellings are used in transcripts of speech, they have been shown to assign sociolinguistic stigma to the speakers represented. Non-standardized spellings are used in different less than fully regulated orthographic spaces, such as digital writing, company and personal names, literary texts, subcultural publications, advertising, and private writing. Sociolinguistic studies on spelling often rely on data from digital communication such as text messaging or social media interactions. 
Such studies not only describe and classify different kinds of non-standardized spellings but also increasingly establish quantitative tendencies, explore correlations with macro-level sociodemographic factors, and show the potential for respelling to construct identities, personae, and meaning at the micro level of the utterance. Spelling can index identities and stances, act as a contextualization cue, and represent prosodic and dialectal features.


Diez, Meyer-Lübke, and Co. The Founding of Romance Linguistics  

Marcello Barbato

The study of Romance linguistics was born in the 19th-century German university, and like all linguistics of that era it is historical in nature. With respect to Indo-European and Germanic linguistics, a difference was immediately apparent: unlike Indo-European and Common Germanic, Latin is attested over a long period and in rich and varied sources, so Romance linguists can make use of reconstruction as well as documentation. Friedrich Diez, author of the first historical grammar and first etymological dictionary of the Romance languages, founded Romance linguistics. His studies singlehandedly laid the foundations of the discipline. His teaching soon spread not only across German-speaking countries, but also into France and Italy. Subsequently, the most significant contributions came from two scholars trained in the Indo-European field: the German linguist Hugo Schuchardt, whose doctoral thesis studied with sharp theoretical awareness the passage from Latin to the Romance languages, and the Italian Graziadio Isaia Ascoli, who showed how the Romance panorama could be extraordinarily enriched by the analysis of nonstandard varieties. The discipline thus developed fully and radiated out. Great issues came to be debated: models of linguistic change (genealogical tree, wave), the possibility of distinguishing dialect groups, and the relative weight of phonology and semantics in lexical reconstruction. New disciplines such as linguistic geography were born, and new instruments like the linguistic atlas were forged. Romance linguistics thus became the avant-garde of general linguistics. Meanwhile, a new synthesis of the discipline had been created by a Swiss scholar, Wilhelm Meyer-Lübke, who published a historical grammar and an etymological dictionary of the Romance languages.


Case-Marking in the Romance Languages  

Alexandru Nicolae

Case-marking is subject to several important developments in the passage from Latin to the Romance languages. With respect to synthetic marking, nouns and adjectives witness considerable simplification, leading (with some exceptions, i.e., the binary case systems) to the almost complete disappearance of inflectional case-marking, while pronouns continue to show consistent inflectional case-marking. In binary case systems, case distinctions are also marked in the inflection of determiners. Inflectional simplification is compensated for by the profusion of analytic and mixed case-marking strategies and by alternative strategies of encoding grammatical relations.


French Outside Europe  

André Thibault

The first French colonial era goes back to the 17th and 18th centuries. It encompasses North American territories, the Antilles, and the Indian Ocean. The second colonial era started in the 19th century and ended in the 1960s. It first reached the Maghreb and Lebanon, followed by sub-Saharan Africa, where two colonial powers, France and Belgium, exported the use of French. The last territories affected by the expansion of the French language are to be found in the Pacific.


Balkan-Romance  

Adina Dragomirescu

Balkan-Romance is represented by Romanian and its historical dialects: Daco-Romanian (broadly known as Romanian), Aromanian, Megleno-Romanian, and Istro-Romanian (see article “Morphological and Syntactic Variation and Change in Romanian” in this encyclopedia). The external history of these varieties is often unclear, given the historical events that took place in the Lower Danubian region: the conquest of this territory by the Roman Empire for a short period and the successive Slavic invasions. Moreover, the earliest preserved writing in Romanian only dates from the 16th century. Between the Roman presence in the Balkans and the first attested text, there is a gap of more than 1,000 years, a period in which Romanian emerged, the dialectal separation took place, and the Slavic influence had effects especially on the lexis of Romanian. In the 16th century, in the earliest old Romanian texts, the language already displayed the main features of modern Romanian: the vowels /ə/ and /ɨ/; the nominative-accusative versus genitive-dative case distinction; analytical case markers, such as the genitive marker al; the functional prepositions a and la; the proclitic genitive-dative marker lui; the suffixal definite article; polydefinite structures; possessive affixes; rich verbal inflection, with both analytic and synthetic forms and with three auxiliaries (‘have’, ‘be’, and ‘want’); the supine, not completely verbalized at the time; two types of infinitives, with the ‘short’ one on a path toward becoming verbal and the ‘long’ one specializing as a noun; null subjects; nonfinite verb forms with lexical subjects; the mechanism for differential object marking and clitic doubling with slightly more vacillating rules than in the present-day language; two types of passives; strict negative concord; the SVO and VSO word orders; adjectives placed mainly in the postnominal position; a rich system of pronominal clitics; prepositions requiring the accusative and the genitive; and a large inventory of subordinating conjunctions introducing complement clauses. Most of these features are also attested in the trans-Danubian varieties (Aromanian, Megleno-Romanian, and Istro-Romanian), which were also strongly influenced by the various languages with which they have come into direct contact: Greek, Albanian, Macedonian, Croatian, and so forth. These source languages have had a major influence on the vocabulary of the trans-Danubian varieties and certain consequences for the shape of their grammatical system. The differences between Daco-Romanian and the trans-Danubian varieties have also resulted from the preservation of archaic features in the latter or from innovations that took place only there.


Korean Syntax  

James Hye Suk Yoon

The syntax of Korean is characterized by several signature properties. One signature property is head-finality. Word order variations and restrictions obey head-finality. Korean also possesses wh in-situ as well as internally headed relative clauses, as is typical of a head-final language. Another major signature property is dependent-marking. Korean has systematic case-marking on nominal dependents and very little, if any, head-marking. Case-marking and related issues, such as multiple case constructions, case alternations, case stacking, case-marker ellipsis, and case-marking on adjuncts, are front and center properties of Korean syntax as viewed from the dependent-marking perspective. Research on these aspects of Korean has contributed to the theoretical understanding of case and grammatical relations in linguistic theory. Korean is also characterized by agglutinative morphosyntax. Many issues in Korean syntax straddle the morphology-syntax boundary. Korean morphosyntax constitutes a fertile testing ground for ongoing debates about the relationship between morphology and syntax in domains such as coordination, deverbal nominalizations (mixed category constructions), copula, and other denominal constructions. Head-finality and agglutinative morphosyntax intersect in domains such as complex/serial verb and auxiliary verb constructions. Negation, which is a type of auxiliary verb construction, and the related phenomena of negative polarity licensing, offer important evidence for crosslinguistic understanding of these phenomena. Finally, there is an aspect of Korean syntax that reflects areal contact. Lexical and grammatical borrowing, topic prominence, pervasive occurrence of null arguments and ellipsis, as well as a complex system of anaphoric expressions, resulted from sustained contact with neighboring Sino-Tibetan languages.


Parasynthesis in Morphology  

Claudio Iacobini

The term parasynthesis is mainly used in modern theoretical linguistics in the meaning introduced by Arsène Darmesteter (1874) to refer to denominal or deadjectival prefixed verbs of the Romance languages (Fr. embarquer ‘to load, to board’) in which the non-prefixed verb (barquer) is not an actual word, and the co-radical nominal form (embarqu-) is not well formed. The Romance parasynthetic verb is characterized with reference to its nominal or adjectival base as the result of the co-occurrence of both a prefix and a suffix (typically of a conversion process, i.e., non-overt derivational marking). The co-occurrence or simultaneity of the two processes has been seen by some scholars as a circumfixation phenomenon, whereby two elements act in combination. The peculiar relationship existing between base and parasynthetic verb is particularly problematic for an Item and Process theoretical perspective since this approach entails the application of one process at a time. Conversely, a Word and Paradigm framework deals more easily with parasynthetic patterns, as parasynthetic verbs are put in relation with prefixed verbs and verbs formed by conversion, without being undermined either by gaps in derivational patterns or by the possible concomitant addition of prefixes and suffixes. Due to their peculiar structure, parasynthetic verbs have been a matter of investigation even for non-specialists of Romance languages, especially from a synchronic (or, better said, achronic) point of view. Attention has also been paid to their diachronic development in that, despite being characteristic of the Romance languages, parasynthetic verbs were already present, although to a lesser extent, in Latin. 
The diachronic development of parasynthetic verbs is strictly connected with that of spatial verb prefixes from Latin to the Romance languages, with particular reference to their loss of productivity in the encoding of spatial meanings and their grammaticalization into actionality markers. Parasynthetic verbs have been in the Romance languages since their earliest stages and have shown constant productivity and diffusion in all the Romance varieties, thus differing from spatial prefixes, which underwent a strong reduction in productivity in combination with verbs. The term parasynthetic is sometimes also used to refer to nouns and adjectives derived from compounds or in which both a prefix and a suffix are attached to a lexical base. In the case of nominal and adjectival formation, there is much less consensus among scholars on the need to use this term, as well as on which processes should fall under this label. The common denominator of such cases consists either in the non-attestation of presumed intermediate stages (Sp. corchotaponero ‘relative to the industry of cork plugs’) or in the non-correspondence between sense and structure of the morphologically complex word (Fr. surnaturel ‘supernatural’).


Morphology and Language Documentation  

Yuni Kim

What does it mean to document the morphology of a language, and how does one go about such a task? Most of the world’s languages are arguably underdocumented, yet morphological generalizations often require large amounts of primary data: thousands of word forms could be needed to establish basic patterns of allomorphy, for example, or the structure of an inflection-class system. Because of this, the major debates in the language documentation literature affect the field of morphology by shaping the nature of the data. A starting point is the idea that traditional methods of elicitation, often via translation from a contact language and inevitably requiring a patient speaker, can mask ingrained assumptions about the ontology of data and the wider context of linguistic research. Critical examination of these assumptions yields a wider range of possible approaches that can be drawn on to produce a corpus theorization (i.e., a rationale for the types of communicative events to be recorded) appropriate to each language situation. In particular, it has been argued that it is sometimes not ethical to collect language data in a decontextualized way that prioritizes (or appears to prioritize) the linguist’s goals above speakers’ goals, where those are not the same. Thus, in morphology, where virtually everyone agrees that some type of elicitation is essential, creativity and flexibility are sometimes needed to address or modify research questions. Fortunately, documentary linguistics has seen significant advances in the theory and practice of data management, making it possible to work efficiently with data from a wide variety of recording-session structures. Of equal interest are the reasons why a decontextualized approach may be undesirable, even for the linguist’s analytical purposes. The goal of ‘documenting morphology’ is an abstract one; one can only really document word forms, and morphological structure is a product of analysis. From this fact arise a few problems. 
First, and even independently of the ethical issues referred to above, it is not always obvious what methods are most reliable for getting speakers to produce word forms or for understanding speakers’ knowledge about them. Different methods have complementary pros and cons, so it is usually necessary to use a mix. When working with existing data, an appreciation of the complexities of the data gathering process is useful for developing a critical approach to the background contexts, strengths, and limitations of primary sources. Second, ‘documentation’ implies a reasonable level of comprehensiveness. For many semantically or functionally defined phenomena, it is possible to make a cross-linguistically robust checklist that ensures that one has more or less covered the relevant territory. It is much less straightforward to compile an inventory of structures in any formal domain, particularly given cross-linguistic variation in morphological vs. syntactic vs. prosodic encoding of similar functional categories. In morphology, the linguist’s inventory of phenomena often keeps expanding until nearly all grammatical constructions and large numbers of lexical items have been encountered. Again, this challenge can be addressed by using a mix of methods and genres to check that one has a correct understanding of at least the most commonly occurring patterns. Spontaneous speech tends to contain constructions that fail to show up in elicitation for reasons like pragmatics or interference from the contact language, while structured elicitation or metalinguistic work is needed to fully investigate the word-formation patterns within each of those constructions, or indeed (if the linguist is nonnative) to get enough of a foothold to work with spontaneous speech at all. Checklists from the viewpoint of morphological typology tend to initially be most useful for monitoring and organizing, and later for filling gaps at a more advanced stage of research.


Valency in the Romance Languages  

Steffen Heidinger

The notion of valency describes the property of verbs to open argument positions in a sentence (e.g., the verb eat opens two argument positions, filled in the sentence John ate the cake by the subject John and the direct object the cake). Depending on the number of arguments, a verb is avalent (no argument), monovalent (one argument), bivalent (two arguments), or trivalent (three arguments). In Romance languages, verbs are often labile (i.e., they occur in more than one valency pattern without any formal change on the verb). For example, the (European and Brazilian) Portuguese verb adoecer ‘get sick’/‘make sick’ can be used both as a monovalent and a bivalent verb (O bebê adoeceu ‘The baby got sick’ vs. O tempo frio adoeceu o bebê ‘The cold weather made the baby sick’). However, labile verbs are not equally important in all Romance languages. Taking the causative–anticausative alternation as an example, labile verbs are used more frequently in the encoding of the alternation in Portuguese and Italian than in Catalan and Spanish; the latter languages frequently resort to an encoding with a reflexively marked anticausative verb (e.g., Spanish romperse ‘break’). Romance languages possess various formal means to signal that a given constituent is an argument: word order, flagging the argument (by means of morphological case and, more importantly, prepositional marking), and indexing the argument on the verb (by means of morphological agreement or clitic pronouns). Again, Romance languages show variation with respect to the use of these formal means. For example, prepositional marking is much more frequent than morphological case marking on nouns (the latter being only found in Romanian).


Infinitival Clauses in the Romance Languages  

Guido Mensching

“Infinitival clauses” are constructions with a clausal status whose predicate is an infinitive. Romance infinitival clauses are mostly dependent clauses and can be divided into the following types: argumental infinitival clauses (such as subject and object clauses, the latter also including indirect interrogatives), predicative infinitival clauses, infinitival adjunct clauses, infinitival relative clauses, and nominalized infinitival clauses (with a determiner). More rarely, they appear as independent (main) clauses (root infinitival clauses) of different types, which usually have a marked character. Whereas infinitival adjunct clauses are generally preceded by prepositions, which can be argued to be outside the infinitival clause proper (i.e., the clause is part of a prepositional phrase), Romance argumental infinitival clauses are often introduced by complementizers that are diachronically derived from prepositions, mostly de/di and a/à. In most Romance languages, the infinitive itself is morphologically marked by an ending containing the morpheme {r} but lacks tense and agreement morphemes. However, some Romance languages have developed an infinitive that can be inflected for subject agreement (found in Portuguese, Galician, and Sardinian, and also attested in Old Neapolitan). Romance languages share with English and other languages the property of leaving the subject of infinitival clauses unexpressed (subject/object control, arbitrary control, and optional control) and also have raising and accusative-and-infinitive constructions. A special property of many Romance languages is the possibility of overtly expressing a nominative subject in infinitival clauses, mostly in postverbal position. The tense of the infinitival clause is usually interpreted as simultaneous with or anterior to that of the matrix clause, but some matrix predicates and infinitival constructions trigger a posteriority/future reading. In addition, some Romance infinitival clauses are susceptible to constraints concerning aspect and modality.


French-Based Creole Languages  

Anne Zribi-Hertz

French-based creole languages (FBCLs) may be characterized as a group by one historical and two linguistic properties. Their shared historical feature is that they arose between the 16th and 19th centuries as vehicular (hence oral) languages in French colonies, through language contact between the colonial variety of French spoken by the French settlers and the typologically and genetically diverse languages spoken upon arrival by the imported slaves (or by the imported workers or the local people in the case of Tayo, which emerged in the 19th century after the abolition of slavery and whose status as an FBCL is controversial). The linguistic features characterizing FBCLs are (a) that their lexicon is dominantly derived from French while their phonology and morphosyntax are both reminiscent of, and different from, those of known dialectal varieties of French; and (b) that they stand as first languages (L1s), that is, they are acquired by children and are used for all-purpose communication, as opposed to pidgins, which are contact languages used only as vehicular L2s for specific interaction purposes (e.g., trade). Beyond these broad defining features, there is much variation among FBCLs with respect to the locations, periods, and historical conditions of their emergence; the relevant contact languages involved in their development; and the grammatical properties of the resulting creoles. The details of the linguistic change process known as creolization are also yet to be settled. FBCLs thus defined currently include, on the American continent: Guyanese (in French Guiana), Karipúna (in Brazil, near the French Guiana border), and Louisiana Creole (in Louisiana, where it is in decline); in the Caribbean: Haitian (in the independent Republic of Haiti), St. Lucian (in the state of Sainte-Lucie), and the creoles spoken in the French-controlled territories of Martinique, Guadeloupe, Dominique, Saint-Barthélémy, and the northern part of Saint-Martin; in the Indian Ocean, off the shores of Eastern Africa: Mauritian (in Mauritius), Seychelles Creole (in the Seychelles), Rodrigues Creole (on Rodrigues Island, controlled by Mauritius), and Reunion Creole (on the island of Reunion, a French-controlled territory); and in Southern New Caledonia: Tayo.


Caucasian Languages  

Marina Chumakina

Languages from at least five genetically unrelated families are spoken in the Caucasus, but there are only three endemic linguistic families belonging to the region: Kartvelian, West Caucasian, and Northeast Caucasian. These families are rather heterogeneous in terms of the number of languages and the distribution of the speakers across them. The Caucasus represents a situation where languages with millions of speakers have coexisted with one-village languages for hundreds of years, and where multilingualism has always been the norm. The richness of Caucasian languages on every linguistic stratum is dazzling: here we find some of the largest consonant inventories, inflectional systems where the mere number of word forms strains credibility (one of the Caucasian languages, Archi, is claimed to have over a million and a half word forms), and challenging syntactic structures. The typological interest of the Caucasian languages and the challenges they present to linguistic theory lie in different areas. Thus, for Kartvelian languages, the number of factors at play in the verbal system makes the task of producing a correct verbal form far from trivial. West Caucasian languages represent an instance of polysynthetic polypersonal verb inflection, which is unusual not only for the Caucasus but for Eurasia in general. East Caucasian languages have large systems of non-finite forms which, unusually, retain the ability to realize agreement in gender and number while their non-finite nature is determined by the inability to head an independent clause and to express certain morpho-syntactic categories such as illocutionary force and evidentiality. Finally, all Caucasian languages are ergative to some extent.


Morphology in Indo-European Languages  

Paolo Milizia

Indo-European languages of the most archaic type, such as Old Indic and Ancient Greek, have rich fusional morphologies with predominant use of suffixation and ablaut as formal devices. The presence of cumulative inflectional morphs in final position is also a general IE feature. A noteworthy property of the archaic IE morphological system is its root-based organization. This is well observable in Old Indo-Aryan, where the mental lexicon is largely made up of roots unspecified for word-class membership. In the historical development of the different IE branches, recurrent phenomena are observed that lead to an increase in configurationality and a decrease in the degree of synthesis (use of adpositions at the expense of case forms, rise of auxiliaries and increasing employment of periphrastic morphology, creation of determiners). However, not all the documented developments can be subsumed under the rubric ‘morphological decay’: new synthetic verbal forms, which often coexist with the inherited ones, are frequently created via resynthesization of periphrases; new nominal case forms are sometimes created through univerbation of adpositional phrases; instances of prefixation recurrently arise from former compound structures consisting of adverb (‘preverb’) + verb. The formation of inflectional paradigms with several mutually unpredictable subsections and of relatively complex systems of inflectional classes is also observed in various IE languages. The same holds for the rise of new patterns of morphophonological alternations, which often allow the preservation of several morphological oppositions even after the loss of inflectional endings. As a consequence, modern IE languages may exhibit higher degrees of fusionality, at least in specific morphological subsystems, than their diachronic predecessors. In the various branches, the system of inflectional morphology could undergo several reshapings at the level of both the structure of grammatical categories and the formal organization of paradigms, sometimes with noteworthy typological changes. The poor morphology of English, the agglutinative nominal inflections of Ossetic and New Armenian, and the lack of verbal inflection for number and the presence of numeral classifiers in Eastern New Indo-Aryan varieties are among the examples of extreme departure from the ancient IE morphological type. A common development concerning word formation is the decline of the root-based organization of morphology.