Abstract words such as Fr. livraison ‘delivery’, It. fedeltà ‘faithfulness’, Sp. semejanza ‘resemblance’, belong to the word class of nouns. They do not possess materiality and therefore lack sensory perceivability. Within the spectrum of nouns, abstract nouns are located on the opposite side of proper names; between them, there are common nouns, collective nouns, and mass nouns. Abstract nouns are in part non-count and not able to be pluralized. In terms of meaning, there is typically a threefold division in groups: (a) Action/result nouns (e.g., Fr. lavage ‘washing’, It. giuramento ‘oath’, Sp. mordedura ‘bite’); (b) Quality nouns (e.g., Fr. dignité ‘dignity’, It. biancore ‘whiteness’, Sp. modestia ‘modesty’); and (c) Status nouns (e.g., Fr. episcopat ‘episcopate’, It. cuginanza ‘cousinhood’, Sp. almirantazgo ‘admiralship’). From a purely morphological standpoint, a classification of abstract nouns according to derivation basis appears suitable: (a) (primary) denominal abstract nouns (e.g., Fr. duché ‘dukedom’, It. linguaggio ‘language’, Sp. añada ‘vintage’); (b) (primary) deadjectival abstract nouns (e.g., Fr. folie ‘madness’, It. bellezza ‘beauty’, Sp. cortesía ‘courtesy’); and (c) (primary) deverbal abstract nouns (e.g., Fr. mouvement ‘movement’, It. scrittura ‘writing’, Sp. venganza ‘revenge’). Other abstract nouns arise from conversion, for example, Fr. le devoir ‘duty’, It. il freddo ‘coldness’, Sp. el cambio ‘change’. In light of this, the question of how far the formation of abstract nouns in Romance languages follows Latin patterns (derivation with suffixes) or whether new processes emerge is of particular interest. In addition, the individual Romance languages display different preferences in choosing abstract-forming morphological processes. On the one hand, there is a large number of Latin abstract-forming suffixes whose outcomes preserve the same function in the Romance languages, such as -ía (astrología ‘astrology’), -ura (scriptura ‘writing’), -ĭtia (pigrĭtia ‘sloth’), -io (oratio ‘speaking’). Furthermore, there is a group of Latin suffixes that gave rise to suffixes deriving abstract nouns only in Romance. Among these are, for example, -aticu (Fr. péage ‘road toll’, Sp. hallazgo ‘discovery’), -aceu (Sp. cuchillazo ‘knife thrust’), -aria (Sp. borrachera ‘drunkenness’, It. vecchiaia ‘old age’). On the other hand, suffixless processes of abstract noun formation are coming to full fruition only in Romance: The conversion of past participles (e.g., Fr. vue ‘sight’, It. dormita ‘sleep’, Sp. llegada ‘arrival’) is of special importance. The conversion of infinitives to nouns with abstract meaning is least common in Modern French (e.g., penser ‘thought’) and most common in Romanian (iertare ‘pardon’, durere ‘pain’, etc.). Deverbal noun formation without suffixes (Fr. amende ‘fine’, It. carica ‘charge’, Sp. socorro ‘help’, etc.), in contrast, is known to have developed a broad pan-Romance geographic spread.
Abstract Nouns in the Romance Languages
Accent in Japanese Phonology
The word accent system of Tokyo Japanese might look quite complex with a number of accent patterns and rules. However, recent research has shown that it is not as complex as has been assumed if one incorporates the notion of markedness into the analysis: nouns have only two productive accent patterns, the antepenultimate and the unaccented pattern, and different accent rules can be generalized if one focuses on these two productive accent patterns. The word accent system raises some new interesting issues. One of them concerns the fact that a majority of nouns are ‘unaccented,’ that is, they are pronounced with a rather flat pitch pattern, apparently violating the principle of obligatoriness. A careful analysis of noun accentuation reveals that this strange accent pattern occurs in some linguistically predictable structures. In morphologically simplex nouns, it typically tends to emerge in four-mora nouns ending in a sequence of light syllables. In compound nouns, on the other hand, it emerges due to multiple factors, such as compound-final deaccenting morphemes, deaccenting pseudo-morphemes, and some types of prosodic configurations. Japanese pitch accent exhibits an interesting aspect in its interactions with other phonological and linguistic structures. For example, the accent of compound nouns is closely related with rendaku, or sequential voicing; the choice between the accented and unaccented patterns in certain types of compound nouns correlates with the presence or absence of the sequential voicing. Moreover, whether the compound accent rule applies to a certain compound depends on its internal morphosyntactic configuration as well as its meaning; alternatively, the compound accent rule is blocked in certain types of morphosyntactic and semantic structures. Finally, careful analysis of word accent sheds new light on the syllable structure of the language, notably on two interrelated questions about diphthong-hood and super-heavy syllables. It provides crucial insight into ‘diphthongs,’ or the question of which vowel sequence constitutes a diphthong, against a vowel sequence across a syllable boundary. It also presents new evidence against trimoraic syllables in the language.
Affixation in Morphology
Kristel Van Goethem
Affixation is the morphological process that consists of adding an affix (i.e., a bound morpheme) to a morphological base. It is cross-linguistically the most common process that human languages use to derive new lexemes (derivational affixation) or to adapt a word’s form to its morphosyntactic context (inflectional affixation). Suffixes (i.e., bound morphemes following the base) and prefixes (i.e., bound morphemes preceding the base) are the most common affixes, with suffixation being more frequently recorded in the world’s languages than prefixation. Minor types of affixation include circumfixation and infixation. Conversion and back-formation are related derivational processes that do not make use of affixation. Many studies have concentrated on the need to differentiate derivation from inflection, but these morphological processes are probably best described as two end points of a cline. Prototypically, derivation is used to change a word’s category (part of speech) and involves a semantic change. A word’s inflectional distinctions make up its paradigm, which amounts to the different morphological forms that correlate with different morphosyntactic functions. Form-function mapping in (derivational and inflectional) affixation is a key issue in current research on affixation. Many deviations from the canonical One Form-One Meaning principle can be observed in the field of affixation. From a diachronic point of view, it has been demonstrated that affixes often derive from free lexemes by grammaticalization, with affixoids being recognized as an intermediate step on this cline. More controversial, but still attested, is the opposite change whereby affixes and affixoids develop into free morphemes through a process of degrammaticalization.
African Englishes From a Sociolinguistic Perspective
Four types of English exist in Africa, identifiable in terms of history, functions, and linguistic characteristics. West African Pidgin English has a history going back to the 15th century, 400 years before formal colonization. Creole varieties of English have a history going back to repatriation of slaves from the Caribbean and the United States in the 19th century. Second language varieties, which are the most widespread on the continent, are prototypically associated with British colonization and its education systems. L1 (First language) English occurred mostly in Southern and East Africa, and is best represented in South Africa. The latter shows significant similarities with the other major Southern Hemisphere varieties of English in Australia and New Zealand. All four subgroups of English are growing in numbers.
Afroasiatic languages are the fourth largest linguistic phylum, spoken by some 350 million people in North, West, Central, and East Africa, in the Middle East, and in scattered communities in Europe, the United States, and the Caucasus. Some Afroasiatic languages, such as Arabic, Hausa, Amharic, Somali, and Oromo, are spoken by millions of people, while others are endangered with extinction. As of the early 21st century, the phylum is composed of six families: Egyptian (extinct), Semitic, Cushitic, Omotic, Berber, and Chadic. There are some typological features shared by all families, particularly in the domain of phonology. Languages are also typologically quite distinct with respect to syntax and functions encoded in the grammatical systems. Some Afroasiatic languages, such as Egyptian, Akkadian, Phoenician, Hebrew, Arabic, and Ge’ez, have a longtime written tradition, but for many languages no writing system has yet been proposed or adopted. The Old Semitic writing system gave rise to the modern alphabets used in thousands of unrelated contemporary languages. Two Semitic languages, Hebrew (with some Aramaic) and Arabic, were used to write the Old Testament and the Koran, the holy books of Judaism and Islam.
Agreement in Germanic
Haldór Ármann Sigurðsson
There are four major types of agreement in Germanic: finite verb agreement, primary predicate agreement, secondary predicate agreement, and DP-internal concord, and there is extensive variation among the Germanic languages across all these agreement phenomena. Icelandic commonly has five distinct person/number forms of verbs, while Afrikaans and the mainland Scandinavian languages (Danish, Norwegian, Swedish) have no person/number distinctions of verbs, with the other languages positioning themselves between these extremes. Standard varieties of West-Germanic languages (Afrikaans, Dutch, English, German, Yiddish, West-Frisian,) have no predicate agreement, whereas standard varieties of Scandinavian languages (Icelandic, Faroese, Norwegian, Danish, Swedish) all have both primary and secondary predicate agreement. There is, however, quite some variation in predicate agreement within the Scandinavian languages. The Mainland Scandinavian languages have gender/number agreement of both primary and secondary predicates, albeit with some variation as to whether only predicative adjectives or both predicative adjectives and past participles show agreement (and also as to when past participles show agreement). The Insular Scandinavian languages (Faroese and Icelandic), on the other hand, have case agreement, in addition to gender/number agreement, of both primary and secondary predicates, either adjectives or past participles (e.g., Icelandic primary predicate agreement: “He.nom.m.sg was drunk.nom.m.sg”; secondary predicate agreement: “She.nom.f.sg met him.acc.m.sg drunk.acc.m.sg” [he was drunk] versus. “She.nom.f.sg met him.acc.m.sg drunk.nom.f.sg” [she was drunk]). Case agreement in these two languages commonly disambiguates secondary predicate structures. Afrikaans has no concord of DP-internal modifiers (articles, adjectives, etc.), whereas the other West-Germanic languages have some DP-concord, poorest in English, richest in German (which has gender/number/case concord of a number of categories, most clearly the articles). The Mainland Scandinavian languages also have some DP-concord (gender/number), while DP-concord is extensive in Faroese and Icelandic (gender/number/case of articles, demonstrative determiners, adjectives, some numerals, indefinite pronouns, floating quantifiers, and some possessive pronouns). The Germanic languages are a relatively small and closely knit language family, so the extensive agreement variation within this small family is a major challenge to any general theory of agreement.
Allomorphy and Syncretism in the Romance Languages
Allomorphy and syncretism are both deviations from the one-to-one relationship between form and meaning inside the linguistic sign as postulated by Saussure as well as from the ideal of inflectional morphology as stipulated in the canonical approach by Corbett. Instances of both phenomena are well documented in all Romance languages. In inflection, allomorphy refers to the use of more than one root/stem in the paradigm of a single lexeme or to the existence of more than one inflectional affix for the same function. Syncretism describes the existence of identical forms with different functions in one and the same paradigm. Verbs exhibiting stem allomorphy are traditionally called irregular, a label that describes the existence of unexpected and, sometimes, unpredictable forms from a learner’s perspective. Extreme forms of allomorphy are called suppletion, for which traditional accounts require two or more etymologically unrelated roots/stems to coexist within the paradigm of a single lexeme. Allomorphy often originates in sound change affecting only stems in a certain phonological environment. When the phonological conditioning of the stem allomorph disappears, which is frequently the case, its distribution within the paradigm may become purely morphological, thus constituting a morphome in the sense of Aronoff. Recurrent patterns of syncretism may also be considered morphomes. Whereas syncretism was quite rare in Latin verb morphology, Romance languages feature it to much greater, if different, degrees. In extreme cases, syncretism patterns become paradigm-structuring in many Gallo-Romance varieties, as is the case in the verb morphology of standard French, where almost all forms are syncretic with at least one other.
“Altaic” is a common term applied by linguists to a number of language families, spread across Central Asia and the Far East and sharing a large, most likely non-coincidental, number of structural and morphemic similarities. At the onset of Altaic studies, these similarities were ascribed to the one-time existence of an ancestral language—“Proto-Altaic,” from which all these families are descended; circumstantial evidence and glottochronological calculations tentatively date this language to some time around the 6th–7th millennium bc, and suggest Southern Siberia or adjacent territories (hence the name “Altaic”) as the original homeland of its speakers. However, since the mid-20th century the dominant view in historical linguistics has shifted to that of an “Altaic Sprachbund” (diffusion area), implying that the families in question have not sprung from a common source, but rather have acquired their similarities over a long period of mutual linguistic contact. The bulk of “Altaic” has traditionally included such uncontroversial families as Turkic, Mongolic, and Manchu-Tungusic; additionally, Japanese (Japonic) and Korean are also frequently seen as potential members of the larger Altaic family (the entire five branches are sometimes referred to as “Macro-Altaic”). The debate over the nature of the relationship between the various units that constitute “Altaic,” sometimes referred to as “the Altaic controversy,” has been one of the most hotly debated topics in 20th-century historical linguistics and a major focal point of studies dealing with the prehistory of Central and East Eurasia. Supporters of “Proto-Altaic,” commonly known as “(pro-)Altaicists,” claim that only divergence from an original common ancestor can account for the observed regular phonetic correspondences and other structural similarities, whereas “anti-Altaicists,” without denying the existence of such similarities, insist that they do not belong to the “core” layers of the respective languages and are therefore better explained as results of lexical borrowing and other forms of areal linguistic contact. As a rule, “pro-Altaicists” claim that “Proto-Altaic” is as reconstructible by means of the classic comparative method as any uncontroversial linguistic family; in support of this view, they have produced several attempts to assemble large bodies of etymological evidence for the hypothesis, backed by systems of regular phonetic correspondences between compared languages. All of these, however, have been heavily criticized by “anti-Altaicists” for lack of methodological rigor, implausibility of proposed phonetic and/or semantic changes, and confusion of recent borrowings with items allegedly inherited from a common ancestor. Despite the validity of many of these objections, it remains unclear whether they are sufficient to completely discredit the hypothesis of a genetic connection between the various branches of “Altaic,” which continues to be actively supported by a small, but stable scholarly minority.
Anaphora in Dravidian
K. A. Jayaseelan
The Dravidian languages have a long-distance reflexive anaphor taan . (It is taan in Tamil and Malayalam, taanu in Kannada and tanu in Telugu.) As is the case with other long-distance anaphors, it is subject-oriented; it is also [+human] and third person. Interestingly, it is infelicitous if bound within the minimal clause when it is an argument of the verb. (That is, it seems to obey Principle B of the binding theory.) Although it is subject-oriented in the normal case, it can be bound by a non-subject if the verb is a “psych predicate,” that is, a predicate that denotes a feeling; in this case, it can be bound by the experiencer of the feeling. Again, in a discourse that depicts the thoughts, feelings, or point of view of a protagonist—the so-called “logophoric contexts”—it can be coreferential with the protagonist even if the latter is mentioned only in the preceding discourse (not within the sentence). These latter facts suggest that the anaphor is in fact coindexed with the perspective of the clause (rather than with the subject per se). In cases where this anaphor needs to be coindexed with the minimal subject (to express a meaning like ‘John loves himself’), the Dravidian languages exhibit two strategies to circumvent the Principle B effect. Malayalam adds an emphasis marker tanne to the anaphor; taan tanne can corefer with the minimal subject. This strategy parallels the strategy of European languages and East Asian languages (cf. Scandinavian seg selv). The three other major Dravidian languages—Tamil, Telugu, and Kannada—use a verbal reflexive: they add a light verb koL- (lit. ‘take’) to the verbal complex, which has the effect of reflexivizing the transitive predicate. (It either makes the verb intransitive or gives it a self-benefactive meaning.) The Dravidian languages also have reciprocal and distributive anaphors. These have bipartite structures. An example of a Malayalam reciprocal anaphor is oral … matte aaL (‘one person … other person’). The distributive anaphor in Malayalam has the form awar-awar (‘they-they’); it is a reduplicated pronoun. The reciprocals and distributives are strict anaphors in the sense that they apparently obey Principle A; they must be bound in the domain of the minimal subject. They are not subject-oriented. A noteworthy fact about the pronominal system of Dravidian is that the third person pronouns come in proximal-distal pairs, the proximal pronoun being used to refer to something nearby and the distal pronoun being used elsewhere.
Argument Realization and Case in Japanese
Japanese is a language where the grammatical status of arguments and adjuncts is marked exclusively by postnominal case markers, and various argument realization patterns can be assessed by their case marking. Since Japanese is categorized as a language of the nominative-accusative type typologically, the unmarked case-marking frame obtained for transitive predicates of the non-stative (or eventive) type is ‘nominative-accusative’. Nevertheless, transitive predicates falling into the stative class often have other case-marking alignments, such as ‘nominative-nominative’ and ‘dative-nominative’. Consequently, Japanese provides much more varying argument realization patterns than those expected from its typological character as a nominative-accusative language. In point of fact, argument marking can actually be much more elastic and variable, the variations being motivated by several linguistic factors. Arguments often have the option of receiving either syntactic or semantic case, with no difference in the logical or cognitive meaning (as in plural agent and source agent alternations) or depending on the meanings their predicate carry (as in locative alternation). The type of case marking that is not normally available in main clauses can sometimes be obtained in embedded contexts (i.e., in exceptional case marking and small-clause constructions). In complex predicates, including causative and indirect passive predicates, arguments are case-marked differently from their base clauses by virtue of suffixation, and their case patterns follow the mono-clausal case array, despite the fact that they have multi-clausal structures. Various case marking options are also made available for arguments by grammatical operations. Some processes instantiate a change on the grammatical relations and case marking of arguments with no affixation or embedding. Japanese has the grammatical process of subjectivization, creating extra (non-thematic) major subjects, many of which are identified as instances of ‘possessor raising’ (or argument ascension). There is another type of grammatical process, which reduces the number of arguments by virtue of incorporating a noun into the predicate, as found in the light verb constructions with suru ‘do’ and the complex adjective constructions formed on the negative adjective nai ‘non-existent.’
Arthur Seymour Abramson (1925–2017) was an American linguist who was prominent in the international experimental phonetics research community. He was best known for his pioneering work, with Leigh Lisker, on voice onset time (VOT), and for his many years spent studying tone and voice quality in languages such as Thai. Born and raised in Jersey City, New Jersey, Abramson served several years in the Army during World War II. Upon his return to civilian life, he attended Columbia University (BA, 1950; PhD, 1960). There he met Franklin Cooper, an adjunct who taught acoustic phonetics while also working for Haskins Laboratories. Abramson started working on a part-time basis at Haskins and remained affiliated with the institution until his death. For his doctoral dissertation (1962), he studied the vowels and tones of the Thai language, which would sit at the heart of his research and travels for the rest of his life. He would expand his investigations to include various languages and dialects, such as Pattani Malay and the Kuai dialect of Suai, a Mon-Khmer language. Abramson began his collaboration with University Pennsylvania linguist Leigh Lisker at Haskins Laboratories in the 1960s. Using their unique VOT technique, a sensitive measure of the articulatory timing between an occlusion in the vocal tract and the beginning of phonation (characterized by the onset of vibration of the vocal folds), they studied the voicing distinctions of various languages. Their long standing collaboration continued until Lisker’s death in 2006. Abramson and colleagues often made innovative use of state-of-art tools and technologies in their work, including transillumination of the larynx in running speech, X-ray movies of speakers in several languages/dialects, electroglottography, and articulatory speech synthesis. Abramson’s career was also notable for the academic and scientific service roles that he assumed, including membership on the council of the International Phonetic Association (IPA), and as a coordinator of the effort to revise the International Phonetic Alphabet at the IPA’s 1989 Kiel Convention. He was also editor of the journal Language and Speech, and took on leadership roles at the Linguistic Society of America and the Acoustical Society of America. He was the founding Chair of the Linguistics Department at the University of Connecticut, which became a hotbed for research in experimental phonetics in the 1970s and 1980s because of its many affiliations with Haskins Laboratories. He also served for many years as a board member at Haskins, and Secretary of both the Board and the Haskins Corporation, where he was a friend and mentor to many.
Alan Reed Libert
Artificial languages—languages which have been consciously designed—have been created for more than 900 years, although the number of them has increased considerably in recent decades, and by the early 21st century the total figure probably was in the thousands. There have been several goals behind their creation; the traditional one (which applies to some of the best-known artificial languages, including Esperanto) is to make international communication easier. Some other well-known artificial languages, such as Klingon, have been designed in connection with works of fiction. Still others are simply personal projects. A traditional way of classifying artificial languages involves the extent to which they make use of material from natural languages. Those artificial languages which are created mainly by taking material from one or more natural languages are called a posteriori languages (which again include well-known languages such as Esperanto), while those which do not use natural languages as sources are a priori languages (although many a posteriori languages have a limited amount of a priori material, and some a priori languages have a small number of a posteriori components). Between these two extremes are the mixed languages, which have large amounts of both a priori and a posteriori material. Artificial languages can also be classified typologically (as natural languages are) and by how and how much they have been used. Many linguists seem to be biased against research on artificial languages, although some major linguists of the past have been interested in them.
Balkan-Romance is represented by Romanian and its historical dialects: Daco-Romanian (broadly known as Romanian), Aromanian, Megleno-Romanian, and Istro-Romanian (see article “Morphological and Syntactic Variation and Change in Romanian” in this encyclopedia). The external history of these varieties is often unclear, given the historical events that took place in the Lower Danubian region: the conquest of this territory by the Roman Empire for a short period and the successive Slavic invasions. Moreover, the earliest preserved writing in Romanian only dates from the 16th century. Between the Roman presence in the Balkans and the first attested text, there is a gap of more than 1,000 years, a period in which Romanian emerged, the dialectal separation took place, and the Slavic influence had effects especially on the lexis of Romanian. In the 16th century, in the earliest old Romanian texts, the language already displayed the main features of modern Romanian: the vowels /ə/ and /ɨ/; the nominative-accusative versus genitive-dative case distinction; analytical case markers, such as the genitive marker al; the functional prepositions a and la; the proclitic genitive-dative marker lui; the suffixal definite article; polydefinite structures; possessive affixes; rich verbal inflection, with both analytic and synthetic forms and with three auxiliaries (‘have’, ‘be’, and ‘want’); the supine, not completely verbalized at the time; two types of infinitives, with the ‘short’ one on a path toward becoming verbal and the ‘long’ one specializing as a noun; null subjects; nonfinite verb forms with lexical subjects; the mechanism for differential object marking and clitic doubling with slightly more vacillating rules than in the present-day language; two types of passives; strict negative concord; the SVO and VSO word orders; adjectives placed mainly in the postnominal position; a rich system of pronominal clitics; prepositions requiring the accusative and the genitive; and a large inventory of subordinating conjunctions introducing complement clauses. Most of these features are also attested in the trans-Danubian varieties (Aromanian, Megleno-Romanian, and Istro-Romanian), which were also strongly influenced by the various languages they have entered in direct contact with: Greek, Albanian, Macedonian, Croatian, and so forth. These source languages have had a major influence in the vocabulary of the trans-Danubian varieties and certain consequences in the shape of their grammatical system. The differences between Daco-Romanian and the trans-Danubian varieties have also resulted from the preservation of archaic features in the latter or from innovations that took place only there.
Bert Le Bruyn, Henriëtte de Swart, and Joost Zwarts
Bare nominals (also called “bare nouns”) are nominal structures without an overt article or other determiner. The distinction between a bare noun and a noun that is part of a larger nominal structure must be made in context: Milk is a bare nominal in I bought milk, but not in I bought the milk. Bare nouns have a limited distribution: In subject or object position, English allows bare mass nouns and bare plurals, but not bare singular count nouns (*I bought table). Bare singular count nouns only appear in special configurations, such as coordination (I bought table and chairs for £182). From a semantic perspective, it is noteworthy that bare nouns achieve reference without the support of a determiner. A full noun phrase like the cookies refers to the maximal sum of cookies in the context, because of the definite article the. English bare plurals have two main interpretations: In generic sentences they refer to the kind (Cookies are sweet), in episodic sentences they refer to some exemplars of the kind (Cookies are in the cabinet). Bare nouns typically take narrow scope with respect to other scope-bearing operators like negation. The typology of bare nouns reveals substantial variation, and bare nouns in languages other than English may have different distributions and meanings. But genericity and narrow scope are recurring features in the cross-linguistic study of bare nominals.
Berber-Arabic Language Contact
Since the start of the Islamic conquest of the Maghreb in the 7th century ce, Berber and Arabic have been in continual contact. This has led to large-scale mutual influence. The sociolinguistic setting of this influence is not the same, though; Arabic influence on Berber is found in a situation of language maintenance with widespread bilingualism, while Berber influence on Arabic is no doubt to a large degree due to language shift by Berber speakers to Arabic. Linguistic influence is found on all levels: phonology, morphology, syntax, and lexicon. In those cases where only innovative patterns are shared between the two language groups, it is often difficult to make out where the innovation started; thus the great similarities in syllable structure between Maghrebian Arabic and northern Berber are the result of innovations within both language families, and it is difficult to tell where it started. Morphological influence seems to be mediated exclusively by lexical borrowing. Especially in Berber, this has led to parallel systems in the morphology, where native words always have native morphology, while loans either have nativized morphology or retain Arabic-like patterns. In the lexicon, it is especially Berber that takes over scores of loanwords from Arabic, amounting in one case to over one-third of the basic lexicon as defined by 100-word lists.
Binding in Germanic
Eric Reuland and Martin Everaert
All languages have expressions, typically pronominals and anaphors, that may or must depend for their interpretation on another expression, their antecedent. When such a dependency is subject to structural conditions, it reflects binding. Although there is considerable variation in binding patterns cross-linguistically, in fact, variation is along a limited set of parameters. The Germanic languages exemplify some of the main factors involved. In Germanic, third-person pronominals generally do not allow binding by a co-argument. However, in Frisian and Afrikaans, they do, being embedded in a richer structure than meets the eye. In Continental West Germanic and Scandinavian, anaphors come in two types: simplex anaphors (SE-anaphors)—deficient for number and gender—and complex anaphors (SELF-anaphors). These typically consist of a pronominal or SE-anaphor combined with an element like Dutch zelf ‘self’ or one of its cognates. In all the Germanic languages SELF-anaphors are bound in their local domain—approximately the domain of their nearest subject—except in a few identifiable positions, where they are interpreted logophorically. That is, they accept a non-local antecedent, provided this element holds the perspective of the sentence. The distribution of SE-anaphors involves three different conditions. First, they can be bound by a co-argument only if the verb belongs to a restricted class, which allows syntactic detransitivization. Second, in general, SE-anaphors allow non-local binding. But the conditions differ among subgroups. In Dutch and German, they can only be bound non-locally when contained in a causative or perception verb complement or a small clause. In Mainland Scandinavian, non-local binding is, in principle, available to all infinitival clauses (subject to some dialectal variation). For instance, in some varieties of Norwegian, referentiality of intervening subjects restricts binding; in other varieties, the restricting factor is not “finiteness” but “being specified for tense.” Third, in Icelandic long-distance antecedents beyond the infinitival domain are licensed by a subjunctive, together with the requirement that the antecedent holds the perspective. Faroese largely patterns like Icelandic, although lacking a subjunctive. However, the class of verbs that allow this pattern coincides with the class of verbs in Icelandic that have a subjunctive complement. Non-local binding of SE-anaphors is sensitive to the requirement that the antecedent be animate, but the languages show differences in the details. Unlike the West Germanic languages, the Scandinavian languages all have a possessive reflexive in third person. In general, their distribution appears to be quite close to that of SE-anaphors, but this is subject to dialectal variation, with various differences in the details.
Case Markers in Indo-Aryan
Indo-Aryan languages have the longest documented historical record, with the earliest attested texts going back to 1900 bce. Old Indo-Aryan (Vedic, Sanskrit) had an inflectional case-marking system where nominatives functioned as subjects. Objects could be realized via several different case markers (depending on semantic and structural factors), but not the nominative. This inflectional system was lost over the course of several centuries during Middle Indo-Aryan, resulting in just a nominative–oblique inflectional distinction. The New Indo-Aryan languages innovated case markers and developed new case-marking systems. Like in Old Indo-Aryan, case is systematically used to express semantic differences via differential object marking constructions. However, unlike in Old Indo-Aryan, many of the New Indo-Aryan languages are ergative and all allow for non-nominative subjects, most prominently for experiencer subjects. Objects, on the other hand, can now also be unmarked (nominative), usually participating in differential object marking. The case-marking patterns within New Indo-Aryan and across time have given rise to a number of debates and analyses. The most prominent of these include issues of case alignment and language change, the distribution of ergative vs. accusative vs. nominative case, and discussions of markedness and differential case marking.
Catalan is a “medium-sized” Romance language spoken by over 10 million speakers, spread over four nation states: Northeastern Spain, Andorra, Southern France, and the city of L’Alguer (Alghero) in Sardinia, Italy. Catalan is divided into two primary dialectal divisions, each with further subvarieties: Western Catalan (Western Catalonia, Eastern Aragon, and Valencian Community) and Eastern Catalan (center and east of Catalonia, Balearic Islands, Rosselló, and l’Alguer). Catalan descends from Vulgar Latin. Catalan expanded during medieval times as one of the primary vernacular languages of the Kingdom of Aragon. It largely retained its role in government and society until the War of Spanish Succession in 1714, and since it has been minoritized. Catalan was finally standardized during the beginning of the 20th century, although later during the Franco dictatorship it was banned in public spaces. The situation changed with the new Spanish Constitution promulgated in 1978, when Catalan was declared co-official with Spanish in Catalonia, the Valencian Community, and the Balearic Islands. The Latin vowel system evolved in Catalan into a system of seven stressed vowels. As in most other Iberian Romance languages, there is a general process of spirantization or lenition of voiced stops. Catalan has a two-gender grammatical system and, as in other Western Romance languages, plurals end in -s; Catalan has a personal article and Balearic Catalan has a two-determiner system for common nouns. Finally, past perfective actions are indicated by a compound tense consisting of the auxiliary verb anar ‘to go’ in present tense plus the infinitive. Catalan is a minoritized language everywhere it is spoken, except in the microstate of Andorra, and it is endangered in France and l’Alguer. The revival of Catalan in the post-dictatorship era is connected with a movement called linguistic normalization. The idea of normalization refers to the aim to return Catalan to a “normal” use at an official level and everyday level as any official language.
Languages from at least five genetically unrelated families are spoken in the Caucasus, but there are only three endemic linguistic families belonging to the region: Kartvelian, West Caucasian, and Northeast Caucasian. These families are rather heterogeneous in terms of the number of languages and the distribution of the speakers across them. The Caucasus represents a situation where languages with millions of speakers have coexisted with one-village languages for hundreds of years, and where multilingualism has always been the norm. The richness of Caucasian languages on every linguistic stratum is dazzling: here we find some of the largest consonant inventories, inflectional systems where the mere number of word forms strains credibility (one of the Caucasian languages, Archi, is claimed to have over a million and a half word forms), and challenging syntactic structures. The typological interest of the Caucasian languages and the challenges they present to linguistic theory lie in different areas. Thus, for Kartvelian languages, the number of factors at play in the verbal system make the task of the production of a correct verbal form far from trivial. West Caucasian languages represent an instance of polysynthetic polypersonal verb inflection, which is unusual not only for Caucasus but for Eurasia in general. East Caucasian languages have large systems of non-finite forms which, unusually, retain the ability to realize agreement in gender and number while their non-finite nature is determined by the inability to head an independent clause and to express certain morpho-syntactic categories such as illocutionary force and evidentiality. Finally, all Caucasian languages are ergative to some extent.
Alessandro De Angelis
Although respective Central (= CIDs) and Southern (= SIDs) Italo-Romance dialects display peculiar linguistic features, they also share a substantial number of common isoglosses such that they can be classified as two subdivisions of the same geolinguistic unit. Some of these are simply represented by the absence of Tuscan features, such as diphthongization in open syllable or anaphonesis. Other features are idiosyncratic and are discussed within the main body of this article, such as: (a) the different types of vowel systems; (b) the two main patterns of metaphony; (c) propagation; (d) phonosyntactic doubling that is not sensitive to stress. Regarding the morphological phenomena present in these varieties, the encliticization of possessives and the loss of both of the future indicative and the present subjunctive will be discussed. With regard to (morpho)syntax, these varieties are known for: (a) the rise of a mass neuter (neo-neuter) class of nouns; (b) an alternating gender value; (c) the extensive use of a dedicated marker to encode the accusative case in highly referential nouns; (d) dual complementizer systems; (e) split intransitivity in auxiliary systems; (f) extensive participial agreement (as well as similar agreement in manner adjectives); and, (g) pseudo-coordination, among other notable phenomena.