This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
The coexistence over centuries of Romance-speaking and Semitic peoples in the Mediterranean area has led to reciprocal linguistic influence and contact phenomena. All Romance languages have been involved in this process, although some of them, such as Romanian, only superficially and indirectly. For what concerns the Semitic counterparts, the major role has been played by Arabic in the Middle Ages, not only in Moorish Spain (711–1492) and in the Emirate of Sicily (831–1072), where interlinguistic contact was daily and intense, but also in Provence and in the main ports of Continental Italy (Pisa, Genoa, Venice), thanks to their commercial relations with the Eastern Mediterranean and North Africa. In addition, a consistent amount of Arabic intellectual lexicon has entered the Romance languages through the translations of scientific treaties, generally through the mediation of Latin. Hebrew has also given a significant contribution, both via the translations of the Bible and, from the Late Middle Ages on, through the oral interaction of the local Jewish communities with non-Jews. In both cases contact has been indirect, since biblical loanwords have been mediated first by Greek and later by Latin, whereas the rest of the borrowings has been transmitted by the Judeo-Romance languages. The other Semitic languages have had no influence on Romance, except for a very limited number of Amharic loanwords to be found in Italian, as a consequence of Italian Colonialism in East Africa (1882–1936).
Although Medieval Spanish and Sicilian display traces of Arabic interference at all levels, including phonology, morphology and syntax, in most Romance languages effects of contact with Semitic are limited to lexicon. These comprise both direct borrowings and structural calques, as far as Arabic and—to a lesser extent—Hebrew are concerned, and pertain to several semantic ambits, such as trade, anatomy, astronomy, and botany (Arabic); religious rituals and practices (Hebrew); and, more generally, daily life, especially in peculiar sociolects and slangs. A case apart is represented by Maltese, a Western Arabic dialect deeply influenced by Italo-Romance (notably Sicilian) from the Middle Ages until the first half of the 20th century, which is a unique example of Romance-Semitic mixing not only at a lexical, but also at a phonological and morphosyntactic level.
Scrambling is one of the most widely discussed and prominent factors affecting word order variation in Korean. Scrambling in Korean exhibits various syntactic and semantic properties that cannot be subsumed under the standard A/A'-movement. Clause-external scrambling as well as clause-internal scrambling in Korean show mixed A/A'-effects in a range of tests such as anaphor binding, weak crossover, Condition C, negative polarity item licensing, wh-licensing, and scopal interpretation. VP-internal scrambling, by contrast, is known to be lack of reconstruction effects conforming to the claim that short scrambling is A-movement. Clausal scrambling, on the other hand, shows total reconstructions effects, unlike phrasal scrambling. The diverse properties of Korean scrambling have received extensive attention in the literature. Some studies argue that scrambling is a type of feature-driven A-movement with special reconstruction effects. Others argue that scrambling can be A-movement or A'-movement depending on the landing site. Yet others claim that scrambling is not standard A/A'-movement, but must be treated as cost-free movement with optional reconstruction effects. Each approach, however, faces non-trivial empirical and theoretical challenges, and further study is needed to understand the complex nature of scrambling. As the theory develops in the Minimalist Program, a variety of proposals have also been advanced to capture properties of scrambling without resorting to A/A'-distinctions.
Scrambling in Korean applies optionally but not randomly. It may be blocked due to various factors in syntax and its interfaces in the grammar. At the syntax proper, scrambling obeys general constraints on movement (e.g., island conditions, left branch condition, coordinate structure condition, proper binding condition, ban on string vacuous movement). Various semantic and pragmatic factors (e.g., specificity, presuppositionality, topic, focus) also play a crucial role in acceptability of sentences with scrambling. Moreover, current studies show that certain instances of scrambling are filtered out at the interface due to cyclic Spell-out and linearization, which strengthens the claim that scrambling is not a free option. Data from Korean pose important challenges against base-generation approaches to scrambling, and lend further credence to the view that scrambling is an instance of movement. The exact nature of scrambling in Korean—whether it is cost-free or feature-driven—must be further investigated in future research, however. The research on Korean scrambling leads us to the pursuit of a general theory, which covers obligatory A/A'-movement as well as optional displacement with mixed semantic effects in languages with free word order.
Empirical and theoretical research on language has recently experienced a period of extensive growth. Unfortunately, however, in the case of the Japanese language, far fewer studies—particularly those written in English—have been presented on adult second language (L2) learners and bilingual children. As the field develops, it is increasingly important to integrate theoretical concepts and empirical research findings in second language acquisition (SLA) of Japanese, so that the concepts and research can be eventually applied to educational practice. This article attempts to: (a) address at least some of the gaps currently existing in the literature, (b) deal with important topics to the extent possible, and (c) discuss various problems with regard to adult learners of Japanese as an L2 and English–Japanese bilingual children. Specifically, the article first examines the characteristics of the Japanese language. Tracing the history of SLA studies, this article then deliberately touches on a wide spectrum of domains of linguistic knowledge (e.g., phonology and phonetics, morphology, lexicon, semantics, syntax, discourse), context of language use (e.g., interactive conversation, narrative), research orientations (e.g., formal linguistics, psycholinguistics, social psychology, sociolinguistics), and age groups (e.g., children, adults). Finally, by connecting past SLA research findings in English and recent/present concerns in Japanese as SLA with a focus on the past 10 years including corpus linguistics, this article provides the reader with an overview of the field of Japanese linguistics and its critical issues.
The study of second language phonetics is concerned with three broad and overlapping research areas: the characteristics of second language speech production and perception, the consequences of perceiving and producing nonnative speech sounds with a foreign accent, and the causes and factors that shape second language phonetics. Second language learners and bilinguals typically produce and perceive the sounds of a nonnative language in ways that are different from native speakers. These deviations from native norms can be attributed largely, but not exclusively, to the phonetic system of the native language. Non-nativelike speech perception and production may have both social consequences (e.g., stereotyping) and linguistic–communicative consequences (e.g., reduced intelligibility). Research on second language phonetics over the past ca. 30 years has resulted in a fairly good understanding of causes of nonnative speech production and perception, and these insights have to a large extent been driven by tests of the predictions of models of second language speech learning and of cross-language speech perception. It is generally accepted that the characteristics of second language speech are predominantly due to how second language learners map the sounds of the nonnative to the native language. This mapping cannot be entirely predicted from theoretical or acoustic comparisons of the sound systems of the languages involved, but has to be determined empirically through tests of perceptual assimilation. The most influential learner factors which shape how a second language is perceived and produced are the age of learning and the amount and quality of exposure to the second language. A very important and far-reaching finding from research on second language phonetics is that age effects are not due to neurological maturation which could result in the attrition of phonetic learning ability, but to the way phonetic categories develop as a function of experience with surrounding sound systems.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
From a typological perspective, the phoneme inventories of Romance languages are of medium size: For instance, most consonant systems contain between 20 and 23 phonemes. An innovation with respect to Latin is the appearance of palatal and palato-alveolar consonants such as /ɲ ʎ/ (Italian, Spanish, Portuguese), /ʃ ʒ/ (French, Portuguese), and /tʃ dʒ/ (Italian, Romanian); a few varieties (e.g., Romansh and a number of Italian dialects) also show the palatal stops /c ɟ/. Besides palatalization, a number of lenition processes (both sonorization and spirantization) have characterized the diachronic development of plosives in Western Romance languages (cf. the French word chèvre “goat” < lat. CĀPRA(M)). Diachronically, both sonorization and spirantization occurred in postvocalic position, where the latter can still be observed as an allophonic rule in present-day Spanish and Sardinian. Sonorization, on the other hand, occurs synchronically after nasals in many southern Italian dialects.
The most fundamental change in the diachrony of the Romance vowel systems derives from the demise of contrastive Latin vowel quantity. However, some Raeto-Romance and northern Italo-Romance varieties have developed new quantity contrasts. Moreover, standard Italian displays allophonic vowel lengthening in open stressed syllables (e.g., /ˈka.ne/ “dog” → [ˈkaːne]. The stressed vowel systems of most Romance varieties contain either five phonemes (Spanish, Sardinian, Sicilian) or seven phonemes (Portuguese, Catalan, Italian, Romanian). Larger vowel inventories are typical of “northern Romance” and appear in dialects of Northern Italy as well as in Raeto- and Gallo-Romance languages. The most complex vowel system is found in standard French with its 16 vowel qualities, comprising the 3 rounded front vowels /y ø œ/ and the 4 nasal vowel phonemes /ɑ̃ ɔ̃ ɛ̃ œ̃/.
Romance languages differ in their treatment of unstressed vowels. Whereas Spanish displays the same five vowels /i e a o u/ in both stressed and unstressed syllables (except for unstressed /u/ in word-final position), many southern Italian dialects have a considerably smaller inventory of unstressed vowels as opposed to their stressed vowels.
The phonotactics of most Romance languages is strongly determined by their typological character as “syllable languages.” Indeed, the phonological word only plays a minor role as very few phonological rules or phonotactic constraints refer, for example, to the word-initial position (such as Italian consonant doubling or the distribution of rhotics in Ibero-Romance), or to the word-final position (such as obstruent devoicing in Raeto-Romance). Instead, a wide range of assimilation and lenition processes apply across word boundaries in French, Italian, and Spanish.
In line with their fundamental typological nature, Romance languages tend to allow syllable structures of only moderate complexity. Inventories of syllable types are smaller than, for example, those of Germanic languages, and the segmental makeup of syllable constituents mostly follows universal preferences of sonority sequencing. Moreover, many Romance languages display a strong preference for open syllables as reflected in the token frequency of syllable types. Nevertheless, antagonistic forces aiming at profiling the prominence of stressed syllables are visible in several Romance languages as well. Within the Ibero- Romance domain, more complex syllable structures and vowel reduction processes are found in the periphery, that is, in Catalan and Portuguese. Similarly, northern Italian and Raeto-Romance dialects have experienced apocope and/or syncope of unstressed vowels, yielding marked syllable structures in terms of both constituent complexity and sonority sequencing.
Klaus Beyer and Henning Schreiber
The Social Network Analysis approach (SNA), also known as sociometrics or actor-network analysis, investigates social structure on the basis of empirically recorded social ties between actors. It thereby aims to explain e.g. the processes of flow of information, spreading of innovations, or even pathogens throughout the network by actor roles and their relative positions in the network based on quantitative and qualitative analyses. While the approach has a strong mathematical and statistical component, the identification of pertinent social ties also requires a strong ethnographic background. With regard to social categorization, SNA is well suited as a bootstrapping technique for highly dynamic communities and under-documented contexts. Currently, SNA is widely applied in various academic fields. For sociolinguists, it offers a framework for explaining the patterning of linguistic variation and mechanisms of language change in a given speech community.
The social tie perspective developed around 1940, in the field of sociology and social anthropology based on the ideas of Simmel, and was applied later in fields such as innovation theory. In sociolinguistics, it is strongly connected to the seminal work of Lesley and James Milroy and their Belfast studies (1978, 1985). These authors demonstrate that synchronic speaker variation is not only governed by broad societal categories but is also a function of communicative interaction between speakers. They argue that the high level of resistance against linguistic change in the studied community is a result of strong and multiplex ties between the actors. Their approach has been followed by various authors, including Gal, Lippi-Green, and Labov, and discussed for a variety of settings; most of them, however, are located in the Western world.
The methodological advantages could make SNA the preferred framework for variation studies in Africa due to the prevailing dynamic multilingual conditions, often on the backdrop of less standardized languages. However, rather few studies using SNA as a framework have yet been conducted. This is possibly due to the quite demanding methodological requirements, the overall effort, and the often highly complex linguistic backgrounds. A further potential obstacle is the pace of theoretical development in SNA. Since its introduction to sociolinguistics, various new measures and statistical techniques have been developed by the fast growing SNA community. Receiving this vast amount of recent literature and testing new concepts is likewise a challenge for the application of SNA in sociolinguistics.
Nevertheless, the overall methodological effort of SNA has been much reduced by the advancements in recording technology, data processing, and the introduction of SNA software (UCINET) and packages for network statistics in R (‘sna’). In the field of African sociolinguistics, a more recent version of SNA has been implemented in a study on contact-induced variation and change in Pana and Samo, two speech communities in the Northwest of Burkina Faso. Moreover, further enhanced applications are on the way for Senegal and Cameroon, and even more applications in the field of African languages are to be expected.
Spanish in Contact with South-American Languages, with Special Emphasis on Andean and Paraguayan Spanish
The effect of indigenous languages of South America on Spanish is strongest in the lexicon (especially with toponyms, zoonyms, and phytonyms) and identifiable, but much more modest, in phonetics/phonology (e.g., vowel variability and reduction and nasalization) and morphosyntax (e.g., the different use of selected verb forms and constituent order). The phenomena called Media Lengua and Yopará differ from this picture in that the former roughly consists of a Spanish lexicon combined with Quechua grammar, while the latter is a fluid Guaraní-based system with numerous borrowings from Spanish. The effects of contact are socially and areally variable, with low-prestige, typically rural, varieties of South American Spanish showing the most significant systemic impact, while high-prestige, typically urban, varieties (including the national standards) show little more than lexical borrowings in the semantic fields mentioned. This result is hardly surprising, due to historical/sociolinguistic factors (which often led to situations of dominance and language shift) and to the typological dissimilarities between Spanish and the indigenous languages (which typically hinders borrowing, especially of morphological elements).
Erich R. Round
The non–Pama-Nyugan, Tangkic languages were spoken until recently in the southern Gulf of Carpentaria, Australia. The most extensively documented are Lardil, Kayardild, and Yukulta. Their phonology is notable for its opaque, word-final deletion rules and extensive word-internal sandhi processes. The morphology contains complex relationships between sets of forms and sets of functions, due in part to major historical refunctionalizations, which have converted case markers into markers of tense and complementization and verbal suffixes into case markers. Syntactic constituency is often marked by inflectional concord, resulting frequently in affix stacking. Yukulta in particular possesses a rich set of inflection-marking possibilities for core arguments, including detransitivized configurations and an inverse system. These relate in interesting ways historically to argument marking in Lardil and Kayardild. Subordinate clauses are marked for tense across most constituents other than the subject, and such tense marking is also found in main clauses in Lardil and Kayardild, which have lost the agreement and tense-marking second-position clitic of Yukulta. Under specific conditions of co-reference between matrix and subordinate arguments, and under certain discourse conditions, clauses may be marked, on all or almost all words, by complementization markers, in addition to inflection for case and tense.
Cécile B. Vigouroux
Despite their large demographic size, intra-continental African migrations have hardly been taken into account in the theorizing on migration in transnational studies and related fields. Research questions have been framed predominantly from a South-to-North perspective on population movements. This may be a consequence of the fact that the extent and complexity of modern population movements and contacts within Africa are hard to assess, owing mainly to lack of reliable data. For sociolinguists the challenge is even greater, partly because of the spotty knowledge of linguistic diversity in the continent and the scarcity of adequate sociolinguistic descriptions of the ways in which Africans manage their language repertoires. Despite these limitations, a sociolinguistics of intra-continental African migrations will contribute significantly to a better understanding of the conditions, nature, and periodicity of population contacts and interactional dynamics. It will help explain why geographic mobility entails reshaping sociocultural practices, including the language repertoires of both the migrants and the people they come in contact with. Moreover, the peculiarity of African economies, which rely heavily on informal non-institutionalized practices, prompts a rethinking of assumptions regarding the acquisition of the host country’s language(s) as the primary facilitator of the migrants’ socioeconomic inclusion. A sociolinguistic understanding of migrations within Africa can help to formulate new questions and enrich the complex pictures that the study of other parts of the world has already shaped.
“Tupian” is a common term applied by linguists to a linguistic stock of seven families spread across great parts of South America. Tupian languages share a large number of structural and morphological similarities which make genetic relationship very probable. Four families (Arikém, Mondé, Tuparí, and Raramarama-Poruborá) are still limited to the Madeira-Guaporé region in Brazil, considered by some scholars to be the Tupí homeland. Other families and branches would have migrated, in ancient times, down the Amazon (Mundurukú, Mawé) and up the Xingú River (Juruna, Awetí). Only the Tupí-Guarani branch, which makes up about 40 living languages, mainly spread to the south.
Two Tupí-Guaraní languages played an important part in the Portuguese and Spanish colonisation of South America, Tupinambá on the Brazilian coast and Guaraní in colonial Paraguay. In the early 21st century, Guaraní is spoken by more than six million non-Indian people in Paraguay and in adjacent parts of Argentina and Brazil.
Tupí-Guaraní (TG) is an artificial term used by linguists to denominate the family composed by eight subgroups of languages, one of them being the Guaraní subgroup and the other one the extinct Tupinambá and its varieties.
Important phonological characteristics of Tupian languages are nasality and the occurrence of a high central vowel /ɨ/, a glottal stop /ʔ/, and final consonants, especially plosives in coda position. Nasality seems to be a common characteristic of all branches of the family. Most of them show phenomena such as nasal harmony, also called nasal assimilation or regressive nasalization by some scholars.
Tupian languages have a rich morphology expressed mainly by suffixes and prefixes, though particles are also important to express grammatical categories. Verbal morphology is characterized by generally rich devices of valence-changing formations. Relational inflection is one of the most striking phenomena of TG nominal phrases. It allows marking the determination of a noun by a preceding adjunct, its syntactical transformation into a nominal predicate, or the absence of any relation. Relational inflection partly occurs also in other branches and families than Tupí-Guaraní. Verbal person marking is realized by prefixing in most languages; some languages of the Tuparí and Juruna family, however, use only free pronouns.
Tupian syntax is based on the predication of both verbs and nouns. Subordinate clauses, such as relative clauses, are produced by nominalization, while adverbial clauses are formed by specific particles or postpositions on the predicate. Traditional word order is SOV.
Edgar W. Schneider
English clearly is the world’s most widely used language in the early 21st century: the language of formal and other interactions in very many countries, the main tool of globalization, and the default choice for transnational communication. Initially, the expansion of the British Empire, beginning in the 17th century and driven by various motives for colonization, brought it to all continents: North America and the Caribbean, the southern hemisphere (including Australia, New Zealand, South Africa, and other territories), and also Asia, Africa, and the Pacific region. In contact with indigenous languages new, increasingly stable and localized varieties of English with properties and functions of their own have grown in many countries. These varieties have come to be summarily labeled as “World Englishes,” and a new subdiscipline in linguistics has emerged since the 1980s investigating their features and conditions of use. They have conventionally been classified according to their status in specific countries and territories, as native, second, or foreign languages, respectively, and several theoretical models have been proposed to account for their status, developments, and mutual relationships. Vibrant changes of the recent past, broadly associated with a sociolinguistics of globalization and increasing superdiversity, have continued to push the dissemination of English to new contexts, both socially and individually, and a “post-varieties approach” is now being envisaged.
A wide range of facts and issues can be discussed and investigated when addressing World Englishes. The basic perspective, obviously, concerns the sociohistorical diffusion of the language: Who brought English to which territories, when, and why? And how has the language been transformed in different places? It has been argued convincingly (in the “Dynamic Model” of the evolution of Postcolonial Englishes) that despite all geographical, historical, and social differences, amazing similarities in the emergence of these new varieties, grounded in principles of sociolinguistic accommodation and identity transformations, can be identified. In all contexts and territories, contact with local and other languages has been determinative, usually via the process of second-language acquisition of English by indigenous people. Language policies and their implementation by means of strategies of language pedagogy have played a major role, and all of this is shaped decisively by linguistic attitudes—the question of what speakers and authorities believe about such emerging varieties and their relationship to norms of correctness. Also, specific structural patterns and types of linguistic phenomena can be observed in all these varieties on all levels of language organization.
Consequently, the notion of “English” today needs to be retuned from thinking of it as a single, monolithic entity, a linguistic “standard” and a reference system, to understanding it as a set of related, structurally overlapping, but also distinct varieties, the products of a fundamental “glocalization” process with variable, context-dependent outcomes.
Rosemary G. Beam de Azcona
Zapotecan languages belong to the Otomanguean stock and consist of two major subgroups: Zapotec and Chatino. They are primarily spoken in Oaxaca, Mexico, and elsewhere in diaspora, particularly in California but also in other parts of the United States and Mexico. Zapotecan languages are spoken in a contiguous area and although all are related genetically, many languages exist in regional diffusion zones such that new changes spread areally. Similarly, individual Zapotecan “languages” often consist of dialect continua.
Zapotecan languages are tonal and also have contrastive phonation types, such as a contrast between modal (V), checked (VɁ), and rearticulated (VɁV) vowels. Some Valley Zapotec languages also have breathy voice, partially due to contact with Mixe. Vowel nasalization is a prominent feature of Chatino and a marginal feature of some Zapotec languages. Consonants usually fall into two contrastive series in Zapotec, commonly termed “fortis” and “lenis,” though the phonetic realizations of these vary from language to language. The historical loss of unstressed vowels is common in many Zapotecan languages, though there are individual Zapotec and Chatino languages that retain them. A stress shift from the final syllable (retained in Chatino) to the first (usually penultimate) syllable of the root (in Zapotec) makes the languages with vowel loss more dissimilar from each other, since a different syllable survives in Chatino versus Zapotec.
Zapotecan languages are head-initial languages with VSO order and are typically head-marking. Common morphology includes pre-posed TAM markers and post-posed person markers on the verb, and derivational prefixes on nouns. An emergent class of prepositions is developing out of what were historically relational nouns. Stative forms of verbs are more common than true adjectives, while numerals have many verbal properties.
Like other Meso-American languages, Zapotecan languages are currently experiencing both a golden age and a moment of unprecedented peril. There is an ever-increasing number of linguists who are native speakers of these languages, and the community of language activists, including students and educators, is growing stronger in Oaxaca and Mexico at large, and indeed worldwide, including where Zapotecan languages are spoken by immigrants. At the same time, the intense political and socioeconomic pressure on communities to shift to Spanish is greater than ever before, and the number of communities where children speak Zapotecan languages is ever shrinking. Children in communities where children speak Zapotecan languages at home are often chastised for doing so in school, which poses a continual threat. Zapotecan languages historically have been in contact with other Meso-American languages such as Nahuatl, Mixtec, Chontal, Mixe, Huave and Chinantec, among others. Today the vast majority of speakers of Zapotecan languages are at least bilingual in their language and Spanish, and many also speak English and/or other Meso-American languages. Zapotecan languages mostly show lexical borrowings from these other languages, and occasional grammatical borrowings. Regional varieties of Spanish show a Zapotecan substrate with numerous calques and interference on every level of the language from phonetics to pragmatics.