Occitan, a language of high medieval literary culture, historically occupies the southern third of France. Today it is dialectalized and highly endangered, like all the regional languages of France. Its main linguistic regions are Languedocien, Provençal, Limousin, Auvergnat, Vivaro-dauphinois (Alpine Provençal) and, linguistically on the fringes of the domain, Gascon. Despite its dialectalization, its typological unity and the profound difference that separates it from northern Galloroman (Oïl dialects, Francoprovençal) and Gallo-Italian remain clearly perceptible. Its history is characterised by several ruptures (the Crusade against the Albigensians, the French Revolution) and several attempts at "rebirth" (the Baroque period, the Felibrige movement in the second half of the 19th century, the Occitanist movement of the 20th century). Towards the end of the Middle Ages, the Occitan koinè, a literary and administrative language integrating the main dialectal characteristics of all regions, was lost and replaced by makeshift regional spellings based on French spelling. The modern Occitanist orthography tries to overcome these divisions by coming as close as possible to the medieval, "classical" written tradition, while respecting the main regional characteristics. As a bridge language between northern Galloroman (Oïl varieties and Francoprovençal), Italy and Iberoromania, Occitan is relatively conservative in its phonetic evolution from the popular spoken Latin of western Romania and in its morphology and syntax (absence of subject clitics in the verbal system, conservation of a fully functional simple past tense). Only Gascon, which was already considered a specific language in the Middle Ages, presents particular structures that make it unique among Romance languages (development of a system of enunciative particles).
Article
Southern Gallo-Romance: Occitan and Gascon
Andres M. Kristol
Article
Danish
Eva Skafte Jensen
Danish is a North Germanic language, spoken by approximately 6 million people. Genealogically, it is related to the other Germanic languages, in particular the other North Germanic languages (Swedish, Norwegian, Icelandic, Faroese), but also, for example, German, Dutch, and English; typologically, Modern Danish is closer to Norwegian and Swedish than to any other language.
Historically deriving from Proto-Germanic, Danish morphology once had three grammatical genders (the masculine, the feminine, and the neuter) and case inflection (nominative, accusative, dative, and genitive) in all nominal words; it also had inflection for mood, tense, number, and person in the verbal conjugations. In Modern Standard Danish, much of the traditional nominal and verbal inflection has disappeared. Instead, other kinds of morphosyntactic constructions and structures have emerged. Middle Danish and Modern Danish are typologically very different languages. One of the structural innovations linked to the typological change is that a syntactic subject has become obligatory in Danish sentences. Correlated with this, Danish has developed expletive constructions with det ‘it’ and der ‘there’. Another important point differentiating Middle Danish from Modern Danish concerns agreement. Traditional Indo-European agreement (verbal as well as nominal) has receded in favor of more fixed word order, both on the sentence level and internally within phrases. As part of this, Modern Danish has developed a set of definite and indefinite articles. The traditional three genders are reduced to two (common and neuter) and have developed new syntactic-semantic functions alongside the traditional lexically distributed functions. In the verbal system, Danish makes use of two different kinds of passive voice (a periphrastic and an inflected one), which carry different meanings, and also of two different auxiliaries in perfective constructions, that is, have ‘have’ and være ‘be’, the latter doubling as an auxiliary in periphrastic passive constructions. Perfective constructions are made up of an auxiliary and the supine form of the main verb. Danish is a V2 language with a relatively fixed word order, often depicted in the form of the so-called sentence frame, a topological model designed specifically for Danish.
Like most other Germanic languages, Danish has a rich set of modal particles.
Danish shares all these morphosyntactic features with Swedish and Norwegian, but their distribution is not completely identical in the three languages, which makes the Mainland Scandinavian languages an interesting object of study for the typologically interested linguist. Exclusive to Danish is the so-called stød, a suprasegmental prosodic feature used distinctively.
Modern Danish is strongly standardized, with little of the traditional dialectal variation left. Since the end of the 20th century, new sociolects, that is, multi-ethnolects, have emerged in the larger cities. The new multi-ethnolects are based on a substrate of Danish with lexical features from the languages of Central Asia, the Middle East, and Africa. In addition to the lexical innovations, the multi-ethnolects are characterized by intonation patterns different from those of Standard Danish, and they have morphosyntactic features different from Standard Danish, for example, in word order and in the use of gender.
Article
Swedish
Erik M. Petzell
Swedish is a V2 language, like all Germanic languages except English, with a basic VO word order and a suffixed definite article, like all North Germanic languages. Swedish is the largest of the North Germanic languages, and the official language of both Sweden and Finland, in the latter case alongside the majority language Finnish. Worldwide, there are about 10.5 million first-language (L1) speakers. The number of L2 Swedish speakers is unclear: in Sweden and Finland alone, there are at least 3 million. Genealogically, Swedish is closest to Danish. Together, they formed the eastern branch of North Germanic during the Viking age. Today, this old unity is often obscured by later developments. Typologically, in the early 21st century, Swedish is closer to Norwegian than to Danish.
In the late 19th and early 20th centuries, there was great dialectal variation across the Swedish-speaking area. Very few of the traditional dialects have survived into the present, however. In the early 21st century, there are only some isolated areas where spoken standard Swedish has not completely taken over, for example, northwestern Dalecarlia. Spoken standard Swedish is quite close to the written language. This written-like speech was promoted by primary school teachers from the late 19th century onward. In the 21st century, it comes in various regional guises, which differ from each other prosodically and display some allophonic variation, for example, in the realization of /r/.
During the late Middle Ages, Swedish was in close contact with Middle Low German. This had a massive impact on the lexicon, leading to loans in both the open and closed classes and even import of derivational morphology. Structurally, Swedish lost case and verbal agreement morphology, developed mandatory expletive subjects, and changed its word order in subordinate clauses. Swedish shares much of this development with Danish and Norwegian.
In the course of the early modern era, Swedish and Norwegian converged further, developing very similar phonological systems. The more conspicuous of the shared traits include two different rounded high front vowels, front /y/ and front-central /ʉ/, palatalization of initial /k/ and /g/ before front vowels, and a preserved phonemic tonal distinction.
As for morphosyntax, however, Swedish has sometimes gone its own way, distancing itself from both Norwegian and Danish. For instance, Swedish has a distinct non-agreeing active participle (supine), and it makes use of the morphological s-passive in a wider variety of contexts than Danish and Norwegian. Moreover, verbal particles always precede even light objects in Swedish, for example, ta upp den, literally ‘take up it’, while Danish and Norwegian pattern with, for example, English: tag den op/ta den opp, literally ‘take it up’. Furthermore, finite forms of the auxiliary ha ‘have’ may be omitted in subordinate clauses in Swedish but never in Danish or Norwegian.
Article
Yiddish
Lea Schäfer
The Yiddish language is directly linked to the culture and destiny of the Jewish population of Central and Eastern Europe. It originated as the everyday language of the Jewish population in the German-speaking lands around the Middle Ages and underwent a series of developments until the Shoah, which took a particularly large toll on the Yiddish-speaking Eastern European Jewish population. Today, Yiddish is spoken as a mother tongue almost exclusively in ultra-Orthodox communities, where it is now exposed to entirely new influences and is, thus, far from being a dead language.
After an introductory sketch, information on the geographical distribution and number of speakers as well as key historical developments are briefly summarized. Particularly important are the descriptions of the various sociolinguistic situations and the source situation. This is followed by a description of various (failed) attempts at standardization, as well as the geographical distribution and surveys of the dialects. The following section describes the status of Yiddish in the early 21st century, which overlaps with the sociolinguistic situation of Orthodox Yiddish. Finally, the linguistic features of modern Eastern Yiddish (dialects, standard, and Orthodox) are presented. In this context, linguistic levels and structures in which Yiddish differs from other (standard) Germanic languages are also discussed. Since Yiddish, as a language derived from Middle High German, is particularly close to German varieties, the differences and similarities between the two languages are particularly emphasized.
Article
English in the U.S. South
Kirk Hazen
English in the U.S. South contains a wide range of variation, encompassing ethnic, social class, and subregional variations all within the umbrella term of Southern English. Although it has been a socially distinct variety since at least the mid-19th century, many of the modern features it is nationally known for developed only after 1875. Lexical variation has long distinguished the U.S. South, but new vocabulary has replaced the old, and subregional variation in the U.S. South is no longer important for lexical variation. Social class still plays an important role in grammatical variation, but the rise of compulsory education narrowed the previously wider range of dialect features. Despite traditional scholarship’s primary focus on lexical and grammatical language variation in the U.S. South, phonological variation has been the main area of scholarship since the 1990s. Within phonological variation, the production of vowels, the most socially salient features of the U.S. South, has been heavily studied. Prosodic, consonant, and perception studies have been on the rise and have provided numerous insights into this highly diverse dialect region.
Article
Catalan
Francisco Ordóñez
Catalan is a “medium-sized” Romance language spoken by over 10 million speakers, spread over four nation states: northeastern Spain, Andorra, southern France, and Italy (the city of l’Alguer, or Alghero, in Sardinia). Catalan is divided into two primary dialectal divisions, each with further subvarieties: Western Catalan (Western Catalonia, Eastern Aragon, and Valencian Community) and Eastern Catalan (center and east of Catalonia, Balearic Islands, Rosselló, and l’Alguer).
Catalan descends from Vulgar Latin. Catalan expanded during medieval times as one of the primary vernacular languages of the Kingdom of Aragon. It largely retained its role in government and society until the end of the War of the Spanish Succession in 1714, and since then it has been minoritized. Catalan was finally standardized at the beginning of the 20th century, although later, during the Franco dictatorship, it was banned in public spaces. The situation changed with the new Spanish Constitution promulgated in 1978, when Catalan was declared co-official with Spanish in Catalonia, the Valencian Community, and the Balearic Islands.
The Latin vowel system evolved in Catalan into a system of seven stressed vowels. As in most other Iberian Romance languages, there is a general process of spirantization or lenition of voiced stops. Catalan has a two-gender grammatical system and, as in other Western Romance languages, plurals end in -s; Catalan has a personal article and Balearic Catalan has a two-determiner system for common nouns. Finally, past perfective actions are indicated by a compound tense consisting of the auxiliary verb anar ‘to go’ in present tense plus the infinitive.
Catalan is a minoritized language everywhere it is spoken, except in the microstate of Andorra, and it is endangered in France and l’Alguer. The revival of Catalan in the post-dictatorship era is connected with a movement called linguistic normalization. The idea of normalization refers to the aim of returning Catalan to “normal” use at both the official and the everyday level, like any official language.
Article
Discriminative Learning and the Lexicon: NDL and LDL
Yu-Ying Chuang and R. Harald Baayen
Naive discriminative learning (NDL) and linear discriminative learning (LDL) are simple computational algorithms for lexical learning and lexical processing. Both NDL and LDL assume that learning is discriminative, driven by prediction error, and that it is this error that calibrates the association strength between input and output representations. Both words’ forms and their meanings are represented by numeric vectors, and mappings between forms and meanings are set up. For comprehension, form vectors predict meaning vectors. For production, meaning vectors map onto form vectors. These mappings can be learned incrementally, approximating how children learn the words of their language. Alternatively, optimal mappings representing the end state of learning can be estimated. The NDL and LDL algorithms are incorporated in a computational theory of the mental lexicon, the ‘discriminative lexicon’. The model shows good performance both with respect to production and comprehension accuracy, and for predicting aspects of lexical processing, including morphological processing, across a wide range of experiments. Since, mathematically, NDL and LDL implement multivariate multiple regression, the ‘discriminative lexicon’ provides a cognitively motivated statistical modeling approach to lexical processing.
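The incremental, error-driven route described above can be illustrated with a small sketch. It is not the authors' implementation: the four-word lexicon, the letter-bigram cues, the semantic features CAT, DOG, and PLURAL, the learning rate, and all function names are assumptions chosen for illustration, and the binary coding of cues and outcomes is a simplification. Each learning event adjusts the cue-to-outcome weights in proportion to the prediction error (the delta rule), so that form vectors come to predict meaning vectors, which corresponds to the comprehension direction.

```python
# Toy lexicon (hypothetical): word forms paired with semantic features.
LEXICON = {
    "cat": {"CAT"},
    "cats": {"CAT", "PLURAL"},
    "dog": {"DOG"},
    "dogs": {"DOG", "PLURAL"},
}


def bigram_cues(word):
    """Split a word into letter bigrams, with '#' marking the word edges."""
    padded = f"#{word}#"
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


CUES = sorted(set().union(*(bigram_cues(w) for w in LEXICON)))
OUTCOMES = sorted(set().union(*LEXICON.values()))


def form_vector(word):
    """Binary form vector: 1.0 for each bigram cue present in the word."""
    cues = bigram_cues(word)
    return [1.0 if c in cues else 0.0 for c in CUES]


def meaning_vector(word):
    """Binary meaning vector: 1.0 for each semantic feature of the word."""
    return [1.0 if o in LEXICON[word] else 0.0 for o in OUTCOMES]


def predict(weights, form):
    """Linear mapping from a form vector to predicted outcome activations."""
    return [sum(f * row[j] for f, row in zip(form, weights))
            for j in range(len(OUTCOMES))]


def learn(weights, form, target, rate=0.1):
    """One learning event: move weights along the prediction error."""
    error = [t - p for t, p in zip(target, predict(weights, form))]
    for f, row in zip(form, weights):
        for j, e in enumerate(error):
            row[j] += rate * f * e


def train(epochs=200, rate=0.1):
    """Start from zero weights and cycle through the lexicon repeatedly."""
    weights = [[0.0] * len(OUTCOMES) for _ in CUES]
    for _ in range(epochs):
        for word in LEXICON:
            learn(weights, form_vector(word), meaning_vector(word), rate)
    return weights


def comprehend(weights, word):
    """Return the activation of each semantic feature for a word form."""
    return dict(zip(OUTCOMES, predict(weights, form_vector(word))))


if __name__ == "__main__":
    trained = train()
    for word in LEXICON:
        print(word, comprehend(trained, word))
```

Because the mapping is linear, the end state of such learning can also be estimated directly by solving the corresponding regression problem instead of iterating over learning events; the loop above only illustrates the incremental route.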
Article
Judeo-Spanish (Judezmo, Ladino)
David M. Bunis
The Ibero-Romance-speaking Jews of medieval Christian Iberia were linguistically distinct from their non-Jewish neighbors primarily as a result of their language’s unique Hebrew-Aramaic component; preservations from older Jewish Greek, Latin, and Arabic; a tradition of translating sacred Hebrew and Aramaic texts into their language using archaisms and Hebrew-Aramaic rather than Hispanic syntax; and their Hebrew-letter writing system. With the expulsions from Iberia in the late 15th century, most of the Sephardim who continued to maintain their Iberian-origin language resettled in the Ottoman Empire, with smaller numbers in North Africa and Italy. Their forced migration, and perhaps a conscious choice, essentially disconnected the Sephardim from the Spanish language as it developed in Iberia and Latin America, causing their language (which they came to call laðino ‘Romance’, ʤuðezmo or ʤuðjó ‘Jewish, Judezmo’, and more recently (ʤudeo)espaɲol ‘Judeo-Spanish’) to appear archaic when compared with modern Spanish. In their new locales the Sephardim developed the Hispanic component of their language along independent lines, resulting in further differentiation from Spanish. Divergence was intensified through borrowing from contact languages of the Ottoman Empire such as Turkish, Greek, and South Slavic. Especially from the late 18th century, factors such as the colonializing interests of France, Italy, and Austria-Hungary in the region led to considerable influence of their languages on Judezmo. In the 20th century, the dismemberment of the Ottoman and Austro-Hungarian empires and their replacement by highly nationalistic states resulted in a massive language shift to the local languages; that factor, followed by large speech-population losses during World War II and immigration to countries stressing linguistic homogeneity, has in recent years made Judezmo an endangered language.
Article
The Phonology of Compounds
Irene Vogel
A number of recent developments in phonological theory, beginning with The Sound Pattern of English, are particularly relevant to the phonology of compounds. They address both the phonological phenomena that apply to compound words and the phonological structures that are required as the domains of these phenomena: segmental and nonsegmental phenomena that operate within each member of a compound separately, as well as at the juncture between the members of compounds and throughout compounds as a whole. In all cases, what is crucial for the operation of the phonological phenomena of compounds is phonological structure, in terms of constituents of the Prosodic Hierarchy, as opposed to morphosyntactic structure. Specifically, only two phonological constituents are required: the Phonological Word, which provides the domain for phenomena that apply to the individual members of compounds and at their junctures, and a larger constituent that groups the members of compounds together. The nature of the latter is somewhat controversial, the main issue being whether or not there is a constituent in the Prosodic Hierarchy between the Phonological Word and the Phonological Phrase. When present, this constituent, the Composite Group (revised from the original Clitic Group), includes the members of compounds, as well as “stray” elements such as clitics and “Level 2” affixes. In its absence, compounds, and often the same “stray” elements, are analyzed as a type of Recursive Phonological Word, although crucially, the combinations of such elements do not exhibit the same properties as the basic Phonological Word.
Article
Typological Diversity Within the Romance Languages
Davide Ricca
The Romance languages, despite their overall similarity, display interesting internal diversity which can be captured only very partially by looking at the six major standard languages, as typological databases often do. This diversity spans all levels of linguistic analysis, from phonology to morphology and syntax. Rather than making a long list of features, with no space to go much beyond their mere mention, the article focuses on just four main areas in a little more detail, trying to develop, if minimally, a discussion of their theoretical and methodological import.
The comparison with the worldwide typological background provided by WALS Online shows that the differences within Romance may reach the level of general typological relevance. While this is probably not the case in their rather mainstream segmental phonology, it surely holds for nominal pluralization and the syntax of negation, both areas where the Romance languages have often distanced themselves quite significantly from their common ancestor, Latin. The morphological marking of nominal plural displays four values out of the seven recorded in WALS, adding a further one unattested there, namely subtraction; the negation strategies, although uniformly particle-like, cover all five values found in WALS concerning linear order. Finally, Romance languages raise several intriguing issues related to head-marking and dependent-marking constructions, again innovating against the substantially dependent-marking uniformity characteristic of Latin.