Corpus Phonology is an approach to phonology that places corpora at the center of phonological research. Some practitioners of corpus phonology see corpora as the only object of investigation; others use corpora alongside other available techniques (for instance, intuitions, psycholinguistic and neurolinguistic experimentation, laboratory phonology, the study of the acquisition of phonology or of language pathology, etc.). Whatever version of corpus phonology one advocates, corpora have become part and parcel of the modern research environment, and their construction and exploitation has been modified by the multidisciplinary advances made within various fields. Indeed, for the study of spoken usage, the term ‘corpus’ should nowadays only be applied to bodies of data meeting certain technical requirements, even if corpora of spoken usage are by no means new and coincide with the birth of recording techniques. It is therefore essential to understand what criteria must be met by a modern corpus (quality of recordings, diversity of speech situations, ethical guidelines, time-alignment with transcriptions and annotations, etc.) and what tools are available to researchers. Once these requirements are met, the way is open to varying and possibly conflicting uses of spoken corpora by phonological practitioners. A traditional stance in theoretical phonology sees the data as a degenerate version of a more abstract underlying system, but more and more researchers within various frameworks (e.g., usage-based approaches, exemplar models, stochastic Optimality Theory, sociophonetics) are constructing models that tightly bind phonological competence to language use, rely heavily on quantitative information, and attempt to account for intra-speaker and inter-speaker variation. This renders corpora essential to phonological research and not a mere adjunct to the phonological description of the languages of the world.
Dalmatian is an extinct group of Romance varieties spoken on the eastern Adriatic seaboard, best known from its Vegliote variety, spoken on the island of Krk (also called Veglia). Vegliote is principally represented by the linguistic testimony of its last speaker, Tuone Udaina, who died at the end of the 19th century. By the time Udaina’s Vegliote could be explored by linguists (principally by Matteo Bartoli), it seems that he had no longer actively spoken the language for decades, and his linguistic testimony is imperfect, in that it is influenced for example by the Venetan dialect that he habitually spoke. Nonetheless, his Vegliote reveals various distinctive and recurrent linguistic traits, notably in the domain of phonology (for example, pervasive and complex patterns of vowel diphthongization) and morphology (notably a general collapse of the general Romance inflexional system of tense and mood morphology, but also an unusual type of synthetic future form).
Eva Skafte Jensen
Danish is a North Germanic language, spoken by approximately 6 million people. Genealogically, it is related to the other Germanic languages, in particular the other North Germanic languages (Swedish, Norwegian, Icelandic, Faroese), but also, for example, German, Dutch, and English; typologically, Modern Danish is closer to Norwegian and Swedish than to any other language. Historically deriving from Proto-Germanic, Danish morphology once had three grammatical genders (the masculine, the feminine, and the neuter) and case inflection (nominative, accusative, dative, and genitive) in all nominal words; it also had inflection for mood, tense, number, and person in the verbal conjugations. In Modern Standard Danish, much of the traditional nominal and verbal inflection has disappeared. Instead, other kinds of morphosyntactic constructions and structures have emerged. Middle Danish and Modern Danish are typologically very different languages. One of the structural innovations linked to the typological change is that a syntactic subject becomes obligatory in Danish sentences. Correlated to this, Danish develops expletive constructions with det ‘it’ and der ‘there’. Another important point differentiating Middle Danish from Modern Danish concerns agreement. Traditional Indo-European agreement (verbal as well as nominal) has receded in favor of more fixed word order, both on the sentence level and internally within phrases. As part of this, Modern Danish has developed a set of definite and indefinite articles. The traditional three genders are reduced to two (common and neuter) and have developed new syntactic-semantic functions alongside the traditional lexically distributed functions. In the verbal systems, Danish makes use of two different kinds of passive voice (a periphrastic and an inflected one), which carry different meanings, and also of two different auxiliaries in perfective constructions, that is, have ‘have’ and være ‘be’, the latter doubling as an auxiliary in periphrastic passive constructions. Perfective constructions are made up by an auxiliary and the supine form of the main verb. Danish is a V2-language with a relatively fixed word order, often depicted in the form of the so-called sentence frame, a topological model designed specifically for Danish. Like most other Germanic languages, Danish has a rich set of modal particles. All these morphosyntactic features, Danish shares with Swedish and Norwegian, but the distribution is not completely identical in the three languages, something that makes the Mainland Scandinavian languages an interesting study object to the typologically interested linguist. Exclusive for Danish is the so-called stød, a suprasegmental prosodic feature, used as a distinctive feature. Modern Danish is strongly standardized with only little of the traditional dialectal variation left. From the end of the 20th century, in the larger cities, new sociolects have emerged, that is, multi-ethnolects. The new multi-ethnolects are based on a substrate of Danish with lexical features from the languages of Central Asia, the Middle East, and Africa. In addition to the lexical innovations, the multi-ethnolects are characteristic in intonation patterns different from Standard Danish, and they have morphosyntactic features different from Standard Danish, for example, in word order and in the use of gender.
Defectiveness in Morphology
Morphological defectiveness refers to situations where one or more paradigmatic forms of a lexeme are not realized, without plausible syntactic, semantic, or phonological causes. The phenomenon tends to be associated with low-frequency lexemes and loanwords. Typically, defectiveness is gradient, lexeme-specific, and sensitive to the internal structure of paradigms. The existence of defectiveness is a challenge to acquisition models and morphological theories where there are elsewhere operations to materialize items. For this reason, defectiveness has become a rich field of research in recent years, with distinct approaches that view it as an item-specific idiosyncrasy, as an epiphenomenal result of rule competition, or as a normal morphological alternation within a paradigmatic space.
Derivational morphology is a type of word formation that creates new lexemes, either by changing syntactic category or by adding substantial new meaning (or both) to a free or bound base. Derivation may be contrasted with inflection on the one hand or with compounding on the other. The distinctions between derivation and inflection and between derivation and compounding, however, are not always clear-cut. New words may be derived by a variety of formal means including affixation, reduplication, internal modification of various sorts, subtraction, and conversion. Affixation is best attested cross-linguistically, especially prefixation and suffixation. Reduplication is also widely found, with various internal changes like ablaut and root and pattern derivation less common. Derived words may fit into a number of semantic categories. For nouns, event and result, personal and participant, collective and abstract noun are frequent. For verbs, causative and applicative categories are well-attested, as are relational and qualitative derivations for adjectives. Languages frequently also have ways of deriving negatives, relational words, and evaluatives. Most languages have derivation of some sort, although there are languages that rely more heavily on compounding than on derivation to build their lexical stock. A number of topics have dominated the theoretical literature on derivation, including productivity (the extent to which new words can be created with a given affix or morphological process), the principles that determine the ordering of affixes, and the place of derivational morphology with respect to other components of the grammar. The study of derivation has also been important in a number of psycholinguistic debates concerning the perception and production of language.
Direct Perception of Speech
Carol A. Fowler
The theory of speech perception as direct derives from a general direct-realist account of perception. A realist stance on perception is that perceiving enables occupants of an ecological niche to know its component layouts, objects, animals, and events. “Direct” perception means that perceivers are in unmediated contact with their niche (mediated neither by internally generated representations of the environment nor by inferences made on the basis of fragmentary input to the perceptual systems). Direct perception is possible because energy arrays that have been causally structured by niche components and that are available to perceivers specify (i.e., stand in 1:1 relation to) components of the niche. Typically, perception is multi-modal; that is, perception of the environment depends on specifying information present in, or even spanning, multiple energy arrays. Applied to speech perception, the theory begins with the observation that speech perception involves the same perceptual systems that, in a direct-realist theory, enable direct perception of the environment. Most notably, the auditory system supports speech perception, but also the visual system, and sometimes other perceptual systems. Perception of language forms (consonants, vowels, word forms) can be direct if the forms lawfully cause specifying patterning in the energy arrays available to perceivers. In Articulatory Phonology, the primitive language forms (constituting consonants and vowels) are linguistically significant gestures of the vocal tract, which cause patterning in air and on the face. Descriptions are provided of informational patterning in acoustic and other energy arrays. Evidence is next reviewed that speech perceivers make use of acoustic and cross modal information about the phonetic gestures constituting consonants and vowels to perceive the gestures. Significant problems arise for the viability of a theory of direct perception of speech. One is the “inverse problem,” the difficulty of recovering vocal tract shapes or actions from acoustic input. Two other problems arise because speakers coarticulate when they speak. That is, they temporally overlap production of serially nearby consonants and vowels so that there are no discrete segments in the acoustic signal corresponding to the discrete consonants and vowels that talkers intend to convey (the “segmentation problem”), and there is massive context-sensitivity in acoustic (and optical and other modalities) patterning (the “invariance problem”). The present article suggests solutions to these problems. The article also reviews signatures of a direct mode of speech perception, including that perceivers use cross-modal speech information when it is available and exhibit various indications of perception-production linkages, such as rapid imitation and a disposition to converge in dialect with interlocutors. An underdeveloped domain within the theory concerns the very important role of longer- and shorter-term learning in speech perception. Infants develop language-specific modes of attention to acoustic speech signals (and optical information for speech), and adult listeners attune to novel dialects or foreign accents. Moreover, listeners make use of lexical knowledge and statistical properties of the language in speech perception. Some progress has been made in incorporating infant learning into a theory of direct perception of speech, but much less progress has been made in the other areas.
Discriminative Learning and the Lexicon: NDL and LDL
Yu-Ying Chuang and R. Harald Baayen
Naive discriminative learning (NDL) and linear discriminative learning (LDL) are simple computational algorithms for lexical learning and lexical processing. Both NDL and LDL assume that learning is discriminative, driven by prediction error, and that it is this error that calibrates the association strength between input and output representations. Both words’ forms and their meanings are represented by numeric vectors, and mappings between forms and meanings are set up. For comprehension, form vectors predict meaning vectors. For production, meaning vectors map onto form vectors. These mappings can be learned incrementally, approximating how children learn the words of their language. Alternatively, optimal mappings representing the end state of learning can be estimated. The NDL and LDL algorithms are incorporated in a computational theory of the mental lexicon, the ‘discriminative lexicon’. The model shows good performance both with respect to production and comprehension accuracy, and for predicting aspects of lexical processing, including morphological processing, across a wide range of experiments. Since, mathematically, NDL and LDL implement multivariate multiple regression, the ‘discriminative lexicon’ provides a cognitively motivated statistical modeling approach to lexical processing.
Dispersion Theory and Phonology
Dispersion Theory concerns the constraints that govern contrasts, the phonetic differences that can distinguish words in a language. Specifically it posits that there are distinctiveness constraints that favor contrasts that are more perceptually distinct over less distinct contrasts. The preference for distinct contrasts is hypothesized to follow from a preference to minimize perceptual confusion: In order to recover what a speaker is saying, a listener must identify the words in the utterance. The more confusable words are, the more likely a listener is to make errors. Because contrasts are the minimal permissible differences between words in a language, banning indistinct contrasts reduces the likelihood of misperception. The term ‘dispersion’ refers to the separation of sounds in perceptual space that results from maximizing the perceptual distinctiveness of the contrasts between those sounds, and is adopted from Lindblom’s Theory of Adaptive Dispersion, a theory of phoneme inventories according to which inventories are selected so as to maximize the perceptual differences between phonemes. These proposals follow a long tradition of explaining cross-linguistic tendencies in the phonetic and phonological form of languages in terms of a preference for perceptually distinct contrasts. Flemming proposes that distinctiveness constraints constitute one class of constraints in an Optimality Theoretic model of phonology. In this context, distinctiveness constraints predict several basic phenomena, the first of which is the preference for maximal dispersion in inventories of contrasting sounds that first motivated the development of the Theory of Adaptive Dispersion. But distinctiveness constraints are formulated as constraints on the surface forms of possible words that interact with other phonological constraints, so they evaluate the distinctiveness of contrasts in context. As a result, Dispersion Theory predicts that contrasts can be neutralized or enhanced in particular phonological contexts. This prediction arises because the phonetic realization of sounds depends on their context, so the perceptual differences between contrasting sounds also depend on context. If the realization of a contrast in a particular context would be insufficiently distinct (i.e., it would violate a high-ranked distinctiveness constraint), there are two options: the offending contrast can be neutralized, or it can be modified (‘enhanced’) to make it more distinct. A basic open question regarding Dispersion Theory concerns the proper formulation of distinctiveness constraints and the extent of variation in their rankings across languages, issues that are tied up with the questions about the nature of perceptual distinctiveness. Another concerns the size and nature of the comparison set of contrasting word-forms required to be able to evaluate whether a candidate output satisfies distinctiveness constraints.
The Early History of Romance Palatalizations
Strictly speaking, palatalization is a phonetic process of assimilation which can generate new palatal phonemes. However, in Romance linguistics, the term is traditionally used to describe any evolution (1) of velar stops preceding a front vowel, (2) of the palatal approximant (also known as “yod”) and clusters involving yod. Therefore, not only does “Romance palatalization” involve a segment which is already palatal, [j], but the result is also not always a palatal consonant: Sometimes it is a dental/alveolar (firstly an affricate, then in some cases a fricative). The article proposes a two-phase chronology for the early Romance palatalization, with the first phase affecting /kj/, /tj/ and in some cases /k/ and /g/ before front vowels, while the second phase affects other clusters involving /j/. It also draws a distinction between the varieties which show palatalization of velar consonants before front vowels (western Romània, central Italy) and varieties which do not show it or only show it at a late stage (Sardinia, southern Italy, the Balkans). The Romance data confirm some trends identified in typological literature and in some cases enable more precise descriptions. The consonants most susceptible to palatalization are: regarding the manner of articulation, stops (the most resistant are rhotics); regarding the place of articulation, velars (labials are the most resistant). Geminate segments are also more susceptible to palatalization.
Early Modern English
In the Early Modern English period (1500–1700), steps were taken toward Standard English, and this was also the time when Shakespeare wrote, but these perspectives are only part of the bigger picture. This chapter looks at Early Modern English as a variable and changing language not unlike English today. Standardization is found particularly in spelling, and new vocabulary was created as a result of the spread of English into various professional and occupational specializations. New research using digital corpora, dictionaries, and databases reveals the gradual nature of these processes. Ongoing developments were no less gradual in pronunciation, with processes such as the Great Vowel Shift, or in grammar, where many changes resulted in new means of expression and greater transparency. Word order was also subject to gradual change, becoming more fixed over time.
Geoffrey K. Pullum
English is both the most studied of the world’s languages and the most widely used. It comes closer than any other language to functioning as a world communication medium and is very widely used for governmental purposes. This situation is the result of a number of historical accidents of different magnitudes. The linguistic properties of the language itself would not have motivated its choice (contra the talk of prescriptive usage writers who stress the clarity and logic that they believe English to have). Divided into multiple dialects, English has a phonological system involving remarkably complex consonant clusters and a large inventory of distinct vowel nuclei; a bad, confusing, and hard-to-learn alphabetic orthography riddled with exceptions, ambiguities, and failures of the spelling to correspond to the pronunciation; a morphology that is rather more complex than is generally appreciated, with seven or eight paradigm patterns and a couple of hundred irregular verbs; a large multilayered lexicon containing roots of several quite distinct historical sources; and a syntax that despite its very widespread SVO (Subject-Verb-Object) basic order in the clause is replete with tricky details. For example, there are crucial restrictions on government of prepositions, many verb-preposition idioms, subtle constraints on the intransitive prepositions known as “particles,” an important distinction between two (or under a better analysis, three) classes of verb that actually have different syntax, and a host of restrictions on the use of its crucial “wh-words.” It is only geopolitical and historical accidents that have given English its enormous importance and prestige in the world, not its inherent suitability for its role.
English in the U.S. South
English in the U.S. South contains a wide range of variation, encompassing ethnic, social class, and subregional variations all within the umbrella term of Southern English. Although it has been a socially distinct variety since at least the mid-19th century, many of the modern features it is nationally known for developed only after 1875. Lexical variation has long distinguished the U.S. South, but new vocabulary has replaced the old, and subregional variation in the U.S. South is no longer important for lexical variation. Social class still plays an important role in grammatical variation, but the rise of compulsory education limited previously wider ranges of dialect features. Despite traditional scholarship’s primary focus on lexical and grammatical language variation in the U.S. South, phonological variation has been the main area of scholarship since 1990s. Within phonological variation, the production of vowels, the most socially salient features of the U.S. South, has been a heavily studied realm of scholarship. Prosodic, consonant, and perception studies have been on the rise and have provided numerous insights into this highly diverse dialect region.
Foot Structure in Germanic
Joshua Booth and Aditi Lahiri
A foot is an organizing unit of prosodic structure built on moras and syllables. Prominence falls on the heads of feet, and feet can be right- or left-headed (an iamb or a trochee, respectively). Feet can be constructed from the right or the left edge and lexical stress falls on the head of the leftmost or rightmost foot. The metrical system of a language can thus be defined by (a) the nature of the foot (trochee/iamb), (b) the direction of parsing, and (c) the foot that carries main stress. Each prosodic word minimally comprises a stressed foot. Germanic languages are traditionally divided into three branches, East Germanic (Gothic), West Germanic, and North Germanic. East Germanic has no modern descendants, unlike the latter two branches, which include, for example, English, German, and Dutch (West Germanic), and Danish, Norwegian, and Swedish (North Germanic), respectively. The status of the foot has remained remarkably consistent across the history of Germanic, remaining trochaic and quantity sensitive (although details differ across the relevant languages). This is despite significant changes to the quantity systems of Germanic languages, which have almost exclusively lost the distinction in either vowel or consonant quantity (whereas Proto-Germanic had both). The extensive borrowing from Romance languages across Germanic has also had a substantial impact. Germanic words rarely contain more than one foot, whereas Romance loans are largely longer than native Germanic vocabulary and therefore frequently comprise two or more feet. Due to the fact that native vocabulary was broadly stressed on the initial foot, whereas Romance loans often retained right-edge stress, a choice had to be made as to which foot to stress and the modern languages demonstrate that the right edge was selected in every case. Thus, while feet remain quantity sensitive and trochaic, the modern languages construct them from right to left and place main stress on the rightmost foot. This is in contrast to the early stages of the languages, when the opposite was the case.
Daniel Aalto, Jarmo Malinen, and Martti Vainio
Formant frequencies are the positions of the local maxima of the power spectral envelope of a sound signal. They arise from acoustic resonances of the vocal tract air column, and they provide substantial information about both consonants and vowels. In running speech, formants are crucial in signaling the movements with respect to place of articulation. Formants are normally defined as accumulations of acoustic energy estimated from the spectral envelope of a signal. However, not all such peaks can be related to resonances in the vocal tract, as they can be caused by the acoustic properties of the environment outside the vocal tract, and sometimes resonances are not seen in the spectrum. Such formants are called spurious and latent, respectively. By analogy, spectral maxima of synthesized speech are called formants, although they arise from a digital filter. Conversely, speech processing algorithms can detect formants in natural or synthetic speech by modeling its power spectral envelope using a digital filter. Such detection is most successful for male speech with a low fundamental frequency where many harmonic overtones excite each of the vocal tract resonances that lie at higher frequencies. For the same reason, reliable formant detection from females with high pitch or children’s speech is inherently difficult, and many algorithms fail to faithfully detect the formants corresponding to the lowest vocal tract resonant frequencies.
Frequency Effects in Grammar
Holger Diessel and Martin Hilpert
Until recently, theoretical linguists have paid little attention to the frequency of linguistic elements in grammar and grammatical development. It is a standard assumption of (most) grammatical theories that the study of grammar (or competence) must be separated from the study of language use (or performance). However, this view of language has been called into question by various strands of research that have emphasized the importance of frequency for the analysis of linguistic structure. In this research, linguistic structure is often characterized as an emergent phenomenon shaped by general cognitive processes such as analogy, categorization, and automatization, which are crucially influenced by frequency of occurrence. There are many different ways in which frequency affects the processing and development of linguistic structure. Historical linguists have shown that frequent strings of linguistic elements are prone to undergo phonetic reduction and coalescence, and that frequent expressions and constructions are more resistant to structure mapping and analogical leveling than infrequent ones. Cognitive linguists have argued that the organization of constituent structure and embedding is based on the language users’ experience with linguistic sequences, and that the productivity of grammatical schemas or rules is determined by the combined effect of frequency and similarity. Child language researchers have demonstrated that frequency of occurrence plays an important role in the segmentation of the speech stream and the acquisition of syntactic categories, and that the statistical properties of the ambient language are much more regular than commonly assumed. And finally, psycholinguists have shown that structural ambiguities in sentence processing can often be resolved by lexical and structural frequencies, and that speakers’ choices between alternative constructions in language production are related to their experience with particular linguistic forms and meanings. Taken together, this research suggests that our knowledge of grammar is grounded in experience.
Frisian is a West Germanic language that is indigenous to the southern coastal region of the North Sea in the Netherlands and Germany. In the early 21st century, it was spoken by around 400,000 inhabitants of the Dutch province of Friesland, by up to 1,000 speakers in the German municipality of the Saterland, and by an estimated 4,000 people in the German district of Nordfriesland. Corresponding to the geographical separation of these areas, which is the result of a complex historical process involving several migration events and language shifts, the Frisian language is traditionally divided into three dialect groups: West Frisian, East Frisian (Saterlandic), and North Frisian. They share common Frisian features, like the presence of two classes of weak verbs. Nevertheless, they are also characterized by individual innovations and various degrees of influence from different contact languages, which explains why they are no longer mutually intelligible. All three dialects are fully recognized as minority languages but differ in terms of their sociopolitical status. While West Frisian appears to occupy a moderately strong position in society—as it is not only recognized as the second official language of the Netherlands but also has access to higher domains and enjoys a considerable amount of constitutional support—the same does not apply to the other dialects. North Frisian and Saterlandic are mostly, if not entirely, confined to lower domains and attempts to extend their use have been only moderately successful. Considering the number of speakers, West Frisian is a relatively vital language as opposed to North Frisian and Saterlandic, which are both severely endangered.
Linguistic change not only affects the lexicon and the phonology of words, it also operates on the grammar of a language. In this context, grammaticalization is concerned with the development of lexical items into markers of grammatical categories or, more generally, with the development of markers used for procedural cueing of abstract relationships out of linguistic items with concrete referential meaning. A well-known example is the English verb go in its function of a future marker, as in She is going to visit her friend. Phenomena like these are very frequent across the world’s languages and across many different domains of grammatical categories. In the last 50 years, research on grammaticalization has come up with a plethora of (a) generalizations, (b) models of how grammaticalization works, and (c) methodological refinements. On (a): Processes of grammaticalization develop gradually, step by step, and the sequence of the individual stages follows certain clines as they have been generalized from cross-linguistic comparison (unidirectionality). Even though there are counterexamples that go against the directionality of various clines, their number seems smaller than assumed in the late 1990s. On (b): Models or scenarios of grammaticalization integrate various factors. Depending on the theoretical background, grammaticalization and its results are motivated either by the competing motivations of economy vs. iconicity/explicitness in functional typology or by a change from movement to merger in the minimalist program. Pragmatic inference is of central importance for initiating processes of grammaticalization (and maybe also at later stages), and it activates mechanisms like reanalysis and analogy, whose status is controversial in the literature. Finally, grammaticalization does not only work within individual languages/varieties, it also operates across languages. In situations of contact, the existence of a certain grammatical category may induce grammaticalization in another language. On (c): Even though it is hard to measure degrees of grammaticalization in terms of absolute and exact figures, it is possible to determine relative degrees of grammaticalization in terms of the autonomy of linguistic signs. Moreover, more recent research has come up with criteria for distinguishing grammaticalization and lexicalization (defined as the loss of productivity, transparency, and/or compositionality of former productive, transparent, and compositional structures). In spite of these findings, there are still quite a number of questions that need further research. Two questions to be discussed address basic issues concerning the overall properties of grammaticalization. (1) What is the relation between constructions and grammaticalization? In the more traditional view, constructions are seen as the syntactic framework within which linguistic items are grammaticalized. In more recent approaches based on construction grammar, constructions are defined as combinations of form and meaning. Thus, grammaticalization can be seen in the light of constructionalization, i.e., the creation of new combinations of form and meaning. Even though constructionalization covers many apects of grammaticalization, it does not exhaustively cover the domain of grammaticalization. (2) Is grammaticalization cross-linguistically homogeneous, or is there a certain range of variation? There is evidence from East and mainland Southeast Asia that there is cross-linguistic variation to some extent.
(High) German is both a group of closely related West Germanic varieties and a standardized language derived from this group that comprises a wide range of dialects and colloquial varieties in addition to its standardized form. The two terms have related, and to an extent overlapping, but distinct meanings: German refers to a Standard Average European language spoken predominantly in Central Europe by some 96 million speakers and by minority speech communities around the globe. High German has a double meaning: On the one hand, it is another term for Standard German. On the other hand, it refers to the High German linguistic group within West Germanic, the linguistic basis for the German language. As such, it is defined by the High German consonant shift, a sound change that affected Germanic obstruents and set it apart from its immediate neighbors within (West) Germanic, that is, Low German and Low Franconian. The High German consonant shift around the 7th century, together with the onset of written transmission in the 8th century, marks the beginning of the history of (High) German. Traditional dialects perpetuate patterns of areal variation that arose in the wake of this sound change. Standard German developed out of High German written varieties, especially based on East Central German, through processes of leveling, koineization, metalinguistic reasoning, and codification. During that process, the emergent supra-regional norm superseded Low German in northern Germany and Upper German regional norms in the south, as well as influencing spoken registers, but (Standard) German remains a pluricentric and pluriareal language. Today, colloquial, regional varieties that combine features of Standard German and traditional dialects dominate oral language use, and in social media the written language, too, is developing new colloquial forms that build on standard orthography as well as on regional, informal forms of spoken language usage.
Irit Meir and Oksana Tkachman
Iconicity is a relationship of resemblance or similarity between the two aspects of a sign: its form and its meaning. An iconic sign is one whose form resembles its meaning in some way. The opposite of iconicity is arbitrariness. In an arbitrary sign, the association between form and meaning is based solely on convention; there is nothing in the form of the sign that resembles aspects of its meaning. The Hindu-Arabic numerals 1, 2, 3 are arbitrary, because their current form does not correlate to any aspect of their meaning. In contrast, the Roman numerals I, II, III are iconic, because the number of occurrences of the sign I correlates with the quantity that the numerals represent. Because iconicity has to do with the properties of signs in general and not only those of linguistic signs, it plays an important role in the field of semiotics—the study of signs and signaling. However, language is the most pervasive symbolic communicative system used by humans, and the notion of iconicity plays an important role in characterizing the linguistic sign and linguistic systems. Iconicity is also central to the study of literary uses of language, such as prose and poetry. There are various types of iconicity: the form of a sign may resemble aspects of its meaning in several ways: it may create a mental image of the concept (imagic iconicity), or its structure and the arrangement of its elements may resemble the structural relationship between components of the concept represented (diagrammatic iconicity). An example of the first type is the word cuckoo, whose sounds resemble the call of the bird, or a sign such as RABBIT in Israeli Sign Language, whose form—the hands representing the rabbit's long ears—resembles a visual property of that animal. An example of diagrammatic iconicity is vēnī, vīdī, vīcī, where the order of clauses in a discourse is understood as reflecting the sequence of events in the world. Iconicity is found on all linguistic levels: phonology, morphology, syntax, semantics, and discourse. It is found both in spoken languages and in sign languages. However, sign languages, because of the visual-gestural modality through which they are transmitted, are much richer in iconic devices, and therefore offer a rich array of topics and perspectives for investigating iconicity, and the interaction between iconicity and language structure.
Ideophones (Mimetics, Expressives)
Kimi Akita and Mark Dingemanse
Ideophones, also termed mimetics or expressives, are marked words that depict sensory imagery. They are found in many of the world’s languages, and sizable lexical classes of ideophones are particularly well-documented in the languages of Asia, Africa, and the Americas. Ideophones are not limited to onomatopoeia like meow and smack but cover a wide range of sensory domains, such as manner of motion (e.g., plisti plasta ‘splish-splash’ in Basque), texture (e.g., tsaklii ‘rough’ in Ewe), and psychological states (e.g., wakuwaku ‘excited’ in Japanese). Across languages, ideophones stand out as marked words due to special phonotactics, expressive morphology including certain types of reduplication, and relative syntactic independence, in addition to production features like prosodic foregrounding and common co-occurrence with iconic gestures. Three intertwined issues have been repeatedly debated in the century-long literature on ideophones. (a) Definition: Isolated descriptive traditions and cross-linguistic variation have sometimes obscured a typologically unified view of ideophones, but recent advances show the promise of a prototype definition of ideophones as conventionalized depictions in speech, with room for language-specific nuances. (b) Integration: The variable integration of ideophones across linguistic levels reveals an interaction between expressiveness and grammatical integration, and has important implications for how to conceive of dependencies between linguistic systems. (c) Iconicity: Ideophones form a natural laboratory for the study of iconic form-meaning associations in natural languages, and converging evidence from corpus and experimental studies suggests important developmental, evolutionary, and communicative advantages of ideophones.