You are looking at 61-80 of 207 articles for:Clear All
In the Early Modern English period (1500–1700), steps were taken toward Standard English, and this was also the time when Shakespeare wrote, but these perspectives are only part of the bigger picture. This chapter looks at Early Modern English as a variable and changing language not unlike English today. Standardization is found particularly in spelling, and new vocabulary was created as a result of the spread of English into various professional and occupational specializations. New research using digital corpora, dictionaries, and databases reveals the gradual nature of these processes. Ongoing developments were no less gradual in pronunciation, with processes such as the Great Vowel Shift, or in grammar, where many changes resulted in new means of expression and greater transparency. Word order was also subject to gradual change, becoming more fixed over time.
Chris Rogers and Lyle Campbell
The reduction of the world’s linguistic diversity has accelerated over the last century and correlates to a loss of knowledge, collective and individual identity, and social value. Often a language is pushed out of use before scholars and language communities have a chance to document or preserve this linguistic heritage. Many are concerned for this loss, believing it to be one of the most serious issues facing humanity today. To address the issues concomitant with an endangered language, we must know how to define “endangerment,” how different situations of endangerment can be compared, and how each language fits into the cultural practices of individuals. The discussion about endangered languages focuses on addressing the needs, causes, and consequences of this loss.
Concern over endangered languages is not just an academic catch phrase. It involves real people and communities struggling with real social, political, and economic issues. To understand the causes and consequence of language endangerment for these individuals and communities requires a multifaceted perspective on the place of each language in the lives of their users. The loss of a language affects not only the world’s linguistic diversity but also an individual’s social identity, and a community’s sense of itself and its history.
The Eskimo-Aleut language family consists of two quite different branches, Aleut and Eskimo. The latter consists of Yupik and Inuit languages. It is spoken from the eastern coast of Russia to Greenland. The family is thought to have developed and diverged in Alaska between 4,000 and 6,000 years ago, although recent findings in a variety of fields suggest a more complex prehistory than previously assumed. The language family shares certain characteristics, including polysynthetic word formation, an originally ergative-absolutive case system (now substantially modified in Aleut), SOV word order, and more or less similar phonological systems across the language family, involving voiceless stop and voiced fricative consonant series often in alternation, and an originally four-vowel system frequently reduced to three. The languages in the family have undergone substantial postcolonial contact effects, especially evident in (although not restricted to) loanwords from the respective colonial languages. There is extensive language documentation for all languages, although not necessarily all dialects. Most languages and dialects are severely endangered today, with the exception of Eastern Canadian Inuit and Greenlandic (Kalaallisut). There are also theoretical studies of the languages in many linguistic fields, although the languages are unevenly covered, and there are still many more studies of the phonologies and syntaxes of the respective languages than other aspects of grammar.
Eva Buchi and Steven N. Dworkin
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
Within the field of linguistics, etymology is the only subdiscipline that is uniquely historical in its study of the relevant linguistic data. It is one of the oldest fields in Romance linguistics. The scholar credited with establishing Romance linguistics as a scholarly discipline, Friedrich Diez (1794–1876) authored both the first comparative Romance historical grammar (his three-volume Grammatik der Romanischen Sprachen [1836–1844]) and the first pan-Romance etymological dictionary (his Etymologisches Wörterbuch der Romanischen Sprachen ). A similar combination, illustrating the indissoluble link between etymology and historical grammar (especially the study of sound change), can be seen in the work of Wilhelm Meyer-Lübke (1861–1936), author of a four-volume Grammatik der Romanischen Sprachen (1890–1902) and of the last complete pan-Romance etymological dictionary, the Romanisches Etymologisches Wörterbuch (3d definitive edition, 1935).
The concept of etymology as practiced by Romanists has changed over the last 100 years. At the outset, Romance etymologists took as their brief the search for and identification of individual word origins. Starting in the early 20th century, various specialists began to view etymology as the preparation of the complete history of all facets of the evolution over time and space of the words or lexical families under study. Identification of the underlying base was only the first step in the process. From this perspective, etymology constitutes an essential element of diachronic lexicology, which covers all formal, semantic, and syntactic facets of a word’s evolution, including, if appropriate, the circumstances leading to its demise and replacement.
Practitioners of Romance etymology tend to study the history of individual words or word families in specific Romance languages rather than across the entire family. Almost every Romance language and many of their regional varieties have at least one etymological dictionary devoted to the history of its vocabulary (or at least to the identification of relevant word origins), the most notable being such multi-volumed works as the Französisches Etymologisches Wörterbuch (1922–2002), the Lessico Etimilogico Italiano (1979–), the Diccionario crítico etimológico castellano e hispánico (1980–1991), and the Diccionari etimològic i complimenari de la llengua catalana (1980–2001). The last complete pan-Romance dictionary remains the afore-cited third edition of Meyer-Lübke’s Romanisches etymologisches Wörterbuch.
Although originally coined as a riposte to the Neogrammarian view of sound change, Jules Gilliéron’s (1854–1926) dictum, “each word has its own history,” applies equally well to etymology. Yakov Malkiel (1914–1998), one of the leading writers on questions of method and practice in Romance etymology, has discussed the unique and complex nature of etymological solutions. As a result of the emphasis on individual problems and solutions, Romance etymology has not lent itself to the formulation of theories on the nature of lexical change, although there was in the past no shortage of literature on questions of methodology.
Although specialists continue to work on language-specific etymological questions, etymology is not currently at the forefront of work in Romance historical linguistics, a situation that may result, in part, from its lack of engagement with broad theoretical issues. Most studies still appear in the form of journal articles or Festschrift contributions. There is currently underway a new pan-Romance project, the Dictionnaire étymologique Roman (DéRom), with a new (and controversial) methodological underpinning, namely the rigorous application to the Romance data of comparative reconstruction to capture more accurately the phonological and morphological reality of proto-Romance (in essence a register of spoken Latin) and the semantic scope of the etymological base. This project has reawakened an interest in Romance etymology among a new generation of Romanists. Indeed, to remain vital and relevant within the framework of Romance linguistics, etymology must go beyond the details of individual lexical histories and make an effort to link its findings to our understanding of the nature and processes of language change.
Evaluative morphology is a field of linguistic studies that deals with the formation of diminutives, augmentatives, pejoratives, and amelioratives. Actually, evaluative constructions cross the boundaries of morphology, and are sometimes realized by formal strategies that cannot be numbered among word formation processes. Nevertheless, morphology plays a dominant role in the formation of evaluatives. The first attempt to draw an exhaustive account of this set of complex forms is found in the 1984 work Generative Morphology, by Sergio Scalise, who made the hypothesis that evaluatives represent a separate block of rules between inflection and derivation. This hypothesis is based on the fact that evaluatives show some properties that are derivational, others that are inflectional, and some specific properties that are neither derivational nor inflectional. After Scalise’s proposal, almost all scholars have tried to answer the question concerning the place of evaluative rules within the morphological component. What data reveal is that, in a cross-linguistic perspective, evaluatives display a uniform behavior from a semantic and functional point of view, but exhibit a wide range of formal properties. In other words, functional identity does not imply formal identity; consequently, we can expect that constructions performing the same function display different formal properties in different languages. So, if evaluatives are undoubtedly derivational in most Indo-European languages (even if they cannot be considered a typical example of derivation), they are certainly quite close to inflection in some Bantu languages. This means that the question about the place of evaluatives within the morphological component probably is not as crucial as scholars have thought, and that other issues, sometimes neglected in the literature, deserve the same attention. Among them, the role of pragmatics in the description of evaluatives is no doubt central. According to Dressler and Merlini Barbaresi, in their 1994 work, Morphopragmatics: Diminutives and Intensifiers in Italian, German and Other Languages, evaluative constructions are the more typical instantiation of morphopragmatics, which is “defined as the area of general pragmatic meanings of morphological rules, that is of the regular pragmatic effects produced when moving from the input to the output of a morphological rule.” Evaluatives include “a pragmatic variable which cannot be suppressed in the description of [their] meaning.” Another central issue in studies on evaluative morphology is the wide set of semantic nuances that usually accompany diminutives, augmentatives, pejoratives, and amelioratives. For example, a diminutive form can occasionally assume a value that is attenuative, singulative, partitive, appreciative, affectionate, etc. This cluster of semantic values has often increased the idea that evaluatives are irregular in nature and that they irremediably avoid any generalization. Dan Jurafsky showed, in 1996, that these different meanings are often the outcome of regular and cross-linguistically recurrent semantic processes, both in a synchronic and in a diachronic perspective.
This article revisits Grimshaw's (1990) tripartition of nominalization, which introduced an important correlation between particular types of nominalization and the readings associated with these nominal forms, Event and Referential. The article discusses criteria that may be used to distinguish between the two readings and the limitations of these criteria. It further offers a selective discussion of how different approaches to nominalization implement Event and Referential readings.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
Existential and locative constructions form an interesting cluster of copular structures in Romance. They are clearly related, and yet there are theoretical reasons to keep them apart. In-depth analysis of the Romance languages lends empirical support to their differentiation. In semantic terms, existentials express propositions about existence or presence in an implicit contextual domain, whereas locatives express propositions about the location of an entity. In terms of information structure, existentials are typically all new or broad focus constructions. Locatives are normally characterized by focus on the location, although this can also be a presupposed topic.
Romance existentials are formed with a copula and a post-copular phrase (the pivot). A wide range of variation is found in copula selection, copula-pivot agreement, expletive subjects, the presence and function of an etymologically locative pre-copular proform, and, lastly, the categorial status of the pivot, which is normally a noun phrase, but can also be an adjective (Calabrian, Sicilian). As for Romance locatives, a distinction must be drawn between, on the one hand, a construction with canonical SV order and S-V agreement and, on the other hand, another construction, with VS order and, in some languages, lack of V-S agreement. This latter structure has been named inverse locative.
Both existentials and locatives have a non-verbal predicate: the locative phrase in locatives and the post-copular noun or adjectival phrase in existentials. In locatives the predicate selects a thematic argument, i.e., an argument endowed with a thematic role, which serves as the syntactic subject, exception being made for inverse locatives in some languages. Contrastingly, in existentials, there is no thematic argument. In some languages the copula turns to the pivot for agreement, as this is the only overt noun phrase endowed with person and number features (Italian, Friulian, Romanian, etc.). In other languages this non-canonical agreement is not licensed (French, some Calabrian dialects, Brazilian Portuguese, etc.). In others still (Spanish, Sardinian, European Portuguese, Catalan, Gallo-Italian, etc.), it is only admitted with pivot classes that can be defined in terms of specificity. When the copula does not agree with the pivot, an expletive subject form may figure in pre-copular position. The crosslinguistic variation in copula-pivot agreement has been claimed to depend on language-specific constraints on subjecthood.
Highly specific pivots are only admitted in contextualized existentials, which express a proposition about the presence of an individual or an entity in a given and salient context. These existentials are found in all the Romance languages and would seem to defy the semantico-pragmatic constraints on the pivot that are widely known as Definiteness Effects.
A fundamental difference in theoretical models of morphology and, particularly, of the syntax–morphology interface is that between endoskeletal and exoskeletal approaches. In the former, more traditional, endoskeletal approaches, open-class lexical items like cat or sing are held to be inherently endowed with a series of formal features that determine the properties of the linguistic expressions in which they appear. In the latter, more recent, exoskeletal approaches, it is rather the morphosyntactic configurations, independently produced by the combination of abstract functional elements, that determine those properties. Lexical items, in this latter approach, are part of the structure but, crucially, do not determine it.
Conceptually, although a correlation is usually made between endoskeletalism and lexicalism/projectionism, on the one hand, and between exoskeletalism and (neo)constructionism, on the other, things are actually more complicated, and some frameworks exist that seem to challenge those correlations, in particular when the difference between word and morpheme is taken into account.
Empirically, the difference between these two approaches to morphology and the morphology-syntax interface comes to light when one examines how each one treats a diversity of word-related phenomena: morphosyntactic category and category shift in derivational processes, inflectional class, nominal properties like mass or count, and verbal properties like agentivity and (a)telicity.
While both pragmatic theory and experimental investigations of language using psycholinguistic methods have been well-established subfields in the language sciences for a long time, the field of Experimental Pragmatics, where such methods are applied to pragmatic phenomena, has only fully taken shape since the early 2000s. By now, however, it has become a major and lively area of ongoing research, with dedicated conferences, workshops, and collaborative grant projects, bringing together researchers with linguistic, psychological, and computational approaches across disciplines. Its scope includes virtually all meaning-related phenomena in natural language comprehension and production, with a particular focus on what inferences utterances give rise to that go beyond what is literally expressed by the linguistic material.
One general area that has been explored in great depth consists of investigations of various ‘ingredients’ of meaning. A major aim has been to develop experimental methodologies to help classify various aspects of meaning, such as implicatures and presuppositions as compared to basic truth-conditional meaning, and to capture their properties more thoroughly using more extensive empirical data. The study of scalar implicatures (e.g., the inference that some but not all students left based on the sentence Some students left) has served as a catalyst of sorts in this area, and they constitute one of the most well-studied phenomena in Experimental Pragmatics to date. But much recent work has expanded the general approach to other aspects of meaning, including presuppositions and conventional implicatures, but also other aspects of nonliteral meaning, such as irony, metonymy, and metaphors.
The study of reference constitutes another core area of research in Experimental Pragmatics, and has a more extensive history of precursors in psycholinguistics proper. Reference resolution commonly requires drawing inferences beyond what is conventionally conveyed by the linguistic material at issue as well; the key concern is how comprehenders grasp the referential intentions of a speaker based on the referential expressions used in a given context, as well as how the speaker chooses an appropriate expression in the first place. Pronouns, demonstratives, and definite descriptions are crucial expressions of interest, with special attention to their relation to both intra- and extralinguistic context. Furthermore, one key line of research is concerned with speakers’ and listeners’ capacity to keep track of both their own private perspective and the shared perspective of the interlocutors in actual interaction.
Given the rapid ongoing growth in the field, there is a large number of additional topical areas that cannot all be mentioned here, but the final section of the article briefly mentions further current and future areas of research.
Experimental Semiotics (ES) is a burgeoning new discipline aimed at investigating in the laboratory the development of novel forms of human communication. Conceptually connected to experimental research on language use, ES provides a scientific complement to field studies of spontaneously emerging new languages and studies on the emergence of communication systems among artificial agents.
ES researchers have created quite a few research paradigms to investigate the development of novel forms of human communication. Despite their diversity, these paradigms all rely on the use of semiotic games, that is, games in which people can succeed reliably only after they have developed novel communication systems. Some of these games involve creating novel signs for pre-specified meanings. These games are particularly suitable for studying relatively large communication systems and their structural properties. Other semiotic games involve establishing shared meanings as well as novel signs to communicate about them. These games are typically rather challenging and are particularly suitable for investigating the processes through which novel forms of communication are created.
Considering that ES is a methodological stance rather than a well-defined research theme, researchers have used it to address a greatly heterogeneous set of research questions. Despite this, and despite the recent origins of ES, two of these questions have begun to coalesce into relatively coherent research themes.
The first theme originates from the observation that novel communication systems developed in the laboratory tend to acquire features that are similar to key features of natural language. Most notably, they tend (a) to rely on the use of symbols—that is purely conventional signs—and (b) to adopt a combinatorial design, using a few basic units to express a large number of meanings. ES researchers have begun investigating some of the factors that lead to the acquisition of such features. These investigations suggest two conclusions. The first is that the emergence of symbols depends on the fact that, when repeatedly using non-symbolic signs, people tend to progressively abstract them. The second conclusion is that novel communication systems tend to adopt a combinatorial design more readily when their signs have low degrees of motivation and fade rapidly.
The second research theme originates from the observation that novel communication systems developed in the laboratory tend to begin systematically with motivated—that is non-symbolic—signs. ES investigations of this tendency suggest that it occurs because motivation helps people bootstrap novel forms of communication. Put it another way, these investigations show that it is very difficult for people to bootstrap communication through arbitrary signs.
John E. Joseph
Ferdinand de Saussure (1857–1913), the founding figure of modern linguistics, made his mark on the field with a book he published a month after his 21st birthday, in which he proposed a radical rethinking of the original system of vowels in Proto-Indo-European. A year later, he submitted his doctoral thesis on a morpho-syntactic topic, the genitive absolute in Sanskrit, to the University of Leipzig. He went to Paris intending to do a second, French doctorate, but instead he was given responsibility for courses on Gothic and Old High Gerrman at the École Pratique des Hautes Études, and for managing the publications of the Société de Linguistique de Paris. He abandoned more than one large publication project of his own during the decade he spent in Paris. In 1891 he returned to his native Geneva, where the University created a chair in Sanskrit and the history and comparison of languages for him. He produced some significant work on Lithuanian during this period, connected to his early book on the Indo-European vowel system, and yielding Saussure’s Law, concerning the placement of stress in Lithuanian. He undertook writing projects about the general nature of language, but again abandoned them. In 1907, 1908–1909, and 1910–1911, he gave three courses in general linguistics at the University of Geneva, in which he developed an approach to languages as systems of signs, each sign consisting of a signifier (sound pattern) and a signified (concept), both of them mental rather than physical in nature, and conjoined arbitrarily and inseparably. The socially shared language system, or langue, makes possible the production and comprehension of parole, utterances, by individual speakers and hearers. Each signifier and signified is a value generated by its difference from all the other signifiers or signifieds with which it coexists on an associative (or paradigmatic) axis, and affected as well by its syntagmatic axis. Shortly after Saussure’s death at 55, two of his colleagues, Bally and Sechehaye, gathered together students’ notes from the three courses, as well as manuscript notes by Saussure, and from them constructed the Cours de linguistique générale, published in 1916. Over the course of the next several decades, this book became the basis for the structuralist approach, initially within linguistics, and later adapted to other fields. Saussure left behind a large quantity of manuscript material that has gradually been published over the last few decades, and continues to be published, shedding new light on his thought.
Daniel Aalto, Jarmo Malinen, and Martti Vainio
Formant frequencies are the positions of the local maxima of the power spectral envelope of a sound signal. They arise from acoustic resonances of the vocal tract air column, and they provide substantial information about both consonants and vowels. In running speech, formants are crucial in signaling the movements with respect to place of articulation. Formants are normally defined as accumulations of acoustic energy estimated from the spectral envelope of a signal. However, not all such peaks can be related to resonances in the vocal tract, as they can be caused by the acoustic properties of the environment outside the vocal tract, and sometimes resonances are not seen in the spectrum. Such formants are called spurious and latent, respectively. By analogy, spectral maxima of synthesized speech are called formants, although they arise from a digital filter. Conversely, speech processing algorithms can detect formants in natural or synthetic speech by modeling its power spectral envelope using a digital filter. Such detection is most successful for male speech with a low fundamental frequency where many harmonic overtones excite each of the vocal tract resonances that lie at higher frequencies. For the same reason, reliable formant detection from females with high pitch or children’s speech is inherently difficult, and many algorithms fail to faithfully detect the formants corresponding to the lowest vocal tract resonant frequencies.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
French-Based Creole Languages (FBCLs) may be characterized as a group by one historical and two linguistic properties. Their shared historical feature is that they arose between the 16th and 19th centuries as vehicular (hence oral) languages in French colonies, through language contact between oral varieties of French spoken by the colonists, and typologically and genetically diverse languages spoken by imported slaves—or imported workers or the local people in the case of Tayo, which emerged in the 19th century after the abolition of slavery and whose status as an FBCL is controversial. The linguistic features characterizing FBCLs are (1) that their lexicon is derived from French while their grammar (phonology and morphosyntax) is both reminiscent of, and different from, that of known varieties of spoken, nonstandard, dialectal French; and (2), that they stand as first languages (L1s), namely, they are acquired by children through the natural process of language acquisition and are used for all-purpose communication—as opposed to pidgins, a type of contact languages only used as vehicular L2s for specific-interaction purposes (e.g., trade).
FBCLs thus defined currently include on the American continent: Gwiyané/Guyanais (in French Guyana) and Karipuna Creole (Brazil, near the French-Guyana border); Lwizyané/Louisianais (on the decrease), in Louisiana, USA; in the Caribbean: Ayisyen/Haitian (in the independent Republic of Haiti); Senlisyen/Saint-Lucian (in the state of Sainte-Lucie), and the creoles spoken in the French-controlled territories of Martinique, Guadeloupe, Dominique, Saint-Barthélémy, and the northern part of Saint-Martin; in the Indian Ocean, off the shores of Eastern Africa: Morisyen/Mauritian (in Mauritius), Seselwa/Seychellois (in the Seychelles), Rodrigé/Rodriguais (in the Rodrigues islands, controlled by Mauritius), Réyinyoné/Réunionnais (in the island of Réunion, a French-controlled territory); and in Southern New Caledonia: Tayo.
Beyond the shared defining features proposed above, there is much variation among FBCLs with respect to the places, periods, and historical conditions of their emergence; the relevant contact languages involved in their development; and their resulting grammatical properties.
Holger Diessel and Martin Hilpert
Until recently, theoretical linguists have paid little attention to the frequency of linguistic elements in grammar and grammatical development. It is a standard assumption of (most) grammatical theories that the study of grammar (or competence) must be separated from the study of language use (or performance). However, this view of language has been called into question by various strands of research that have emphasized the importance of frequency for the analysis of linguistic structure. In this research, linguistic structure is often characterized as an emergent phenomenon shaped by general cognitive processes such as analogy, categorization, and automatization, which are crucially influenced by frequency of occurrence.
There are many different ways in which frequency affects the processing and development of linguistic structure. Historical linguists have shown that frequent strings of linguistic elements are prone to undergo phonetic reduction and coalescence, and that frequent expressions and constructions are more resistant to structure mapping and analogical leveling than infrequent ones. Cognitive linguists have argued that the organization of constituent structure and embedding is based on the language users’ experience with linguistic sequences, and that the productivity of grammatical schemas or rules is determined by the combined effect of frequency and similarity. Child language researchers have demonstrated that frequency of occurrence plays an important role in the segmentation of the speech stream and the acquisition of syntactic categories, and that the statistical properties of the ambient language are much more regular than commonly assumed. And finally, psycholinguists have shown that structural ambiguities in sentence processing can often be resolved by lexical and structural frequencies, and that speakers’ choices between alternative constructions in language production are related to their experience with particular linguistic forms and meanings. Taken together, this research suggests that our knowledge of grammar is grounded in experience.
Game theory provides formal means of representing and explaining action choices in social decision situations where the choices of one participant depend on the choices of another. Game theoretic pragmatics approaches language production and interpretation as a game in this sense. Patterns in language use are explained as optimal, rational, or at least nearly optimal or rational solutions to a communication problem. Three intimately related perspectives on game theoretic pragmatics are sketched here: (i) the evolutionary perspective explains language use as the outcome of some optimization process, (ii) the rationalistic perspective pictures language use as a form of rational decision-making, and (iii) the probabilistic reasoning perspective considers specifically speakers’ and listeners’ beliefs about each other. There are clear commonalities behind these three perspectives, and they may in practice blend into each other.
At the heart of game theoretic pragmatics lies the idea that speaker and listener behavior, when it comes to using a language with a given semantic meaning, are attuned to each other. By focusing on the evolutionary or rationalistic perspective, we can then give a functional account of general patterns in our pragmatic language use. The probabilistic reasoning perspective invites modeling actual speaker and listener behavior, for example, as it shows in quantitative aspects of experimental data.
Gender is a grammatical feature, in a family with person, number, and case. In the languages that have grammatical gender—according to a representative typological sample, almost half of the languages in the world—it is a property that separates nouns into classes. These classes are often meaningful and often linked to biological sex, which is why many languages are said to have a “masculine” and a “feminine” gender. A typical example is Italian, which has masculine words for male persons (il bambino “the.
Across the languages of the world, gender systems vary widely. They differ in the number of classes, in the underlying assignment rules, and in how and where gender is marked. Since agreement is a definitional property, gender is generally absent in isolating languages as well as in young languages with little bound morphology, including sign languages. Therefore, gender is considered a mature phenomenon in language.
Gender interacts in various ways with other grammatical features. For example, it may be limited to the singular number or the third person, and it may be crosscut by case distinctions. These and other interrelations can complicate the task of figuring out a gender system in first or second language acquisition. Yet, children master gender early, making use of a broad variety of cues. By contrast, gender is famously difficult for second-language learners. This is especially true for adults and for learners whose first language does not have a gender system. Nevertheless, tests show that even for this group, native-like competence is possible to attain.
Different methods exist for classifying languages, depending on whether the task is to work out the relations among languages already known to be related—internal language classification—or whether the task is to establish that certain languages are related—external language classification.
The comparative method in historical linguistics, developed during the latter part of the 19th century, represents one method for internal language classification; lexicostatistics, developed during the 1950s, represents another. Elements of lexicostatistics have been transformed and carried over into modern computational linguistic phylogenetics, and currently efforts are also being made to automate the comparative method. Recent years have seen rapid progress in the development of methods, tools, and resources for language classification. For instance, computational phylogenetic algorithms and software have made it possible to handle the classification of many languages using explicit models of language change, and data have been gathered for two thirds of the world’s language, allowing for rapid, exploratory classifications. There are also many open questions and venues for future research, for instance: What are the real-world counterparts to the nodes in a family tree structure? How can shortcomings in the traditional method of comparative historical linguistics be overcome? How can the understanding of the results that computational linguistic phylogenetics have to offer be improved?
External language classification, a notoriously difficult task, has also benefitted from the advent of computational power. While, in the past, the simultaneous comparison of many languages for the purpose of discovering deep genealogical links was carried out in a haphazard fashion, leaving too much room for the effect of chance similarities to kick in, this sort of activity can now be done in a systematic, objective way on an unprecedented scale. The ways of producing final, convincing evidence for a deep genealogical relation, however, have not changed much. There is some room for improvement in this area, but even more room for improvement in the way that proposals for long-distance relations are evaluated.
Knut Tarald Taraldsen
This article presents different types of generative grammar that can be used as models of natural languages focusing on a small subset of all the systems that have been devised. The central idea behind generative grammar may be rendered in the words of Richard Montague: “I reject the contention that an important theoretical difference exists between formal and natural languages” (“Universal Grammar,” Theoria, 36 , 373–398).
The German sinologist and general linguist Georg von der Gabelentz (1840–1893) occupies an interesting place at the intersection of several streams of linguistic scholarship at the end of the 19th century. As Professor of East Asian languages at the University of Leipzig from 1878 to 1889 and then Professor for Sinology and General Linguistics at the University of Berlin from 1889 until his death, Gabelentz was present at some of the main centers of linguistics at the time. He was, however, generally critical of mainstream historical-comparative linguistics as propagated by the neogrammarians, and instead emphasized approaches to language inspired by a line of researchers including Wilhelm von Humboldt (1767–1835), H. Steinthal (1823–1899), and his own father, Hans Conon von der Gabelentz (1807–1874).
Today Gabelentz is chiefly remembered for several theoretical and methodological innovations which continue to play a role in linguistics. Most significant among these are his contributions to cross-linguistic syntactic comparison and typology, grammar-writing, and grammaticalization. His earliest linguistic work emphasized the importance of syntax as a core part of grammar and sought to establish a framework for the cross-linguistic description of word order, as had already been attempted for morphology by other scholars. The importance he attached to syntax was motivated by his engagement with Classical Chinese, a language almost devoid of morphology and highly reliant on syntax. In describing this language in his 1881 Chinesische Grammatik, Gabelentz elaborated and implemented the complementary “analytic” and “synthetic” systems of grammar, an approach to grammar-writing that continues to serve as a point of reference up to the present day. In his summary of contemporary thought on the nature of grammatical change in language, he became one of the first linguists to formulate the principles of grammaticalization in essentially the form that this phenomenon is studied today, although he did not use the current term. One key term of modern linguistics that he did employ, however, is “typology,” a term that he in fact coined. Gabelentz’s typology was a development on various contemporary strands of thought, including his own comparative syntax, and is widely acknowledged as a direct precursor of the present-day field.
Gabelentz is a significant transitional figure from the 19th to the 20th century. On the one hand, his work seems very modern. Beyond his contributions to grammaticalization avant la lettre and his christening of typology, his conception of language prefigures the structuralist revolution of the early 20th century in important respects. On the other hand, he continues to entertain several preoccupations of the 19th century—in particular the judgment of the relative value of different languages—which were progressively banished from linguistics in the first decades of the 20th century.
Linguistic change not only affects the lexicon and the phonology of words, it also operates on the grammar of a language. In this context, grammaticalization is concerned with the development of lexical items into markers of grammatical categories or, more generally, with the development of markers used for procedural cueing of abstract relationships out of linguistic items with concrete referential meaning. A well-known example is the English verb go in its function of a future marker, as in She is going to visit her friend. Phenomena like these are very frequent across the world’s languages and across many different domains of grammatical categories. In the last 50 years, research on grammaticalization has come up with a plethora of (a) generalizations, (b) models of how grammaticalization works, and (c) methodological refinements.
On (a): Processes of grammaticalization develop gradually, step by step, and the sequence of the individual stages follows certain clines as they have been generalized from cross-linguistic comparison (unidirectionality). Even though there are counterexamples that go against the directionality of various clines, their number seems smaller than assumed in the late 1990s.
On (b): Models or scenarios of grammaticalization integrate various factors. Depending on the theoretical background, grammaticalization and its results are motivated either by the competing motivations of economy vs. iconicity/explicitness in functional typology or by a change from movement to merger in the minimalist program. Pragmatic inference is of central importance for initiating processes of grammaticalization (and maybe also at later stages), and it activates mechanisms like reanalysis and analogy, whose status is controversial in the literature. Finally, grammaticalization does not only work within individual languages/varieties, it also operates across languages. In situations of contact, the existence of a certain grammatical category may induce grammaticalization in another language.
On (c): Even though it is hard to measure degrees of grammaticalization in terms of absolute and exact figures, it is possible to determine relative degrees of grammaticalization in terms of the autonomy of linguistic signs. Moreover, more recent research has come up with criteria for distinguishing grammaticalization and lexicalization (defined as the loss of productivity, transparency, and/or compositionality of former productive, transparent, and compositional structures).
In spite of these findings, there are still quite a number of questions that need further research. Two questions to be discussed address basic issues concerning the overall properties of grammaticalization. (1) What is the relation between constructions and grammaticalization? In the more traditional view, constructions are seen as the syntactic framework within which linguistic items are grammaticalized. In more recent approaches based on construction grammar, constructions are defined as combinations of form and meaning. Thus, grammaticalization can be seen in the light of constructionalization, i.e., the creation of new combinations of form and meaning. Even though constructionalization covers many apects of grammaticalization, it does not exhaustively cover the domain of grammaticalization. (2) Is grammaticalization cross-linguistically homogeneous, or is there a certain range of variation? There is evidence from East and mainland Southeast Asia that there is cross-linguistic variation to some extent.