You are looking at 81-100 of 279 articles
Evaluative morphology is a field of linguistic studies that deals with the formation of diminutives, augmentatives, pejoratives, and amelioratives. Actually, evaluative constructions cross the boundaries of morphology, and are sometimes realized by formal strategies that cannot be numbered among word formation processes. Nevertheless, morphology plays a dominant role in the formation of evaluatives. The first attempt to draw an exhaustive account of this set of complex forms is found in the 1984 work Generative Morphology, by Sergio Scalise, who made the hypothesis that evaluatives represent a separate block of rules between inflection and derivation. This hypothesis is based on the fact that evaluatives show some properties that are derivational, others that are inflectional, and some specific properties that are neither derivational nor inflectional. After Scalise’s proposal, almost all scholars have tried to answer the question concerning the place of evaluative rules within the morphological component. What data reveal is that, in a cross-linguistic perspective, evaluatives display a uniform behavior from a semantic and functional point of view, but exhibit a wide range of formal properties. In other words, functional identity does not imply formal identity; consequently, we can expect that constructions performing the same function display different formal properties in different languages. So, if evaluatives are undoubtedly derivational in most Indo-European languages (even if they cannot be considered a typical example of derivation), they are certainly quite close to inflection in some Bantu languages. This means that the question about the place of evaluatives within the morphological component probably is not as crucial as scholars have thought, and that other issues, sometimes neglected in the literature, deserve the same attention. Among them, the role of pragmatics in the description of evaluatives is no doubt central. According to Dressler and Merlini Barbaresi, in their 1994 work, Morphopragmatics: Diminutives and Intensifiers in Italian, German and Other Languages, evaluative constructions are the more typical instantiation of morphopragmatics, which is “defined as the area of general pragmatic meanings of morphological rules, that is of the regular pragmatic effects produced when moving from the input to the output of a morphological rule.” Evaluatives include “a pragmatic variable which cannot be suppressed in the description of [their] meaning.” Another central issue in studies on evaluative morphology is the wide set of semantic nuances that usually accompany diminutives, augmentatives, pejoratives, and amelioratives. For example, a diminutive form can occasionally assume a value that is attenuative, singulative, partitive, appreciative, affectionate, etc. This cluster of semantic values has often increased the idea that evaluatives are irregular in nature and that they irremediably avoid any generalization. Dan Jurafsky showed, in 1996, that these different meanings are often the outcome of regular and cross-linguistically recurrent semantic processes, both in a synchronic and in a diachronic perspective.
This article revisits Grimshaw's (1990) tripartition of nominalization, which introduced an important correlation between particular types of nominalization and the readings associated with these nominal forms, Event and Referential. The article discusses criteria that may be used to distinguish between the two readings and the limitations of these criteria. It further offers a selective discussion of how different approaches to nominalization implement Event and Referential readings.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
Existential and locative constructions form an interesting cluster of copular structures in Romance. They are clearly related, and yet there are theoretical reasons to keep them apart. In-depth analysis of the Romance languages lends empirical support to their differentiation. In semantic terms, existentials express propositions about existence or presence in an implicit contextual domain, whereas locatives express propositions about the location of an entity. In terms of information structure, existentials are typically all new or broad focus constructions. Locatives are normally characterized by focus on the location, although this can also be a presupposed topic.
Romance existentials are formed with a copula and a post-copular phrase (the pivot). A wide range of variation is found in copula selection, copula-pivot agreement, expletive subjects, the presence and function of an etymologically locative pre-copular proform, and, lastly, the categorial status of the pivot, which is normally a noun phrase, but can also be an adjective (Calabrian, Sicilian). As for Romance locatives, a distinction must be drawn between, on the one hand, a construction with canonical SV order and S-V agreement and, on the other hand, another construction, with VS order and, in some languages, lack of V-S agreement. This latter structure has been named inverse locative.
Both existentials and locatives have a non-verbal predicate: the locative phrase in locatives and the post-copular noun or adjectival phrase in existentials. In locatives the predicate selects a thematic argument, i.e., an argument endowed with a thematic role, which serves as the syntactic subject, exception being made for inverse locatives in some languages. Contrastingly, in existentials, there is no thematic argument. In some languages the copula turns to the pivot for agreement, as this is the only overt noun phrase endowed with person and number features (Italian, Friulian, Romanian, etc.). In other languages this non-canonical agreement is not licensed (French, some Calabrian dialects, Brazilian Portuguese, etc.). In others still (Spanish, Sardinian, European Portuguese, Catalan, Gallo-Italian, etc.), it is only admitted with pivot classes that can be defined in terms of specificity. When the copula does not agree with the pivot, an expletive subject form may figure in pre-copular position. The crosslinguistic variation in copula-pivot agreement has been claimed to depend on language-specific constraints on subjecthood.
Highly specific pivots are only admitted in contextualized existentials, which express a proposition about the presence of an individual or an entity in a given and salient context. These existentials are found in all the Romance languages and would seem to defy the semantico-pragmatic constraints on the pivot that are widely known as Definiteness Effects.
María Irene Moyna
The definition of exocentricity hinges on the notion of head in morphology. Exocentricity and its opposite, endocentricity, describe the two possible relationships between compound constituents and the compound lexeme they make up. In endocentric compounds, one of the constituent lexemes is the head, that is, the lexical item with the semantico-syntactic features that are passed on to the whole compound. In exocentric compounds, the features of the whole are not attributable to the constituents and must be sought elsewhere.
Exocentric compounds can be divided into two broad classes, namely, syntactic (or formal) and semantic exocentric compounds. Syntactic exocentric compounds exhibit a mismatch between the grammatical category of their constituents and that of the whole. Semantic exocentric compounds are exocentric by virtue of their meaning alone, their structure providing no clues of their nonliteral interpretation. Historically, most descriptive and theoretical analyses of exocentricity have focused on syntactic exocentric compounds. On the basis of large but non-exhaustive databases of the world languages, it has been shown that exocentric compounds are marked. With a few exceptions, exocentric compound patterns are both less frequent cross-linguistically and less likely to be used in those languages that can have them. However, some patterns recur with remarkable regularity in the world’s languages. These include possessive compounds (known by their Sanskrit name, bahuvrīhi), which combine a description of a part to denote the whole (e.g., Eng. sabretooth). Deverbal nominal compounds are also robust in many language families, such as Romance; these compounds combine a verb and its direct object to denote an agent or instrument (e.g., Fr. portefeuilles ‘briefcase,’ lit. ‘carry+papers’). A third highly frequent exocentric compounding pattern combines two constituents of the same grammatical category to create a lexeme of a different word class (e.g., Japanese daisho ‘size,’ lit. ‘small+large’). It should be noted that the basic distinction between syntactic and semantic exocentric compounds can become blurred because any lexicalized compound, regardless of its internal structure, is potentially susceptible to metaphoric meaning shifts and to formal recategorization through conversion. Although exocentricity is a syntactico-semantic feature typically attributed to compounds, other morphological structures may occasionally exhibit similar behavior, namely, phrasal chunks or “syntactic freezes.”
Exocentric compounds create interesting challenges to rule-based accounts of morphology, including both lexicalist hypotheses and also those that subsume word formation operations to those of syntax. In both types of proposals, the features of all constructions are attributable to their head, so that accounting for the mismatch exhibited by exocentric compounds requires structural adjustments. Cognitive linguistics has also focused on exocentric compounds, and has sought to account for their meanings through a combination of metaphoric and metonymic shifts.
A fundamental difference in theoretical models of morphology and, particularly, of the syntax–morphology interface is that between endoskeletal and exoskeletal approaches. In the former, more traditional, endoskeletal approaches, open-class lexical items like cat or sing are held to be inherently endowed with a series of formal features that determine the properties of the linguistic expressions in which they appear. In the latter, more recent, exoskeletal approaches, it is rather the morphosyntactic configurations, independently produced by the combination of abstract functional elements, that determine those properties. Lexical items, in this latter approach, are part of the structure but, crucially, do not determine it.
Conceptually, although a correlation is usually made between endoskeletalism and lexicalism/projectionism, on the one hand, and between exoskeletalism and (neo)constructionism, on the other, things are actually more complicated, and some frameworks exist that seem to challenge those correlations, in particular when the difference between word and morpheme is taken into account.
Empirically, the difference between these two approaches to morphology and the morphology-syntax interface comes to light when one examines how each one treats a diversity of word-related phenomena: morphosyntactic category and category shift in derivational processes, inflectional class, nominal properties like mass or count, and verbal properties like agentivity and (a)telicity.
While both pragmatic theory and experimental investigations of language using psycholinguistic methods have been well-established subfields in the language sciences for a long time, the field of Experimental Pragmatics, where such methods are applied to pragmatic phenomena, has only fully taken shape since the early 2000s. By now, however, it has become a major and lively area of ongoing research, with dedicated conferences, workshops, and collaborative grant projects, bringing together researchers with linguistic, psychological, and computational approaches across disciplines. Its scope includes virtually all meaning-related phenomena in natural language comprehension and production, with a particular focus on what inferences utterances give rise to that go beyond what is literally expressed by the linguistic material.
One general area that has been explored in great depth consists of investigations of various ‘ingredients’ of meaning. A major aim has been to develop experimental methodologies to help classify various aspects of meaning, such as implicatures and presuppositions as compared to basic truth-conditional meaning, and to capture their properties more thoroughly using more extensive empirical data. The study of scalar implicatures (e.g., the inference that some but not all students left based on the sentence Some students left) has served as a catalyst of sorts in this area, and they constitute one of the most well-studied phenomena in Experimental Pragmatics to date. But much recent work has expanded the general approach to other aspects of meaning, including presuppositions and conventional implicatures, but also other aspects of nonliteral meaning, such as irony, metonymy, and metaphors.
The study of reference constitutes another core area of research in Experimental Pragmatics, and has a more extensive history of precursors in psycholinguistics proper. Reference resolution commonly requires drawing inferences beyond what is conventionally conveyed by the linguistic material at issue as well; the key concern is how comprehenders grasp the referential intentions of a speaker based on the referential expressions used in a given context, as well as how the speaker chooses an appropriate expression in the first place. Pronouns, demonstratives, and definite descriptions are crucial expressions of interest, with special attention to their relation to both intra- and extralinguistic context. Furthermore, one key line of research is concerned with speakers’ and listeners’ capacity to keep track of both their own private perspective and the shared perspective of the interlocutors in actual interaction.
Given the rapid ongoing growth in the field, there is a large number of additional topical areas that cannot all be mentioned here, but the final section of the article briefly mentions further current and future areas of research.
Experimental Semiotics (ES) is a burgeoning new discipline aimed at investigating in the laboratory the development of novel forms of human communication. Conceptually connected to experimental research on language use, ES provides a scientific complement to field studies of spontaneously emerging new languages and studies on the emergence of communication systems among artificial agents.
ES researchers have created quite a few research paradigms to investigate the development of novel forms of human communication. Despite their diversity, these paradigms all rely on the use of semiotic games, that is, games in which people can succeed reliably only after they have developed novel communication systems. Some of these games involve creating novel signs for pre-specified meanings. These games are particularly suitable for studying relatively large communication systems and their structural properties. Other semiotic games involve establishing shared meanings as well as novel signs to communicate about them. These games are typically rather challenging and are particularly suitable for investigating the processes through which novel forms of communication are created.
Considering that ES is a methodological stance rather than a well-defined research theme, researchers have used it to address a greatly heterogeneous set of research questions. Despite this, and despite the recent origins of ES, two of these questions have begun to coalesce into relatively coherent research themes.
The first theme originates from the observation that novel communication systems developed in the laboratory tend to acquire features that are similar to key features of natural language. Most notably, they tend (a) to rely on the use of symbols—that is purely conventional signs—and (b) to adopt a combinatorial design, using a few basic units to express a large number of meanings. ES researchers have begun investigating some of the factors that lead to the acquisition of such features. These investigations suggest two conclusions. The first is that the emergence of symbols depends on the fact that, when repeatedly using non-symbolic signs, people tend to progressively abstract them. The second conclusion is that novel communication systems tend to adopt a combinatorial design more readily when their signs have low degrees of motivation and fade rapidly.
The second research theme originates from the observation that novel communication systems developed in the laboratory tend to begin systematically with motivated—that is non-symbolic—signs. ES investigations of this tendency suggest that it occurs because motivation helps people bootstrap novel forms of communication. Put it another way, these investigations show that it is very difficult for people to bootstrap communication through arbitrary signs.
John E. Joseph
Ferdinand de Saussure (1857–1913), the founding figure of modern linguistics, made his mark on the field with a book he published a month after his 21st birthday, in which he proposed a radical rethinking of the original system of vowels in Proto-Indo-European. A year later, he submitted his doctoral thesis on a morpho-syntactic topic, the genitive absolute in Sanskrit, to the University of Leipzig. He went to Paris intending to do a second, French doctorate, but instead he was given responsibility for courses on Gothic and Old High Gerrman at the École Pratique des Hautes Études, and for managing the publications of the Société de Linguistique de Paris. He abandoned more than one large publication project of his own during the decade he spent in Paris. In 1891 he returned to his native Geneva, where the University created a chair in Sanskrit and the history and comparison of languages for him. He produced some significant work on Lithuanian during this period, connected to his early book on the Indo-European vowel system, and yielding Saussure’s Law, concerning the placement of stress in Lithuanian. He undertook writing projects about the general nature of language, but again abandoned them. In 1907, 1908–1909, and 1910–1911, he gave three courses in general linguistics at the University of Geneva, in which he developed an approach to languages as systems of signs, each sign consisting of a signifier (sound pattern) and a signified (concept), both of them mental rather than physical in nature, and conjoined arbitrarily and inseparably. The socially shared language system, or langue, makes possible the production and comprehension of parole, utterances, by individual speakers and hearers. Each signifier and signified is a value generated by its difference from all the other signifiers or signifieds with which it coexists on an associative (or paradigmatic) axis, and affected as well by its syntagmatic axis. Shortly after Saussure’s death at 55, two of his colleagues, Bally and Sechehaye, gathered together students’ notes from the three courses, as well as manuscript notes by Saussure, and from them constructed the Cours de linguistique générale, published in 1916. Over the course of the next several decades, this book became the basis for the structuralist approach, initially within linguistics, and later adapted to other fields. Saussure left behind a large quantity of manuscript material that has gradually been published over the last few decades, and continues to be published, shedding new light on his thought.
First-language acquisition of morphology refers to the process whereby native speakers gain full and automatic command of the inflectional and derivational machinery of their mother tongue. Despite language diversity, evidence shows that morphological acquisition follows a shared path in development in evolving from semantically and structurally simplex and non-productive to more complex and productive. The emergence and consolidation of the central morphological systems in a language typically take place between the ages of two and six years, while mature command of all systems and subsystems can take up to 10 more years, and is mediated by the consolidation of literacy skills. Morphological learning in both inflection and derivation is always interwoven with lexical growth, and derivational acquisition is highly dependent on the development of a large and coherent lexicon. Three critical factors platform the acquisition of morphology. One factor is the input patterns in the ambient language, including various types of frequency. Input provides the context for children to pay attention to morphological markers as meaningful cues to caregivers’ intentions in interactive sociopragmatic settings of joint attention. A second factor is language typology, given that languages differ in the amount of word-internal information they package in words. The “typological impact” in morphology directs children to the ways pertinent conceptual and structural information is encoded in morphological structures. It is thus responsible for great differences among languages in the timing and pace of learning morphological categories such as passive verbs. Finally, development itself is a central mechanism that drives morphological acquisition from emergence to productivity in three senses: as the filtering device that enables the break into the morphological system, in providing the span of time necessary for the consolidation of morphological systems in children, and in hosting the cognitive changes that usher in mature morphological systems in both speech and writing in adolescents and adults.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
In a special focus-predicate concord construction (kakari musubi), specific focus particles called kakari joshi correlate with particular predicate conjugational endings, or musubi, other than regular finite forms, creating special illocutionary effects such as emphatic assertion or question. In Old Japanese (OJ), a particle ka, s(/z)ö, ya, or namu triggers an adnominal ending, while kösö calls for a realis ending. In Old Okinawan (OOk), ga or du prompts an adnominal ending, while sɨ associates with realis endings. Kakari musubi existed in proto-Japonic but died out in the Japanese branch; however, it is still preserved in its sister branch, Ryukyuan, in the Okinawan language.
This concord phenomenon, observed in only a few languages of the world, presents diverse issues concerning its evolution from origin to demise, the functional and semantic differences of its kakari particles (e.g., question-forming OJ ka vs. ya), and positional (sentence-medial vs. sentence-final) contrast. Furthermore, kakari musubi bears relevance to syntactic constructions such as clefts and nominalizations. Last, some kakari particles stemming from demonstratives offer worthy data for theory construction in grammaticalization or iconicity. Because of its far-reaching relevance, the construction has garnered attention from both formal and functional schools of linguistics.
Daniel Aalto, Jarmo Malinen, and Martti Vainio
Formant frequencies are the positions of the local maxima of the power spectral envelope of a sound signal. They arise from acoustic resonances of the vocal tract air column, and they provide substantial information about both consonants and vowels. In running speech, formants are crucial in signaling the movements with respect to place of articulation. Formants are normally defined as accumulations of acoustic energy estimated from the spectral envelope of a signal. However, not all such peaks can be related to resonances in the vocal tract, as they can be caused by the acoustic properties of the environment outside the vocal tract, and sometimes resonances are not seen in the spectrum. Such formants are called spurious and latent, respectively. By analogy, spectral maxima of synthesized speech are called formants, although they arise from a digital filter. Conversely, speech processing algorithms can detect formants in natural or synthetic speech by modeling its power spectral envelope using a digital filter. Such detection is most successful for male speech with a low fundamental frequency where many harmonic overtones excite each of the vocal tract resonances that lie at higher frequencies. For the same reason, reliable formant detection from females with high pitch or children’s speech is inherently difficult, and many algorithms fail to faithfully detect the formants corresponding to the lowest vocal tract resonant frequencies.
This is an advance summary of a forthcoming article in the Oxford Research Encyclopedia of Linguistics. Please check back later for the full article.
French-Based Creole Languages (FBCLs) may be characterized as a group by one historical and two linguistic properties. Their shared historical feature is that they arose between the 16th and 19th centuries as vehicular (hence oral) languages in French colonies, through language contact between oral varieties of French spoken by the colonists, and typologically and genetically diverse languages spoken by imported slaves—or imported workers or the local people in the case of Tayo, which emerged in the 19th century after the abolition of slavery and whose status as an FBCL is controversial. The linguistic features characterizing FBCLs are (1) that their lexicon is derived from French while their grammar (phonology and morphosyntax) is both reminiscent of, and different from, that of known varieties of spoken, nonstandard, dialectal French; and (2), that they stand as first languages (L1s), namely, they are acquired by children through the natural process of language acquisition and are used for all-purpose communication—as opposed to pidgins, a type of contact languages only used as vehicular L2s for specific-interaction purposes (e.g., trade).
FBCLs thus defined currently include on the American continent: Gwiyané/Guyanais (in French Guyana) and Karipuna Creole (Brazil, near the French-Guyana border); Lwizyané/Louisianais (on the decrease), in Louisiana, USA; in the Caribbean: Ayisyen/Haitian (in the independent Republic of Haiti); Senlisyen/Saint-Lucian (in the state of Sainte-Lucie), and the creoles spoken in the French-controlled territories of Martinique, Guadeloupe, Dominique, Saint-Barthélémy, and the northern part of Saint-Martin; in the Indian Ocean, off the shores of Eastern Africa: Morisyen/Mauritian (in Mauritius), Seselwa/Seychellois (in the Seychelles), Rodrigé/Rodriguais (in the Rodrigues islands, controlled by Mauritius), Réyinyoné/Réunionnais (in the island of Réunion, a French-controlled territory); and in Southern New Caledonia: Tayo.
Beyond the shared defining features proposed above, there is much variation among FBCLs with respect to the places, periods, and historical conditions of their emergence; the relevant contact languages involved in their development; and their resulting grammatical properties.
Holger Diessel and Martin Hilpert
Until recently, theoretical linguists have paid little attention to the frequency of linguistic elements in grammar and grammatical development. It is a standard assumption of (most) grammatical theories that the study of grammar (or competence) must be separated from the study of language use (or performance). However, this view of language has been called into question by various strands of research that have emphasized the importance of frequency for the analysis of linguistic structure. In this research, linguistic structure is often characterized as an emergent phenomenon shaped by general cognitive processes such as analogy, categorization, and automatization, which are crucially influenced by frequency of occurrence.
There are many different ways in which frequency affects the processing and development of linguistic structure. Historical linguists have shown that frequent strings of linguistic elements are prone to undergo phonetic reduction and coalescence, and that frequent expressions and constructions are more resistant to structure mapping and analogical leveling than infrequent ones. Cognitive linguists have argued that the organization of constituent structure and embedding is based on the language users’ experience with linguistic sequences, and that the productivity of grammatical schemas or rules is determined by the combined effect of frequency and similarity. Child language researchers have demonstrated that frequency of occurrence plays an important role in the segmentation of the speech stream and the acquisition of syntactic categories, and that the statistical properties of the ambient language are much more regular than commonly assumed. And finally, psycholinguists have shown that structural ambiguities in sentence processing can often be resolved by lexical and structural frequencies, and that speakers’ choices between alternative constructions in language production are related to their experience with particular linguistic forms and meanings. Taken together, this research suggests that our knowledge of grammar is grounded in experience.
The standard observation is that complementizers corresponding to English that involve the illocutionary force of the clause, but the situation is not that simple, as factivity and modality may come into play, too. Complementizers are cross-linguistically systematically morpho-phonologically identical to other categories like nouns, verbs, and adpositions (that is, prepositions and post-positions). Recently there have been attempts to account for the formal identity of complementizers with other categories by decomposing the complementizer morpheme into smaller pieces. New ways of thinking about function words like complementizers and (some) prepositions involve digging into their internal structure(s) through determining the presence or absence of structural homogeneity within and across languages or by taking a nanosyntactic approach to cross-category syncretism.
Game theory provides formal means of representing and explaining action choices in social decision situations where the choices of one participant depend on the choices of another. Game theoretic pragmatics approaches language production and interpretation as a game in this sense. Patterns in language use are explained as optimal, rational, or at least nearly optimal or rational solutions to a communication problem. Three intimately related perspectives on game theoretic pragmatics are sketched here: (i) the evolutionary perspective explains language use as the outcome of some optimization process, (ii) the rationalistic perspective pictures language use as a form of rational decision-making, and (iii) the probabilistic reasoning perspective considers specifically speakers’ and listeners’ beliefs about each other. There are clear commonalities behind these three perspectives, and they may in practice blend into each other.
At the heart of game theoretic pragmatics lies the idea that speaker and listener behavior, when it comes to using a language with a given semantic meaning, are attuned to each other. By focusing on the evolutionary or rationalistic perspective, we can then give a functional account of general patterns in our pragmatic language use. The probabilistic reasoning perspective invites modeling actual speaker and listener behavior, for example, as it shows in quantitative aspects of experimental data.
Gender is a grammatical feature, in a family with person, number, and case. In the languages that have grammatical gender—according to a representative typological sample, almost half of the languages in the world—it is a property that separates nouns into classes. These classes are often meaningful and often linked to biological sex, which is why many languages are said to have a “masculine” and a “feminine” gender. A typical example is Italian, which has masculine words for male persons (il bambino “the.
Across the languages of the world, gender systems vary widely. They differ in the number of classes, in the underlying assignment rules, and in how and where gender is marked. Since agreement is a definitional property, gender is generally absent in isolating languages as well as in young languages with little bound morphology, including sign languages. Therefore, gender is considered a mature phenomenon in language.
Gender interacts in various ways with other grammatical features. For example, it may be limited to the singular number or the third person, and it may be crosscut by case distinctions. These and other interrelations can complicate the task of figuring out a gender system in first or second language acquisition. Yet, children master gender early, making use of a broad variety of cues. By contrast, gender is famously difficult for second-language learners. This is especially true for adults and for learners whose first language does not have a gender system. Nevertheless, tests show that even for this group, native-like competence is possible to attain.
Different methods exist for classifying languages, depending on whether the task is to work out the relations among languages already known to be related—internal language classification—or whether the task is to establish that certain languages are related—external language classification.
The comparative method in historical linguistics, developed during the latter part of the 19th century, represents one method for internal language classification; lexicostatistics, developed during the 1950s, represents another. Elements of lexicostatistics have been transformed and carried over into modern computational linguistic phylogenetics, and currently efforts are also being made to automate the comparative method. Recent years have seen rapid progress in the development of methods, tools, and resources for language classification. For instance, computational phylogenetic algorithms and software have made it possible to handle the classification of many languages using explicit models of language change, and data have been gathered for two thirds of the world’s language, allowing for rapid, exploratory classifications. There are also many open questions and venues for future research, for instance: What are the real-world counterparts to the nodes in a family tree structure? How can shortcomings in the traditional method of comparative historical linguistics be overcome? How can the understanding of the results that computational linguistic phylogenetics have to offer be improved?
External language classification, a notoriously difficult task, has also benefitted from the advent of computational power. While, in the past, the simultaneous comparison of many languages for the purpose of discovering deep genealogical links was carried out in a haphazard fashion, leaving too much room for the effect of chance similarities to kick in, this sort of activity can now be done in a systematic, objective way on an unprecedented scale. The ways of producing final, convincing evidence for a deep genealogical relation, however, have not changed much. There is some room for improvement in this area, but even more room for improvement in the way that proposals for long-distance relations are evaluated.
Knut Tarald Taraldsen
This article presents different types of generative grammar that can be used as models of natural languages focusing on a small subset of all the systems that have been devised. The central idea behind generative grammar may be rendered in the words of Richard Montague: “I reject the contention that an important theoretical difference exists between formal and natural languages” (“Universal Grammar,” Theoria, 36 , 373–398).
The German sinologist and general linguist Georg von der Gabelentz (1840–1893) occupies an interesting place at the intersection of several streams of linguistic scholarship at the end of the 19th century. As Professor of East Asian languages at the University of Leipzig from 1878 to 1889 and then Professor for Sinology and General Linguistics at the University of Berlin from 1889 until his death, Gabelentz was present at some of the main centers of linguistics at the time. He was, however, generally critical of mainstream historical-comparative linguistics as propagated by the neogrammarians, and instead emphasized approaches to language inspired by a line of researchers including Wilhelm von Humboldt (1767–1835), H. Steinthal (1823–1899), and his own father, Hans Conon von der Gabelentz (1807–1874).
Today Gabelentz is chiefly remembered for several theoretical and methodological innovations which continue to play a role in linguistics. Most significant among these are his contributions to cross-linguistic syntactic comparison and typology, grammar-writing, and grammaticalization. His earliest linguistic work emphasized the importance of syntax as a core part of grammar and sought to establish a framework for the cross-linguistic description of word order, as had already been attempted for morphology by other scholars. The importance he attached to syntax was motivated by his engagement with Classical Chinese, a language almost devoid of morphology and highly reliant on syntax. In describing this language in his 1881 Chinesische Grammatik, Gabelentz elaborated and implemented the complementary “analytic” and “synthetic” systems of grammar, an approach to grammar-writing that continues to serve as a point of reference up to the present day. In his summary of contemporary thought on the nature of grammatical change in language, he became one of the first linguists to formulate the principles of grammaticalization in essentially the form that this phenomenon is studied today, although he did not use the current term. One key term of modern linguistics that he did employ, however, is “typology,” a term that he in fact coined. Gabelentz’s typology was a development on various contemporary strands of thought, including his own comparative syntax, and is widely acknowledged as a direct precursor of the present-day field.
Gabelentz is a significant transitional figure from the 19th to the 20th century. On the one hand, his work seems very modern. Beyond his contributions to grammaticalization avant la lettre and his christening of typology, his conception of language prefigures the structuralist revolution of the early 20th century in important respects. On the other hand, he continues to entertain several preoccupations of the 19th century—in particular the judgment of the relative value of different languages—which were progressively banished from linguistics in the first decades of the 20th century.
Linguistic change not only affects the lexicon and the phonology of words, it also operates on the grammar of a language. In this context, grammaticalization is concerned with the development of lexical items into markers of grammatical categories or, more generally, with the development of markers used for procedural cueing of abstract relationships out of linguistic items with concrete referential meaning. A well-known example is the English verb go in its function of a future marker, as in She is going to visit her friend. Phenomena like these are very frequent across the world’s languages and across many different domains of grammatical categories. In the last 50 years, research on grammaticalization has come up with a plethora of (a) generalizations, (b) models of how grammaticalization works, and (c) methodological refinements.
On (a): Processes of grammaticalization develop gradually, step by step, and the sequence of the individual stages follows certain clines as they have been generalized from cross-linguistic comparison (unidirectionality). Even though there are counterexamples that go against the directionality of various clines, their number seems smaller than assumed in the late 1990s.
On (b): Models or scenarios of grammaticalization integrate various factors. Depending on the theoretical background, grammaticalization and its results are motivated either by the competing motivations of economy vs. iconicity/explicitness in functional typology or by a change from movement to merger in the minimalist program. Pragmatic inference is of central importance for initiating processes of grammaticalization (and maybe also at later stages), and it activates mechanisms like reanalysis and analogy, whose status is controversial in the literature. Finally, grammaticalization does not only work within individual languages/varieties, it also operates across languages. In situations of contact, the existence of a certain grammatical category may induce grammaticalization in another language.
On (c): Even though it is hard to measure degrees of grammaticalization in terms of absolute and exact figures, it is possible to determine relative degrees of grammaticalization in terms of the autonomy of linguistic signs. Moreover, more recent research has come up with criteria for distinguishing grammaticalization and lexicalization (defined as the loss of productivity, transparency, and/or compositionality of former productive, transparent, and compositional structures).
In spite of these findings, there are still quite a number of questions that need further research. Two questions to be discussed address basic issues concerning the overall properties of grammaticalization. (1) What is the relation between constructions and grammaticalization? In the more traditional view, constructions are seen as the syntactic framework within which linguistic items are grammaticalized. In more recent approaches based on construction grammar, constructions are defined as combinations of form and meaning. Thus, grammaticalization can be seen in the light of constructionalization, i.e., the creation of new combinations of form and meaning. Even though constructionalization covers many apects of grammaticalization, it does not exhaustively cover the domain of grammaticalization. (2) Is grammaticalization cross-linguistically homogeneous, or is there a certain range of variation? There is evidence from East and mainland Southeast Asia that there is cross-linguistic variation to some extent.