You are looking at 121-140 of 279 articles
The Kiowa-Tanoan family is a small group of Native American languages of the Plains and pueblo Southwest. It comprises Kiowa, of the eponymous Plains tribe, and the pueblo-based Tanoan languages, Jemez (Towa), Tewa, and Northern and Southern Tiwa. These free-word-order languages display a number of typologically unusual characteristics that have rightly attracted attention within a range of subdisciplines and theories.
One word of Taos (my construction based on Kontak and Kunkel’s work) illustrates. In tóm-múlu-wia ‘I gave him/her a drum,’ the verb wia ‘gave’ obligatorily incorporates its object, múlu ‘drum.’ The agreement prefix tóm encodes not only object number, but identities of agent and recipient as first and third singular, respectively, and this all in a single syllable. Moreover, the object number here is not singular, but “inverse”: singular for some nouns, plural for others (tóm-músi-wia only has the plural object reading ‘I gave him/her cats’).
This article presents a comparative overview of the three areas just illustrated: from morphosemantics, inverse marking and noun class; from morphosyntax, super-rich fusional agreement; and from syntax, incorporation. The second of these also touches on aspects of morphophonology, the family’s three-tone system and its unusually heavy grammatical burden, and on further syntax, obligatory passives. Together, these provide a wide window on the grammatical wealth of this fascinating family.
Young-mee Yu Cho
Due to a number of unusual and interesting properties, Korean phonetics and phonology have been generating productive discussion within modern linguistic theories, starting from structuralism, moving to classical generative grammar, and more recently to post-generative frameworks of Autosegmental Theory, Government Phonology, Optimality Theory, and others. In addition, it has been discovered that a description of important issues of phonology cannot be properly made without referring to the interface between phonetics and phonology on the one hand, and phonology and morpho-syntax on the other. Some phonological issues from Standard Korean are still under debate and will likely be of value in helping to elucidate universal phonological properties with regard to phonation contrast, vowel and consonant inventories, consonantal markedness, and the motivation for prosodic organization in the lexicon.
James Hye Suk Yoon
The syntax of Korean is characterized by several signature properties. One signature property is head-finality. Word order variations and restrictions obey head-finality. Korean also possesses wh in-situ as well as internally headed relative clauses, as is typical of a head-final language. Another major signature property is dependent-marking. Korean has systematic case-marking on nominal dependents and very little, if any, head-marking. Case-marking and related issues, such as multiple case constructions, case alternations, case stacking, case-marker ellipsis, and case-marking on adjuncts, are front and center properties of Korean syntax as viewed from the dependent-marking perspective. Research on these aspects of Korean has contributed to the theoretical understanding of case and grammatical relations in linguistic theory. Korean is also characterized by agglutinative morphosyntax. Many issues in Korean syntax straddle the morphology-syntax boundary. Korean morphosyntax constitutes a fertile testing ground for ongoing debates about the relationship between morphology and syntax in domains such as coordination, deverbal nominalizations (mixed category constructions), copula, and other denominal constructions. Head-finality and agglutinative morphosyntax intersect in domains such as complex/serial verb and auxiliary verb constructions. Negation, which is a type of auxiliary verb construction, and the related phenomena of negative polarity licensing, offer important evidence for crosslinguistic understanding of these phenomena. Finally, there is an aspect of Korean syntax that reflects areal contact. Lexical and grammatical borrowing, topic prominence, pervasive occurrence of null arguments and ellipsis, as well as a complex system of anaphoric expressions, resulted from sustained contact with neighboring Sino-Tibetan languages.
Kra-Dai, also known as Tai–Kadai, Daic, and Kadai, is a family of diverse languages found in southern China, northeast India, and Southeast Asia. The number of these languages is estimated to be close to a hundred, with approximately 100 million speakers all over the world. As the name itself suggests, Kra-Dai is made up of two major groups, Kra and Dai. The former refers to a number of lesser-known languages, some of which have only a few hundred fluent speakers or even less. The latter (also known as Tai, or Kam-Tai) is well established, and comprises the best-known members of the family, Thai and Lao, the national languages of Thailand and Laos respectively, whose speakers account for over half of the Kra-Dai population.
The ultimate genetic affiliation of Kra-Dai remains controversial, although a consensus among western scholars holds that it belongs under Austronesian. The majority of Kra-Dai languages have no writing systems of their own, particularly Kra. Languages with writing systems include Thai, Lao, Sipsongpanna Dai, and Tai Lue. These use Indic-based scripts. Others use Chinese character-based scripts, such as the Zhuang and Kam-Sui in southern China and surrounding regions. The government introduced Romanized scripts in the 1950s for the Zhuang and the Kam-Sui languages. Almost every group within Kra-Dai has a rich oral history tradition.
The languages are typically tonal, isolating, and analytic, lacking in inflectional morphology, with no distinction for number and gender. A significant number of basic vocabulary items are monosyllabic, but bisyllabic and multisyllabic compounds also abound. There are morphological processes in which etymologically related words manifest themselves in groups through tonal, initial, or vowel alternations. Reduplication is a salient word formation mechanism. In syntax, the Kra-Dai languages can be said to have basic SVO word order. They possess a rich system of noun classifiers. Other features include verb serialization without overt marking to indicate grammatical relations. A number of lexical items (mostly verbs) may function as grammatical morphemes in syntactic operations. Temporal and aspectual meanings are expressed through tense-aspect markers typically derived from verbs, while mood and modality are conveyed via a rich array of discourse particles.
During the period from the fall of the Roman empire in the late 5th century to the beginning of the European Renaissance in the 14th century, the development of linguistic thought in Europe was characterized by the enthusiastic study of grammatical works by Classical and Late Antique authors, as well as by the adaptation of these works to suit a Christian framework. The discipline of grammatica, viewed as the cornerstone of the ideal liberal arts education and as a key to the wider realm of textual culture, was understood to encompass both the systematic principles for speaking and writing correctly and the science of interpreting the poets and other writers. The writings of Donatus and Priscian were among the most popular and well-known works of the grammatical curriculum, and were the subject of numerous commentaries throughout the medieval period. Although Latin persisted as the predominant medium of grammatical discourse, there is also evidence from as early as the 8th century for the enthusiastic study of vernacular languages and for the composition of vernacular-medium grammars, including sources pertaining to Anglo-Saxon, Irish, Old Norse, and Welsh. The study of language in the later medieval period is marked by experimentation with the form and layout of grammatical texts, including the composition of textbooks in verse form. This period also saw a renewed interest in the application of philosophical ideas to grammar, inspired in part by the availability of a wider corpus of Greek sources than had previously been unknown to western European scholars, such as Aristotle’s Physics, Metaphysics, Ethics, and De Anime. A further consequence of the renewed interest in the logical and metaphysical works of Aristotle during the later Middle Ages is the composition of so-called ‘speculative grammars’ written by scholars commonly referred to as the ‘Modistae’, in which the grammatical description of Latin formulated by Priscian and Donatus was integrated with the system of scholastic philosophy that was at its height from the beginning of the 13th to the middle of the 14th century.
Traditional Chinese linguistics grew out of two distinct interests in language: the philosophical reflection on things and their names, and the practical concern for literacy education and the correct understanding of classical works. The former is most typically found in the teachings of such pre-Qin masters as Confucius, Mozi, and Gongsun Long, who lived between the 6th and 3rd centuries
The picture just presented, in which Chinese philosophy and philology are combined to form a seemingly autonomous tradition, is complicated, however, by the fact that the Indic linguistic tradition started to influence the Chinese in the 2nd century
Chinese, with its linguistic tradition, had a profound impact in ancient East Asia. Not only did traditional studies of Japanese, Tangut, and other languages show significant Chinese influence, under which not the least achievement was the invention of the earliest writing systems for these languages, but many scholars from Japan and Korea actually took an active part in the study of Chinese as well, so that the Chinese linguistic tradition would itself be incomplete without the materials and findings these non-Chinese scholars have contributed. On the other hand, some of these scholars, most notably Motoori Norinaga and Fujitani Nariakira in Japan, were able to free themselves from the character-centered Chinese routine and develop rather original linguistic theories.
Judith T. Irvine
In the indigenous sociolinguistic systems of West Africa, an important way of expressing—and creating—social hierarchy in interaction is through intermediaries: third parties, through whom messages are relayed. The forms of mediation vary by region, by the scale of the social hierarchy, and by the ways hierarchy is locally understood. In larger-scale systems where hierarchy is elaborate, the interacting parties include a high-status person, a mediator who ranks lower, and a third person or group—perhaps another dignitary, but potentially anyone. In smaller-scale, more egalitarian societies, the (putative) interactants could include an authoritative spirit represented by a mask, the mask’s bearer, a “translator,” and an audience. In all these systems, mediated interactions may also involve distinctive registers or vocalizations. Meanwhile, the interactional structure and its characteristic ways of speaking offer tropes and resources for expressing politeness in everyday talk.
In the traditions connected with precolonial kingdoms and empires, professional praise orators deliver eulogistic performances for their higher-status patrons. This role is understood as transmission—transmitting a message from the past, or from a group, or from another dignitary—more than as creating a composition from whole cloth. The transmitter amplifies and embellishes the message; he or she does not originate it. In addition to their formal public performances, these orators serve as interpreters and intermediaries between their patrons and their patrons’ visitors. Speech to the patron is relayed through the interpreter, even if the original speaker and the patron are in the same room. Social hierarchy is thus expressed as interactional distance.
In the Sahel, these social hierarchies involve a division of labor, including communicative labor, in a complex system of ranked castes and orders. The praise orators, as professional experts in the arts of language and communication, are a separate, low-ranking category (known by the French term griot). Some features of griot performance style, and the contrasting—sometimes even disfluent—verbal conduct of high-ranking aristocrats, carry over into speech registers used by persons of any social category in situations evoking hierarchy (petitioning, for example). In indigenous state systems further south, professional orators are not a separate caste, and chiefs are also supposed to have verbal skills, although still using intermediaries. Special honorific registers, such as the esoteric Akan “palace speech,” are used in the chief’s court. Some politeness forms in everyday Akan usage today echo these practices.
An example of a small-scale society is the Bedik (Senegal-Guinea border), among whom masked dancers serve as the visible and auditory representation of spirit beings. The mask spirits, whose speech and conduct contrasts with their bearers’ ordinary behavior, require “translators” to relay their messages to addressees. This too is mediated communication, involving a multi-party interactional structure as well as distinctive vocalizations.
Linguistic repertoires in the Sahel have long included Arabic, and Islamic learning is another source of high status, coexisting with other traditional sources and sharing some interactional patterns. The European conquest brought European languages to the top of West African linguistic hierarchies, which have remained largely in place since independence.
As might be expected from the difficulty of traversing it, the Sahara Desert has been a fairly effective barrier to direct contact between its two edges; trans-Saharan language contact is limited to the borrowing of non-core vocabulary, minimal from south to north and mostly mediated by education from north to south. Its own inhabitants, however, are necessarily accustomed to travelling desert spaces, and contact between languages within the Sahara has often accordingly had a much greater impact. Several peripheral Arabic varieties of the Sahara retain morphology as well as vocabulary from the languages spoken by their speakers’ ancestors, in particular Berber in the southwest and Beja in the southeast; the same is true of at least one Saharan Hausa variety. The Berber languages of the northern Sahara have in turn been deeply affected by centuries of bilingualism in Arabic, borrowing core vocabulary and some aspects of morphology and syntax. The Northern Songhay languages of the central Sahara have been even more profoundly affected by a history of multilingualism and language shift involving Tuareg, Songhay, Arabic, and other Berber languages, much of which remains to be unraveled. These languages have borrowed so extensively that they retain barely a few hundred core words of Songhay vocabulary; those loans have not only introduced new morphology but in some cases replaced old morphology entirely. In the southeast, the spread of Arabic westward from the Nile Valley has created a spectrum of varieties with varying degrees of local influence; the Saharan ones remain almost entirely undescribed. Much work remains to be done throughout the region, not only on identifying and analyzing contact effects but even simply on describing the languages its inhabitants speak.
As elsewhere in the world, languages in Africa are endangered. The estimates for language loss on a world scale likely hold for Africa as well. Although the particular group of factors at work in Africa may be unique, they come from a well-established inventory familiar elsewhere. The forces reducing African language diversity come from the combination of a set of macro socioeconomic factors and historical events, such as colonization and globalization, coupled with local factors such as military conquest and misguided government policies. Simple demographic factors, such as number of speakers, are also important: the less widely spoken languages are more severely threatened than are those spoken more widely. The shift from African languages is to both European languages and the more widely spoken languages on the continent. Shifts also occur to localized or appropriated versions of the two. Climatic factors, most notably global warming, have played and will continue to play a role as well; the correlation between biological and linguistic diversity has often been remarked. For example, with the growth of plantation economies and the destruction of rain forests, there is a concomitant reduction in linguistic diversity.
Cecilio Garriga Escribano
The language of chemistry has seldom been the object of study by linguists, who tend to prioritize literary works. Nevertheless, in recent years its study has developed at a different pace for each of the Romance languages. It is therefore important to describe the current state of research separately for French, Spanish, Italian, Portuguese, Romanian, and Catalan. The work of historians of science, who have always dedicated particular attention to the language of chemistry, is particularly pertinent to this purpose.
Toward the end of the 18th century, French chemists spearheaded a terminological revolution: traditional terms used in alchemy were replaced by a well-structured, systematic nomenclature that was quickly adopted by the scientific community, mainly through the translation of French chemical texts, many of which were pedagogical in nature. It is important to trace the dissemination process of new chemical nomenclature in each country and in each language, since it was not uniform.
This new nomenclature is firmly based on the classical languages, particularly Greek, and it adopts a broad range of suffixes and prefixes for systematization. During the 19th century, this system steadily consolidated as the field of chemistry developed until a standardized international nomenclature was established.
From a lexicographical standpoint, the treatment of chemical terms in both general and specialized dictionaries deserves attention. Traditional lexicography has mistakenly classified many chemical terms as Hellenisms, while from the early 21st century onward they have been recognized as Gallicisms thanks to research carried out by historians of scientific language.
Finally, the procedures the Romance languages follow to coin chemical terms—both to name elements and chemicals and to express chemical combinations by means of word formation processes—must be taken into account.
The expression language of the economy and business refers to an extremely heterogeneous linguistic reality. For some, it denotes all text and talk produced by economic agents in the pursuit of economic activity, for others the language used to write or talk about the economy or business, that is, the language of the economic sciences and the media. Both the economy and business contain a myriad of subdomains, each with its own linguistic peculiarities. Language use also differs quite substantially between the shop floor and academic articles dealing with it. Last but not least, language is itself a highly articulate entity, composed of sounds, words, concepts, etc., which are taken care of by a considerable number of linguistic disciplines and theories. As a consequence, this research landscape offers a very varied picture.
The state of research is also highly diverse as far as the Romance languages are concerned. The bulk of relevant publications concerns French, followed at a certain distance by Spanish and Italian, while Romanian, Catalan, and Portuguese look like poor relations. As far as the dialects are concerned, only those of some Italian cities that held a central position in medieval trade, like Venice, Florence, or Genoa, have given rise to relevant studies. As far as the metalanguage used in research is concerned, the most striking feature is the overwhelming preponderance of German and the almost complete absence of English. The insignificant role of English must probably be attributed to the fact that the study of foreign business languages in the Anglo-Saxon countries is close to nonexistent. Why study foreign business languages if one own’s language is the lingua franca of today’s business world? Scholars from the Romance countries, of course, generally write in their mother tongue, but linguistic publications concerning the economic and business domain are relatively scarce there. The heterogeneity of the metalanguages used certainly hinders the constitution of a close-knit research community.
Aidan Pine and Mark Turin
The world is home to an extraordinary level of linguistic diversity, with roughly 7,000 languages currently spoken and signed. Yet this diversity is highly unstable and is being rapidly eroded through a series of complex and interrelated processes that result in or lead to language loss. The combination of monolingualism and networks of global trade languages that are increasingly technologized have led to over half of the world’s population speaking one of only 13 languages. Such linguistic homogenization leaves in its wake a linguistic landscape that is increasingly endangered.
A wide range of factors contribute to language loss and attrition. While some—such as natural disasters—are unique to particular language communities and specific geographical regions, many have similar origins and are common across endangered language communities around the globe. The harmful legacy of colonization and the enduring impact of disenfranchising policies relating to Indigenous and minority languages are at the heart of language attrition from New Zealand to Hawai’i, and from Canada to Nepal.
Language loss does not occur in isolation, nor is it inevitable or in any way “natural.” The process also has wide-ranging social and economic repercussions for the language communities in question. Language is so heavily intertwined with cultural knowledge and political identity that speech forms often serve as meaningful indicators of a community’s vitality and social well-being. More than ever before, there are vigorous and collaborative efforts underway to reverse the trend of language loss and to reclaim and revitalize endangered languages. Such approaches vary significantly, from making use of digital technologies in order to engage individual and younger learners to community-oriented language nests and immersion programs. Drawing on diverse techniques and communities, the question of measuring the success of language revitalization programs has driven research forward in the areas of statistical assessments of linguistic diversity, endangerment, and vulnerability. Current efforts are re-evaluating the established triad of documentation-conservation-revitalization in favor of more unified, holistic, and community-led approaches.
Victor A. Friedman
The Balkan languages were the first group of languages whose similarities were explained in modern linguistic terms as a result of language contact rather than as a result of descent from a common ancestor. Nikolai Trubetzkoy coined the term Sprachbund ‘linguistic league’ (as opposed to Sprachfamilie ‘language family’) to describe this relationship. Balkan linguistics, as both a subset of and precursor to contact linguistics, is, at its base, an historical linguistic discipline. It seeks to explain similarities among the relevant languages as the result of diffusion rather than of either transmission or of putative universal, typological properties of human language (which latter assumes parallel developments whose causation is ahistorical, i.e., unconnected with either contact or ancestry). The relevant languages are, with the exception of Turkic, all part of the Indo-European language family, but they belong to five distinct groups that are known to have been separated for a significant length of time (presumably millennia). Moreover, for four out of five Indo-European groups as well as for Turkic, there exists documentation that goes back more than a millennium, and in some cases several millennia. The Balkan languages are thus the oldest example of a well-documented and still living Sprachbund.
The primary questions that Balkan linguistics seeks to answer are these: What are the results of language contact in the Balkan languages, and how did they come about? The Balkan languages are traditionally defined as Albanian, Modern Greek, Balkan Romance (Romanian, Aromanian, and Meglenoromanian), and Balkan Slavic (Bulgarian, Macedonian, and the southernmost dialects of the former Serbo-Croatian). In recent decades, it has been recognized that the relevant dialects of Romani, Judezmo, and Turkish and Gagauz also participate in at least some of the convergent processes that are taken as definitive of the Balkan linguistic league. While the language family is defined by regular sound correspondences, which in turn help define shared morphology and a core lexicon, the Balkan linguistic league is defined principally by shared morphosyntactic developments and a shared lexicon of borrowings often called “cultural.” In the Balkan linguistic league, phonological developments are sometimes shared among different languages at the dialectal level, but there are no such features that characterize the Balkan languages as a group. Just as in the language family not every diagnostic item is represented in every branch, so, too, in the Balkan linguistic league not every feature is equally represented in all languages and dialects.
Among the most characteristic morphosyntactic features are the following: (1) replacement of infinitives by analytic subjunctives, (2) the use of a particle derived from etymological ‘want’ to mark the future, (3) replacement of synthetic gradation of adjectives with analytic constructions, (4) replacement of conditionals by anterior futures, (5) resumptive clitic pronouns for certain direct and indirect objects, (6) various simplifications in the declensional system, (7) postposed definite articles (for Balkan Slavic, Balkan Romance, and Albanian), (8) grammaticalized evidentials (Balkan Slavic, Albanian, Turkic, and to some extent Balkan Romance and Romani). While some of these convergences began in the ancient or medieval periods, the Balkan linguistic league took its definitive modern shape during the centuries of the Ottoman Empire (14th to early 20th centuries).
William R. Leben
About 7,000 languages are spoken around the world today. The actual number depends on where the line is drawn between language and dialect—an arbitrary decision, because languages are always in flux. But specialists applying a reasonably uniform criterion across the globe count well over 2,000 languages in Asia and Africa, while Europe has just shy of 300. In between are the Pacific region, with over 1,300 languages, and the Americas, with just over 1,000. Languages spoken natively by over a million speakers number around 250, but the vast majority have very few speakers. Something like half are thought likely to disappear over the next few decades, as speakers of endangered languages turn to more widely spoken ones.
The languages of the world are grouped into perhaps 430 language families, based on their origin, as determined by comparing similarities among languages and deducing how they evolved from earlier ones. As with languages, there’s quite a lot of disagreement about the number of language families, reflecting our meager knowledge of many present-day languages and even sparser knowledge of their history. The figure 430 comes from Glottolog.org, which actually lists them all. While the world’s language families may well go back to a smaller number of original languages, even to a single mother tongue, scholars disagree on how far back current methods permit us to trace the history of languages.
While it is normal for languages to borrow from other languages, occasionally a totally new language is created by mixing elements of two distinct languages to such a degree that we would not want to identify one of the source languages as the mother tongue. This is what led to the development of Media Lengua, a language of Ecuador formed through contact among speakers of Spanish and speakers of Quechua. In this language, practically all the word stems are from Spanish, while all of the endings are from Quechua. Just a handful of languages have come into being in this way, but less extreme forms of language mixture have resulted in over a hundred pidgins and creoles currently spoken in many parts of the world. Most arose during Europe’s colonial era, when European colonists used their language to communicate with local inhabitants, who in turn blended vocabulary from the European language with grammar largely from their native language.
Also among the languages of the world are about 300 sign languages used mainly in communicating among and with the deaf. The structure of sign languages typically has little historical connection to the structure of nearby spoken languages.
Some languages have been constructed expressly, often by a single individual, to meet communication demands among speakers with no common language. Esperanto, designed to serve as a universal language and used as a second language by some two million, according to some estimates, is the prime example, but it is only one among several hundred would-be international auxiliary languages.
This essay surveys the languages of the world continent by continent, ending with descriptions of sign languages and of pidgins and creoles. A set of references grouped by section appears at the very end. The main source for data on language classification, numbers of languages, and speakers is the 19th edition of Ethnologue (see Resources), except where a different source is cited.
Phonological learnability deals with the formal properties of phonological languages and grammars, which are combined with algorithms that attempt to learn the language-specific aspects of those grammars. The classical learning task can be outlined as follows: Beginning at a predetermined initial state, the learner is exposed to positive evidence of legal strings and structures from the target language, and its goal is to reach a predetermined end state, where the grammar will produce or accept all and only the target language’s strings and structures. In addition, a phonological learner must also acquire a set of language-specific representations for morphemes, words and so on—and in many cases, the grammar and the representations must be acquired at the same time.
Phonological learnability research seeks to determine how the architecture of the grammar, and the workings of an associated learning algorithm, influence success in completing this learning task, i.e., in reaching the end-state grammar. One basic question is about convergence: Is the learning algorithm guaranteed to converge on an end-state grammar, or will it never stabilize? Is there a class of initial states, or a kind of learning data (evidence), which can prevent a learner from converging? Next is the question of success: Assuming the algorithm will reach an end state, will it match the target? In particular, will the learner ever acquire a grammar that deems grammatical a superset of the target language’s legal outputs? How can the learner avoid such superset end-state traps? Are learning biases advantageous or even crucial to success?
In assessing phonological learnability, the analysist also has many differences between potential learning algorithms to consider. At the core of any algorithm is its update rule, meaning its method(s) of changing the current grammar on the basis of evidence. Other key aspects of an algorithm include how it is triggered to learn, how it processes and/or stores the errors that it makes, and how it responds to noise or variability in the learning data. Ultimately, the choice of algorithm is also tied to the type of phonological grammar being learned, i.e., whether the generalizations being learned are couched within rules, features, parameters, constraints, rankings, and/or weightings.
Laurie Beth Feldman and Judith F. Kroll
We summarize findings from across a range of methods, including behavioral measures of overall processing speed and accuracy, electrophysiological indices that tap into the early time course of language processing, and neural measures using structural and functional imaging. We argue that traditional claims about rigid constraints on the ability of late bilinguals to exploit the meaning and form of the morphology and morphosyntax in a second language should be revised so as to move away from all or none command of structures motivated from strict dichotomies among linguistic categories of morphology. We describe how the dynamics of morphological processing in neither monolingual or bilingual speakers is easily characterized in terms of the potential to decompose words into their constituent morphemes and that morphosyntactic processing is not easily characterized in terms of categories of structures that are learnable and those that are unlearnable by bilingual and nonnative speakers. Instead, we emphasize the high degree of variability across individuals and plasticity within individuals in their ability to successfully learn and use even subtle aspects of a second language. Further, both of the bilingual’s two languages become active when even one language is engaged, and parallel activation has consequences that shape both languages, thus their influence is not in the unidirectional manner that was traditionally assumed. We briefly discuss the nature of possible constraints and directions for future research.
Eve V. Clark
The words and word-parts children acquire at different stages offer insights into how the mental lexicon might be organized. Children first identify ‘words,’ recurring sequences of sounds, in the speech stream, attach some meaning to them, and, later, analyze such words further into parts, namely stems and affixes. These are the elements they store in memory in order to recognize them on subsequent occasions. They also serve as target models when children try to produce those words themselves. When they coin words, they make use of bare stems, combine certain stems with each other, and sometimes add affixes as well. The options they choose depend on how much they need to add to coin a new word, which familiar elements they can draw on, and how productive that option is in the language. Children’s uses of stems and affixes in coining new words also reveal that they must be relying on one representation in comprehension and a different representation in production. For comprehension, they need to store information about the acoustic properties of a word, taking into account different occasions, different speakers, and different dialects, not to mention second-language speakers. For production, they need to work out which articulatory plan to follow in order to reproduce the target word. And they take time to get their production of a word aligned with the representation they have stored for comprehension. In fact, there is a general asymmetry here, with comprehension being ahead of production for children, and also being far more extensive than production, for both children and adults. Finally, as children add more words to their repertoires, they organize and reorganize their vocabulary into semantic domains. In doing this, they make use of pragmatic directions from adults that help them link related words through a variety of semantic relations.
The Lexical Integrity Hypothesis (LIH) holds that words are syntactic atoms, implying that syntactic processes and principles do not have access to word segments. Interestingly, when this widespread “negative characterization” is turned into its positive version, a standard picture of the Morphology-Syntax borderline is obtained.
The LIH is both a fundamental principle of Morphology and a test bench for morphological theories. As a matter of fact, the LIH is problematic for both lexicalist and anti-lexicalist frameworks, which radically differ in accepting or rejecting Morphology as a component of grammar different from Syntax. Lexicalist theories predict no exceptions to LIH, contrary to fact. From anti-lexicalist theories one might expect a large set of counterexamples to this hypothesis, but the truth is that attested potential exceptions are restricted, as well as confined to very specific grammatical areas. Most of the phenomena taken to be crucial for evaluating the LIH are briefly addressed in this article: argument structure, scope, prefixes, compounds, pronouns, elliptical segments, bracketing paradoxes, and coordinated structures. It is argued that both lexicalist and anti-lexicalist positions crucially depend on the specific interpretations that their proponents are willing to attribute to the very notion of Syntax: a broad one, which basically encompasses constituent structure, binary branching, scope, and compositionality, and a narrow one, which also coverts movement, recursion, deletion, coordination, and other aspects of phrase structure. The objective differences between these conceptions of Syntax are shown to be determinant in the evaluation of LIH’s predictions.
The term lexicalization describes the addition of new open-class elements to a repository of holistically processed linguistic units. At the basis of lexicalization are word-formation processes such as affixation, compounding, or borrowing, which are a necessary precondition for lexicalization. Still, lexicalization goes beyond word formation in important respects. First, lexicalization also involves multi-word expressions and set phrases; second, it includes a range of processes that follow the coinage of a new element. These processes conjointly lead to holistic processing, that is, the cognitive treatment of a linguistic element as a unified whole. Holistic processing contrasts with analytic processing, which is the cognitive treatment of a linguistic unit as a complex whole that is composed of several parts. Lexicalization is usefully contrasted with grammaticalization, that is, the emergence of new linguistic units that fulfill grammatical functions. Finally, lexicalization is also a concept that lends itself to the study of cross-linguistic differences in the types of meaning that are lexicalized in specific domains such as, for example, motion.
The central goal of the Lexical Semantic Framework (LSF) is to characterize the meaning of simple lexemes and affixes and to show how these meanings can be integrated in the creation of complex words. LSF offers a systematic treatment of issues that figure prominently in the study of word formation, such as the polysemy question, the multiple-affix question, the zero-derivation question, and the form and meaning mismatches question.
LSF has its source in a confluence of research approaches that follow a decompositional approach to meaning and, thus, defines simple lexemes and affixes by way of a systematic representation that is achieved via a constrained formal language that enforces consistency of annotation. Lexical-semantic representations in LSF consist of two parts: the Semantic/Grammatical Skeleton and the Semantic/Pragmatic Body (henceforth ‘skeleton’ and ‘body’ respectively). The skeleton is comprised of features that are of relevance to the syntax. These features act as functions and may take arguments. Functions and arguments of a skeleton are hierarchically arranged. The body encodes all those aspects of meaning that are perceptual, cultural, and encyclopedic.
Features in LSF are used in (a) a cross-categorial, (b) an equipollent, and (c) a privative way. This means that they are used to account for the distinction between the major ontological categories, may have a binary (i.e., positive or negative) value, and may or may not form part of the skeleton of a given lexeme. In order to account for the fact that several distinct parts integrate into a single referential unit that projects its arguments to the syntax, LSF makes use of the Principle of Co-indexation. Co-indexation is a device needed in order to tie together the arguments that come with different parts of a complex word to yield only those arguments that are syntactically active.
LSF has an important impact on the study of the morphology-lexical semantics interface and provides a unitary theory of meaning in word formation.