This contribution analyzes morphologically autonomous structures within the context of the Romance languages, the family of languages which, along with Latin, has most served as an evidence base for these structures. Autonomous morphological structures are defined as an abstract representation of paradigmatic cells which form a cohesive group and reliably share exponents with each other; the forms which realize them are thus to a large extent interpredictable. In this contribution, I restrict my discussion to the most canonical type of these structures and those which have sparked the most controversy in the linguistic literature. I analyze this controversy and suggest that it is due to (a) their overlap in meaning with the term morphome, a concept which embodies an empirical claim about all morphology, and (b) the controversy surrounding what morphology actually is and what the basic units of morphological analysis and storage are. I make a distinction between abstractive and constructive models of morphology and suggest that historical tendencies within the latter encourage scholars to view morphologically autonomous structures either as not synchronically relevant or as phonologically or semantically derivable, due to their theoretical assumptions about the nature of language and the mental storage of words. These assumptions constitute the horizons of intelligibility of such models regarding the functioning of language and its governing principles, including outdated ideas of the capacity of mental storage. Unfortunately, however, the different theories furnish scholars with an expansive array of devices through which they can seemingly explain away the synchronic generalizations of the data while relegating the most recalcitrant data to the domain of memorized forms which are not relevant to the grammar.
I present evidence in favor of the psychological reality of morphologically autonomous structures in diachrony and I argue that synchronically, these structures are necessary to explain the distribution of the data and capture the fact that speakers do not memorize every inflectional form of a paradigm but rely on patterns of predictability and implicational relationships between forms. It is my suggestion that morphologically autonomous structures encourage a revaluation of the basic units of memorization and the structure of the lexicon in accordance with abstractive theories of morphology.
Article
Morphologically ‘Autonomous’ Structures in the Romance Languages
Paul O'Neill
Article
Catalan
Francisco Ordóñez
Catalan is a “medium-sized” Romance language spoken by over 10 million speakers, spread over areas of four nation states: northeastern Spain, Andorra, southern France, and the city of L’Alguer (Alghero) in Sardinia, Italy. Catalan is divided into two primary dialectal divisions, each with further subvarieties: Western Catalan (Western Catalonia, Eastern Aragon, and the Valencian Community) and Eastern Catalan (center and east of Catalonia, Balearic Islands, Rosselló, and L’Alguer).
Catalan descends from Vulgar Latin. Catalan expanded during medieval times as one of the primary vernacular languages of the Kingdom of Aragon. It largely retained its role in government and society until the War of the Spanish Succession in 1714, and since then it has been minoritized. Catalan was finally standardized at the beginning of the 20th century, although later, during the Franco dictatorship, it was banned in public spaces. The situation changed with the new Spanish Constitution promulgated in 1978, when Catalan was declared co-official with Spanish in Catalonia, the Valencian Community, and the Balearic Islands.
The Latin vowel system evolved in Catalan into a system of seven stressed vowels. As in most other Iberian Romance languages, there is a general process of spirantization or lenition of voiced stops. Catalan has a two-gender grammatical system and, as in other Western Romance languages, plurals end in -s; Catalan has a personal article and Balearic Catalan has a two-determiner system for common nouns. Finally, past perfective actions are indicated by a compound tense consisting of the auxiliary verb anar ‘to go’ in present tense plus the infinitive.
Catalan is a minoritized language everywhere it is spoken, except in the microstate of Andorra, and it is endangered in France and L’Alguer. The revival of Catalan in the post-dictatorship era is connected with a movement called linguistic normalization. Normalization refers to the aim of returning Catalan to “normal” use, at both the official and the everyday level, like any other official language.
Article
Lexical Representations in Language Processing
Gary Libben
Words are the backbone of language activity. An average 20-year-old native speaker of English will have a vocabulary of about 42,000 words. These words are connected with one another within the larger network of lexical knowledge that is termed the mental lexicon. The metaphor of a mental lexicon has played a central role in the development of theories of language and mind and has provided an intellectual meeting ground for psychologists, neurolinguists, and psycholinguists. Research on the mental lexicon has shown that lexical knowledge is not static. New words are acquired throughout the life span, creating very large increases in the richness of connectivity within the lexical system and changing the system as a whole. Because most people in the world speak more than one language, the default mental lexicon may be a multilingual one. Such a mental lexicon differs substantially from a lexicon of an individual language and would lead to the creation of new integrated lexical systems due to the pressure on the system to organize and access lexical knowledge in a homogenous manner. The mental lexicon contains both word knowledge and morphological knowledge. There is also evidence that it contains multiword strings such as idioms and lexical bundles. This speaks in support of a nonrestrictive “big tent” view of units of representation within the mental lexicon. Changes in research on lexical representations in language processing have emphasized lexical action and the role of learning. Although the metaphor of words as distinct representations within a lexical store has served to advance knowledge, it is more likely that words are best seen as networks of activity that are formed and affected by experience and learning throughout the life span.
Article
Nominalizations in the Romance Languages
Antonio Fábregas and Rafael Marín
The term nominalization refers to a specific type of category-changing morphological operation that produces nouns from other lexical categories, most productively verbs and adjectives. By extension, it is also used to refer to the resulting derived nouns. In Romance languages, nominalization generally involves addition of a suffix to the base (cf. Italian generoso ‘generous’ > generos-ità ‘generosity’), and such suffixes are called nominalizers. However, there are also cases of nouns built from other categories without any overt nominalizer (cf. Spanish inútil ‘useless’ > inútil ‘useless person’); descriptively, this process is called conversion, and it is debatable whether it should also be treated as a nominalization or whether a different kind of morphological operation is involved here.
Nominalizations can be divided into several classes depending on a variety of semantic and syntactic factors, such as the type of entities that they denote or the ability to introduce arguments. The main nominalization classes are (a) complex event nominalizations, which come from verbs, can combine with some temporal and aspectual modifiers, and have the ability to introduce at least an internal argument; (b) state nominalizations, which denote states associated with the verbs that serve as their bases; (c) participant nominalizations, which denote different types of arguments of the base, such as agents, resulting objects, locations or recipients; and (d) quality nominalizations, coming from adjectives and more restrictively from verbs, which denote a set of properties related to their base. Different classes of predicates select for different nominalization types, and there is a debate surrounding which tests capture in a more complete way the nuances of this taxonomy.
Nominalizers impose different types of restrictions on their bases: aspectual restrictions (individual-level vs. stage-level, (a)telicity, dynamicity, etc.), argument structure restrictions (agent vs. nonagent, different types of internal arguments), morphological restrictions (for instance, selecting only verbs that belong to a particular conjugation class), and finally conceptual restrictions (for instance, showing a strong preference for bases belonging to a particular conceptual domain).
In Romance languages, nominalizations sometimes alternate with other word classes, most significantly infinitives (see article on “Infinitival Clauses in the Romance Languages” in this encyclopedia). Infinitival constructions in Romance can display a mixture of verbal and nominal properties, or be totally recategorized as nouns, and in both cases they can compete with prototypical nominalizations. Less generally, participles (see article on “Participial Relative Clauses” in this encyclopedia), gerunds and supines can also display nominalization properties in some Romance varieties.
Article
Peculiarities of Portuguese Word-Formation
Graça Rio-Torto
Portuguese shares major word-formation mechanisms—affixation, composition, conversion, blending, clipping—with Romance languages, but also displays some peculiarities related to different Latin, Celtiberian, Germanic, and Mozarabic lexical heritages and to the internal dynamics of the language from the 12th to the 21st century. Portuguese has preserved the core of the medieval word-formation framework, but new patterns were of course introduced from time to time, especially during the 20th century. Portuguese word-formation peculiarities are partly conservative, partly innovative; some comply with international trends of word-formation, others depart from them. The proliferation of Neo-Latin compounding and the increase of blending, as well as the introduction of phenomena such as clipping, reanalysis, and grammaticalization illustrate the convergence of modern Portuguese with international word-formation tendencies. In Portuguese, as in other languages, learned suffixes tend to be less productive than the corresponding nonlearned ones coexisting with them. However, in specific cases such as gentilic adjectives/nouns, a learned suffix like -ense could also win over its nonlearned rival (in this case, Pt. -ês/-esa), while in Italian the nonlearned suffix -ese prevails.
Apart from peculiar phonological outcomes of some Latin suffixes and the greater weight of interfixation due to phonological and prosodic conditions, the major distinctive traits of Portuguese word-formation include: (a) the unique distribution of the major evaluative suffixes, grounded in subjective/attitudinal values; (b) the subjective meanings associated with several suffixes that are not found in the corresponding suffixes of other Romance languages; (c) the specific set of suffixal resources for forming agentive and instrumental deverbal nouns; and (d) the expansion of the categorial bases selected by some suffixes.
Article
Secondary Predication in the Romance Languages
Steffen Heidinger
A secondary predicate is a nonverbal predicate which is typically optional and which shares its argument with the sentence’s main verb (e.g., cansada ‘tired’ in Portuguese Ela chega cansada ‘She arrives tired’). A basic distinction within the class of adjunct secondary predicates is that between depictives and resultatives. Depictives, such as cansada in the Portuguese example, describe the state of an argument during the event denoted by the verb. Typically, Romance depictives morphologically agree with their argument in gender and number (as in the case of cansada). Resultatives, such as flat in John hammered the metal flat, describe the state of an argument which results from the event denoted by the verb. Resultatives come in different types, and the strong resultatives, such as flat in the English example, are missing in Romance languages. Although strong resultatives are missing, Romance languages possess other constructions which express a sense of resultativity: spurious resultatives, where the verb and the resultative predicate are linked because the manner of carrying out the action denoted by the verb leads to a particular resultant state (e.g., Italian Mia figlia ha cucito la gonna troppo stretta ‘My daughter sewed the skirt too tight’), and to a much lesser extent weak resultatives, where the meaning of the verb and the meaning of the resultative predicate are related (the resultative predicate specifies a state that is already contained in the verb’s meaning, e.g., French Marie s’est teint les cheveux noirs ‘Marie dyed her hair black’). In Romance languages the distinction between participant-oriented secondary predicates and event-oriented adjectival adverbs is not always clear. On the formal side, the distinction is blurred when (a) adjectival adverbs exhibit morphological agreement (despite their event orientation) or (b) secondary predicates do not agree with the argument they predicate over. 
On the semantic side, one and the same string may be open to interpretation as a secondary predicate or as an adjectival adverb (e.g., Spanish Pedro gritó colérico ‘Pedro screamed furious/furiously’).
Article
Morphological and Syntactic Variation and Change in Catalan
Gemma Rigau and Manuel Pérez Saldanya
Catalan is a Romance language closely related to Gallo-Romance languages. However, contact with Spanish since the 15th century has led it to adopt various linguistic features that are closer to those seen in Ibero-Romance languages. Catalan exhibits five broad dialects: Central, Northern, and Balearic, which pertain to the Eastern dialect block, and Northwestern and Valencian, which make up the Western.
This article deals with the most salient morphosyntactic properties of Catalan and covers diachronic and diatopic variations. It also offers information about diastratic or sociolinguistic variations, namely standard and non-standard variations. Among the most characteristic morphosyntactic features are the following:
1. Catalan is the only Romance language that exhibits a periphrastic past tense expressed by means of the verb anar ‘go’ + infinitive (Ahir vas cantar ‘Yesterday you sang’). This periphrastic past coexists with a simple past (Ahir cantares ‘Yesterday you sang’). However, Catalan does not have a periphrastic future built with the movement verb go.
2. Demonstratives show a two-term system in most Catalan dialects: aquí ‘here’ (proximal) and allà or allí ‘there’ (distal); but in Valencian and some Northwestern dialects, there is a three-term system. In contrast with other languages that have a two-term system, Catalan uses the proximal demonstrative to express proximity either to the speaker or to the addressee (Aquí on jo soc ‘Here where I am’, Aquí on tu ets ‘There where you are’).
3. Catalan has a complex system of clitic pronouns (or weak object pronouns) which may vary in form according to the point of contact with the verb, proclitically or enclitically; e.g., the singular masculine accusative clitic can have two syllabic forms (el and lo) and an asyllabic one (l’ or ’l): El saludo ‘I am greeting him’, Puc saludar-lo ‘I can greet him’, L’havies saludat ‘You had greeted him’, Saluda’l ‘Greet him’.
4. Existential constructions may contain the predicate haver-hi ‘there be’, consisting of the locative clitic hi and the verb haver ‘have’ (Hi ha tres estudiants ‘There are three students’), the copulative verb ser ‘be’ (Tres estudiants ja són aquí ‘Three students are already here’), or other verbs whose behavior can be close to that of an unaccusative verb when preceded by the clitic hi (Aquí hi treballen forners ‘There are some bakers working here’).
5. The negative polarity adverb no ‘not’ may be reinforced by the adverbs pas or cap in some dialects and can co-occur with negative polarity items (ningú ‘anybody/nobody’, res ‘anything/nothing’, mai ‘never’, etc.). Negative polarity items exhibit negative agreement (No hi ha mai ningú ‘Nobody is ever here’), but they may express positive meaning in some non-declarative syntactic contexts (Si mai vens, truca’m ‘If you ever come, call me’).
6. Other distinguishing items are the interrogative and confirmative particles, the pronominal forms of address, and the personal articles.
Article
Psycholinguistic Research on Inflectional Morphology in the Romance Languages
Claudia Marzi and Vito Pirrelli
Over the past decades, psycholinguistic aspects of word processing have made a considerable impact on views of language theory and language architecture. In the quest for the principles governing the ways human speakers perceive, store, access, and produce words, inflection issues have provided a challenging realm of scientific inquiry, and a battlefield for radically opposing views. It is somewhat ironic that some of the most influential cognitive models of inflection have long been based on evidence from an inflectionally impoverished language like English, where the notions of inflectional regularity, (de)composability, predictability, phonological complexity, and default productivity appear to be mutually implied. An analysis of more “complex” inflection systems such as those of Romance languages shows that this mutual implication is not a universal property of inflection, but a contingency of poorly contrastive, nearly isolating inflection systems. Far from presenting minor faults in a solid, theoretical edifice, Romance evidence appears to call into question the subdivision of labor between rules and exceptions, the on-line processing vs. long-term memory dichotomy, and the distinction between morphological processes and lexical representations. A dynamic, learning-based view of inflection is more compatible with this data, whereby morphological structure is an emergent property of the ways inflected forms are processed and stored, grounded in universal principles of lexical self-organization and their neuro-functional correlates.
Article
Cognitive Semantics in the Romance Languages
Ulrich Detges
Cognitive semantics (CS) is an approach to the study of linguistic meaning. It is based on the assumption that the human linguistic capacity is part of our cognitive abilities, and that language in general and meaning in particular can therefore be better understood by taking into account the cognitive mechanisms that control the conceptual and perceptual processing of extra-linguistic reality. Issues central to CS are (a) the notion of prototype and its role in the description of language, (b) the nature of linguistic meaning, and (c) the functioning of different types of semantic relations. The question concerning the nature of meaning is an issue that is particularly controversial between CS on the one hand and structuralist and generative approaches on the other hand: is linguistic meaning conceptual, that is, part of our encyclopedic knowledge (as is claimed by CS), or is it autonomous, that is, based on abstract and language-specific features? According to CS, the most important types of semantic relations are metaphor, metonymy, and different kinds of taxonomic relations, which, in turn, can be further broken down into more basic associative relations such as similarity, contiguity, and contrast. These play a central role not only in polysemy and word formation, that is, in the lexicon, but also in the grammar.
Article
Discriminative Learning and the Lexicon: NDL and LDL
Yu-Ying Chuang and R. Harald Baayen
Naive discriminative learning (NDL) and linear discriminative learning (LDL) are simple computational algorithms for lexical learning and lexical processing. Both NDL and LDL assume that learning is discriminative, driven by prediction error, and that it is this error that calibrates the association strength between input and output representations. Both words’ forms and their meanings are represented by numeric vectors, and mappings between forms and meanings are set up. For comprehension, form vectors predict meaning vectors. For production, meaning vectors map onto form vectors. These mappings can be learned incrementally, approximating how children learn the words of their language. Alternatively, optimal mappings representing the end state of learning can be estimated. The NDL and LDL algorithms are incorporated in a computational theory of the mental lexicon, the ‘discriminative lexicon’. The model shows good performance both with respect to production and comprehension accuracy, and for predicting aspects of lexical processing, including morphological processing, across a wide range of experiments. Since, mathematically, NDL and LDL implement multivariate multiple regression, the ‘discriminative lexicon’ provides a cognitively motivated statistical modeling approach to lexical processing.
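The error-driven, incremental mapping from form vectors to meaning vectors described in this abstract can be illustrated with a minimal sketch. It assumes a Widrow-Hoff (delta rule) update, which is one common way of implementing prediction-error learning over numeric vectors; the cue vectors, meaning vectors, learning rate, and all function names below are invented for illustration and are not taken from the NDL or LDL software.

```python
# Toy sketch of incremental, error-driven learning of a form-to-meaning mapping.
# All data and names are hypothetical; this is not the authors' implementation.

def predict(cues, weights):
    """Comprehension: map a form (cue) vector onto a predicted meaning vector."""
    n_out = len(weights[0])
    return [sum(c * weights[i][j] for i, c in enumerate(cues))
            for j in range(n_out)]

def delta_update(cues, target, weights, rate=0.1):
    """One learning event: adjust weights in proportion to the prediction error."""
    predicted = predict(cues, weights)
    for i, c in enumerate(cues):
        if c == 0.0:
            continue  # absent cues do not change their weights
        for j, (t, p) in enumerate(zip(target, predicted)):
            weights[i][j] += rate * c * (t - p)

def train(events, n_cues, n_out, epochs=500, rate=0.1):
    """Start from zero association strengths and replay the learning events."""
    weights = [[0.0] * n_out for _ in range(n_cues)]
    for _ in range(epochs):
        for cues, target in events:
            delta_update(cues, target, weights, rate)
    return weights

# Three invented word forms coded by four binary cues,
# each paired with an invented two-dimensional meaning vector.
EVENTS = [
    ([1.0, 1.0, 0.0, 0.0], [1.0, 0.0]),
    ([0.0, 1.0, 1.0, 0.0], [0.0, 1.0]),
    ([0.0, 0.0, 1.0, 1.0], [1.0, 1.0]),
]

WEIGHTS = train(EVENTS, n_cues=4, n_out=2)

for cues, target in EVENTS:
    print(target, [round(v, 3) for v in predict(cues, WEIGHTS)])
```

With enough learning events, the incrementally learned weights in this toy setting approach a least-squares regression solution, which is the sense in which the end state of learning can alternatively be estimated directly. Production would be sketched the same way with the roles of form and meaning vectors reversed.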