Skip to main content
Skip to main content

Research

Research at our top-ranked department spans syntax, semantics, phonology, language acquisition, computational linguistics, psycholinguistics and neurolinguistics. 

Connections between our core competencies are strong, with theoretical, experimental and computational work typically pursued in tandem.

A network of collaboration at all levels sustains a research climate that is both vigorous and friendly. Here new ideas develop in conversation, stimulated by the steady activity of our labs and research groups, frequent student meetings with faculty, regular talks by local and invited scholars and collaborations with the broader University of Maryland language science community, the largest and most integrated language science research community in North America.

Show activities matching...

filter by...

Effect of syntactic constraints on long-distance dependency formation in backwards anaphora processing

Syntactic constraints exert early effects in online resolution of anaphoric pronouns. But when do they become available for cataphoric pronouns, those which precede their 'antecedent'?

Linguistics

Non-ARHU Contributor(s): Nina Kazanina, Moti Lieberman, Masaya Yoshida
Dates:
This article presents three studies that investigate when syntactic constraints become available during the processing of long-distance backwards pronominal dependencies (backwards anaphora or cataphora). Earlier work demonstrated that in such structures the parser initiates an active search for an antecedent for a pronoun, leading to gender mismatch effects in cases where a noun phrase in a potential antecedent position mismatches the gender of the pronoun [Van Gompel, R. P. G. & Liversedge, S. P. (2003). The influence of morphological information on cataphoric pronoun assignment. Journal of Experimental Psychology: Learning, Memory, and Cognition, 29, 128–139]. Results from three self-paced reading studies suggest that structural constraints on coreference, in particular Principle C of the Binding Theory [Chomsky, N. (1981). Lectures on government and binding. Dordrecht, Foris], exert an influence at an early stage of this search process, such that gender mismatch effects are elicited at grammatically licit antecedent positions, but not at grammatically illicit antecedent positions. The results also show that the distribution of gender mismatch effects is unlikely to be due to differences in the predictability of different potential antecedents. These findings suggest that backwards anaphora dependencies are processed with a grammatically constrained active search mechanism, similar to the mechanism used to process another type of long-distance dependency, the wh dependency (e.g., [Stowe, L. (1986). Evidence for online gap creation. Language and Cognitive Processes, 1, 227–245; Traxler, M. J., & Pickering, M. J. (1996). Plausibility and the processing of unbounded dependencies: an eye-tracking study. Journal of Memory and Language, 35, 454–475.]). We suggest that the temporal priority for syntactic information observed here reflects the predictabilityof structural information, rather than the need for an architectural constraint that delays the use of non syntactic information.

Read More about Effect of syntactic constraints on long-distance dependency formation in backwards anaphora processing

On Formal Feature Licensing in Minimalism: Aspects of Standard Arabic Morphosyntax

A minimalist account of Standard Arabic syntax.

Linguistics

Non-ARHU Contributor(s): Usama Soltan
Dates:
This dissertation investigates a set of phenomena in Standard Arabic at the syntax-morphology interface, providing an analysis for each within the assumptions of the minimalist program, particularly those related to mechanisms of formal feature licensing. Among the issues discussed are the subject-verb agreement asymmetry, case-assignment, default agreement, nominative Themes, as well as interactions between tense, negation, and modality heads. In this regard, I provide an analysis for word order alternation in the language in terms of left dislocation rather than via movement, showing that the language does not show A-movement effects in SVO orders, passives, raising constructions, or object shift. The same is also shown to hold in what is usually referred to as raising-to-object constructions. The proposed analysis shows that formal features such as case and agreement can be licensed in absence of movement, a conclusion more compatible with the Agree-based approach to formal feature licensing in minimalism than with the Spec-head approach. Finally, I propose to extend Agree to head-head relations in the functional domain, accounting for the interesting, though rather intricate, paradigm of inflecting negatives as well as person-less imperatives in Standard Arabic and languages that exhibit similar behavior.

Language-Specific Constraints on Scope Interpretation in First Language Acquisition

How do children acquiring Japanese learn constraints on quantifier scope?

Linguistics

Non-ARHU Contributor(s): Takuya Goro
Dates:
This dissertation investigates the acquisition of language-specific constraints on scope interpretation by Japanese preschool children. Several constructions in Japanese do not allow scope interpretations that the corresponding English sentences do allow. First, in Japanese transitive sentences with multiple quantificational arguments, an inverse scope interpretation is disallowed, due to the Rigid Scope Constraint. Second, Japanese logical connectives cannot be interpreted under the scope of local negation, due to their Positive Polarity. Thirdly, in Japanese infinitival complement constructions with implicative matrix verbs like wasureru ("forget") the inverse scope interpretation is required, due to the Anti-Reconstruction Constraint. The main goal of this research is to determine how Japanese children learn these constraints on scope interpretations. To that end, three properties of the acquisition task that have an influence on the learnability of linguistic knowledge are examined: productivity, no negative evidence, and arbitrariness. The results of experimental investigations show that Japanese children productively generate scope interpretations that are never exemplified in the input. For example, with sentences that contain two quantificational arguments, Japanese children accessed inverse scope interpretations that Japanese adults do not allow. Also, Japanese children interpret the disjunction ka under the scope of local negation, which is not a possible interpretive option in the adult language. These findings clearly show that children do not acquire these scope constraints through conservative learning, and raise the question of how they learn to purge their non-adult interpretations. It is argued that input data do not provide learners with negative evidence (direct or indirect) against particular scope interpretations. Two inherent properties of input data about possible scope interpretations, data sparseness and indirectness, make negative evidence too unreliable as a basis for discovering what scope interpretation is impossible. In order to solve the learnability problems that children's scope productivity raise, I suggest that the impossibility of their non-adult interpretations are acquired by learning some independently observable properties of the language. In other words, the scope constraints are not arbitrary in the sense that their effects are consequences of other properties of the grammar of Japanese.

Spin: Lexical Semantics, Transitivity, and the Identification of Implicit Sentiment

A new approach to automatic identification of implicit sentiment in text.

Linguistics

Non-ARHU Contributor(s):

Stephan Greene

Dates:

Current interest in automatic sentiment analysis is motivated by a variety of information requirements. The vast majority of work in sentiment analysis has been specifically targeted at detecting subjective statements and mining opinions. This dissertation focuses on a different but related problem that to date has received relatively little attention in NLP research: detecting implicit sentiment, or spin, in text. This text classification task is distinguished from other sentiment analysis work in that there is no assumption that the documents to be classified with respect to sentiment are necessarily overt expressions of opinion. They rather are documents that might reveal a perspective. This dissertation describes a novel approach to the identification of implicit sentiment, motivated by ideas drawn from the literature on lexical semantics and argument structure, supported and refined through psycholinguistic experimentation. A relationship predictive of sentiment is established for components of meaning that are thought to be drivers of verbal argument selection and linking and to be arbiters of what is foregrounded or backgrounded in discourse. In computational experiments employing targeted lexical selection for verbs and nouns, a set of features reflective of these components of meaning is extracted for the terms. As observable proxies for the underlying semantic components, these features are exploited using machine learning methods for text classification with respect to perspective. After initial experimentation with manually selected lexical resources, the method is generalized to require no manual selection or hand tuning of any kind. The robustness of this linguistically motivated method is demonstrated by successfully applying it to three distinct text domains under a number of different experimental conditions, obtaining the best classification accuracies yet reported for several sentiment classification tasks. A novel graph-based classifier combination method is introduced which further improves classification accuracy by integrating statistical classifiers with models of inter-document relationships.

A program for experimental syntax: Finding the relationship between acceptability and grammatical knowlege

The tools of experimental syntax can be used to explore the relationship between acceptability judgments and the form or nature of grammatical knowledge, not just its content.

Linguistics

Non-ARHU Contributor(s): Jon Sprouse
Dates:
There has always been interest in the methodology of acceptability judgment collection, as well as the reliability of the results. It seems, though, that the past several years have seen an increase in the number of studies employing formal experimental techniques for the collection of acceptability judgments, so much so that the term experimental syntax has come to be applied to the use of those techniques. The question this dissertation asks is whether the extent of the utility of experimental syntax is to find areas in which informal judgment collection was insufficient, or whether there is a complementary research program for experimental syntax that is more than just a methodological footnote to the informal judgment collection of theoretical syn- tax. This dissertation is a first attempt at a tentative yes: the tools of experimental syntax can be used to explore the relationship between acceptability judgments and the form or nature of grammatical knowledge, not just the content of grammatical knowledge. This dissertation begins by identifying several recent claims about the nature of grammatical knowledge that have been made based upon hypotheses about the nature of acceptability judgments. Each chapter applies the tools of experimental syntax to those hypotheses in an attempt to refine our understanding of the relationship between acceptability and grammatical knowledge. The claims investigated include: that grammatical knowledge is gradient, that grammatical knowledge is sensitive to context effects, that the stability or instability of acceptability reflects underlying differences in grammatical knowledge, that processing effects affect acceptability, and that acceptability judgments have nothing further to contribute to debates over the number and nature of dependency forming operations. Using wh-movement and Island effects as the empirical basis of the research, the results of these studies suggest that the relationship between acceptability and grammatical knowledge is much more complicated than previously thought. The overarching conclusion is that there is a program for experimental syntax that is independent of simple data collection: only through the tools of experimental syntax can we achieve a better understanding of the nature of acceptability, and how it relates to the nature of grammatical knowledge.

Relating Structure and Time in Linguistics and Psycholinguistics

Linguistics and psycholinguistics differ not in their topic but in their tools, and our choice of tools should be commensurate to the hypotheses we are testing. A case study of long-distance dependencies serves to illustrate the point.

Linguistics

Contributor(s): Colin Phillips
Non-ARHU Contributor(s):

Matt Wagers

Dates:
Publisher: Oxford University Press

Linguistics and psycholinguistics differ not in their topic but in their tools, and our choice of tools should be commensurate to the hypotheses we are testing. A case study of long-distance dependencies serves to illustrate the point.

Necessary Bias in Natural Language Learning

A computational model of grammar acquisition.

Linguistics

Non-ARHU Contributor(s): Lisa Pearl
Dates:
This dissertation investigates the mechanism of language acquisition given the boundary conditions provided by linguistic representation and the time course of acquisition. Exploration of the mechanism is vital once we consider the complexity of the system to be learned and the non-transparent relationship between the observable data and the underlying system. It is not enough to restrict the potential systems the learner could acquire, which can be done by defining a finite set of parameters the learner must set. Even supposing that the system is defined by n binary parameters, we must still explain how the learner converges on the correct system(s) out of the possible 2^n systems, using data that is often highly ambiguous and exception-filled. The main discovery from the case studies presented here is that learners can in fact succeed provided they are biased to only use a subset of the available input that is perceived as a cleaner representation of the underlying system. The case studies are embedded in a framework that conceptualizes language learning as three separable components, assuming that learning is the process of selecting the best-fit option given the available data. These components are (1) a defined hypothesis space, (2) a definition of the data used for learning (data intake), and (3) an algorithm that updates the learner's belief in the available hypotheses, based on data intake. One benefit of this framework is that components can be investigated individually. Moreover, defining the learning components in this somewhat abstract manner allows us to apply the framework to a range of language learning problems and linguistics domains. In addition, we can combine discrete linguistic representations with probabilistic methods and so account for the gradualness and variation in learning that human children display. The tool of exploration for these case studies is computational modeling, which proves itself very useful in addressing the feasibility, sufficiency, and necessity of data intake filtering since these questions would be very difficult to address with traditional experimental techniques. In addition, the results of computational modeling can generate predictions that can then be tested experimentally.

(Dis)Agree: Movement and Agreement Reconsidered

On the Agree relation in Minimalism.

Linguistics

Non-ARHU Contributor(s): Pritha Chandra
Dates:
This dissertation examines Agree, a narrow syntactic, long-distance operation underlying phi-agreement in the grammar. Taking the strong minimalist thesis (cf. Chomsky 2000) as my point of departure, I question Agree on both conceptual and empirical grounds. On the conceptual side, the operation is suspect first for its language-specific character. Second, it also fails to be justified on the grounds of general architectural constraints and legibility requirements. Further, evidences of various long-distance agreement from across languages examined here question the empirical basis for Agree built throughout the previous literature. As far as this is true, I contend that the faculty of language has nothing beyond Merge and Move/Internal Merge, the first being inevitable in any language-like system and the latter necessitated by interface exigencies. My purpose in this dissertation is to show that these two operations suffice to obtain phi-agreement in natural language.

Tracking Sound Dynamics in Human Auditory Cortex: New macroscopic perspectives from MEG

An MEG study of auditory perception.

Linguistics

Non-ARHU Contributor(s):

Huan Luo

Dates:

Both the external world and our internal world are full of changing activities , and the question of how these two dynamic systems are linked constitutes the most intriguing and fundamental question in neuroscience and cognitive science. This study specifically investigates the processing and representation of sound dynamic information in human auditory cortex using magnetoencephalography (MEG), a non-invasive brain imaging technique whose high temporal resolution (on the order of ~1ms) makes it an appropriate tool for studying the neural correlates of dynamic auditory information. The other goal of this study is to understand the essence of the macroscopic activities reflected in non-invasive brain imaging experiments, specifically focusing on MEG. Invasive single-cell recordings in animals have yielded a large amount of information about how the brain works at a microscopic level. However, there still exist large gaps in our understanding of the relationship between the activities recorded at the microscopic level in animals and at the macroscopic level in humans, which have yet to be reconciled in terms of their different spatial scales and activities format, making a unified knowledge framework still unsuccessful. In this study, natural speech sentences and sounds containing speech-like temporal dynamic features are employed to probe the human auditory system. The recorded MEG signal is found to be well correlated with the stimulus dynamics via amplitude modulation (AM) and/or phase modulation (PM) mechanisms. Specifically, oscillations at various frequency bands are found to be the main information-carrying elements of the MEG signal, and the two major parameters of these endogenous brain rhythms, amplitude and phase, are modulated by incoming sensory stimulus dynamics, corresponding to AM and PM mechanism, to track sound dynamics. Crucially, such modulation tracking is found to be correlated with human perception and behavior. This study suggests that these two dynamic and complex systems, the external and internal worlds, systematically communicate and are coupled via modulation mechanism, leading to a reverberating flow of information embedded in oscillating waves in human cortex. The results also have implications for brain imaging studies, suggesting that these recorded macroscopic activities reflect brain state, the more close neural correlate of high-level cognitive behavior.

Syntactic prediction and lexical frequency effects in sentence processing

Three experiments examining the effect of lexical surface frequency on sentence processing and the interaction between surface frequency and syntactic prediction.

Linguistics

Non-ARHU Contributor(s):

Katya Rozanova

Dates:
Publisher: University of Maryland Working Papers

This paper presents three experiments which examine the effect of lexical surface frequency on sentence processing and the interaction between surface frequency and syntactic prediction. The first two experiments make use of the self-paced reading paradigm to show that processing time differences due to surface frequency (e.g., the frequency of cats not including occurrences of cat), which have previously been demonstrated in isolated word tasks like lexical decision, also give rise to reaction time differences in sentence processing tasks, in this case for singular and plural English nouns. The second experiment investigates whether a prediction for the number morpheme triggered by the number-marked determiners this and these might counter the surface frequency effect; however, the small size of the surface frequency effect and baseline differences in reaction times to this and these made the results unclear. Results from a third experiment using lexical decision suggest that the difference in the size of the surface frequency effects between the lexical decision experiments and the self-paced-reading experiments are likely due to differences in task demands. Our results have methodological implications for psycholinguistic experiments that manipulate morphology as a means of examining other questions of interest.