Research

Publications

 

Articles

 

''How to Solve the Gender Inclusion Problem'' Hypatia (forthcoming).
[Self-Archived]

Abstract: The inclusion problem for theories of gender arises when those theories inappropriately fail to include certain individuals in the gender categories to which they ought to belong. The inclusion problem affects both of the most influential traditions in feminist theorizing about gender: social-position accounts and identity accounts. I argue that the inclusion problem can be solved by adopting a structured theory of gender, which incorporates aspects of both social-position accounts and identity accounts. According to the theory I favor, an individual’s gender is determined by their gender identity if they have one; otherwise, it is determined by their social position. My structured approach to gender offers a more direct solution to the gender inclusion problem than alternatives recently advocated by Barnes (2020), Jenkins (2023), and others. It also points the way to a simple solution to inclusion problems that arise at the level of gendered language.

''AI Wellbeing'' Asian Journal of Philosophy (forthcoming). (With Simon Goldstein.)
[Self-Archived]

Abstract: Under what conditions would an artificially intelligent system have wellbeing? Despite its clear bearing on the ethics of human interactions with artificial systems, this question has received little direct attention. Because all major theories of wellbeing hold that an individual’s welfare level is partially determined by their mental life, we begin by considering whether artificial systems have mental states. We show that a wide range of theories of mental states, when combined with leading theories of wellbeing, predict that certain existing artificial systems have wellbeing. Along the way, we argue that there are good reasons to believe that artificial systems can have wellbeing even if they are not phenomenally conscious. While we do not claim to demonstrate conclusively that AI systems have wellbeing, we argue that there is a significant probability that some AI systems have or will soon have wellbeing, and that this should lead us to reassess our relationship with the intelligent systems we create.

''Quotation for Dummies'' Philosophical Perspectives (forthcoming).
[Self-Archived]

Abstract: Quotation marks in natural language that do not function straightforwardly as devices for securing reference to linguistic objects have generally been categorized as instances of either mixed quotation or scare quotation. I argue that certain examples of quotation in natural language resist assimilation to either of these two theoretical categories. It follows that we must recognize a novel type of quotation in natural language, which I call dummy quotation because it involves the contribution of semantically minimal ‘dummy meanings’ to composition. I develop a semantic theory of dummy quotation and show that it is better able than rival proposals to account for the troublesome examples in question.

''Language Agents Reduce the Risk of Existential Catastrophe'' AI & Society (forthcoming). (With Simon Goldstein.)
[Official] [Self-Archived]

Abstract: Recent advances in natural language processing have given rise to a new kind of AI architecture: the language agent. By repeatedly calling an LLM to perform a variety of cognitive tasks, language agents are able to function autonomously to pursue goals specified in natural language and stored in a human-readable format. Because of their architecture, language agents exhibit behavior that is predictable according to the laws of folk psychology: they have desires and beliefs, and then make and update plans to pursue their desires given their beliefs. We argue that the rise of language agents significantly reduces the probability of an existential catastrophe due to loss of control over an AGI. This is because the probability of such an existential catastrophe is proportional to the difficulty of aligning AGI systems, and language agents significantly reduce that difficulty. In particular, language agents help to resolve three important issues related to aligning AIs: reward misspecification, goal misgeneralization, and uninterpretability.

''Covert Mixed Quotation'' Semantics and Pragmatics 17(5): 1–54 (2024).
[Official (Open Access)]

Abstract: The term covert mixed quotation describes cases in which linguistic material is interpreted in the manner of mixed quotation — that is, used in addition to being mentioned — despite the superficial absence of any commonly recognized conventional devices indicating quotation. After developing a novel theory of mixed quotation, I show that positing covert mixed quotation allows us to give simple and unified treatments of a number of puzzling semantic phenomena, including the projective behavior of conventional implicature items embedded in indirect speech reports and propositional attitude ascriptions, so-called 'c-monsters,' metalinguistic negation, metalinguistic negotiation, and 'in a sense' constructions.

''Artificial Intelligence: Arguments for Catastrophic Risk'' Philosophy Compass 19(2): e12964 (2024). (With Adam Bales and William D'Alessandro.)
[Official (Open Access)]

Abstract: Recent progress in artificial intelligence (AI) has drawn attention to the technology’s transformative potential, including what some see as its prospects for causing large-scale harm. We review two influential arguments purporting to show how AI could pose catastrophic risks. The first argument — the Problem of Power-Seeking — claims that, under certain assumptions, advanced AI systems are likely to engage in dangerous power-seeking behavior in pursuit of their goals. We review reasons for thinking that AI systems might seek power, that they might obtain it, that this could lead to catastrophe, and that we might build and deploy such systems anyway. The second argument claims that the development of human-level AI will unlock rapid further progress, culminating in AI systems far more capable than any human — this is the Singularity Hypothesis. Power-seeking behavior on the part of such systems might be particularly dangerous. We discuss a variety of objections to both arguments and conclude by assessing the state of the debate.

''Dilemmatic Gaslighting'' Philosophical Studies 180: 745–772 (2023).
[Official] [Self-Archived]

Abstract: Existing work on gaslighting ties it constitutively to facts about the intentions or prejudices of the gaslighter and/or his victim’s prior experience of epistemic injustice. I argue that the concept of gaslighting is more broadly applicable than has been appreciated: what is distinctive about gaslighting, on my account, is simply that a gaslighter confronts his victim with a certain kind of choice between rejecting his testimony and doubting her own basic epistemic competence in some domain. I thus hold that gaslighting is a purely epistemic phenomenon — not requiring any particular set of intentions or prejudices on the part of the gaslighter — and also that it can occur even in the absence of any prior experience of epistemic injustice. Appreciating the dilemmatic character of gaslighting allows us to understand its connection with a characteristic sort of epistemic harm, makes it easier to apply the concept of gaslighting in practice, and raises the possibility that we might discover its structure and the associated harm in surprising places. 

''Contextology'' Philosophical Studies 179: 3187–3209 (2022). (With Simon Goldstein.)
[Official] [Self-Archived]

Abstract: Contextology is the science of the dynamics of the conversational context. Contextology formulates laws governing how the shared information states of interlocutors evolve in response to assertion. More precisely, the contextologist attempts to construct a function which, when provided with just a conversation's pre-update context and the content of an assertion, delivers that conversation's post-update context. Most contextologists have assumed that the function governing the evolution of the context is simple: the post-update context is just the pre-update context intersected with the content of the assertion. We argue that this assumption is wrong: not only is it false, it is also incoherent given standard contextological assumptions. Moreover, it is impossible in principle to revise it to correctly describe the dynamics of context. We conclude that there can be no science of Contextology. The laws governing the evolution of the context in response to assertion must make essential reference to the private information states of interlocutors.

''Why Horizontalism'' Philosophical Studies 177: 2881-2905 (2020).
[
Official] [Self-Archived]

Abstract: Horizontalism is the thesis that what a speaker asserts in literally and sincerely uttering an indicative sentence is some horizontal proposition of her utterance; diagonalism is the thesis that what a speaker asserts in literally and sincerely uttering an indicative sentence is some diagonal proposition of her utterance. Recent work on assertion has reached no clear consensus favoring either horizontalism or diagonalism. I explore a novel strategy for adjudicating between the two views by considering the advantages and disadvantages which would accrue to a linguistic community as a result of adopting different committal practices – that is, practices of associating utterances with the propositions to which speakers undertake assertoric commitments in uttering them – ultimately concluding that a horizontalist practice has important advantages over its competitors.

''Slurs Are Directives'' Philosophers' Imprint 19(48): 1-28 (2019).
[Official (Open Access)]

Abstract: Recent work on the semantics and pragmatics of slurs has explored a variety of ways of explaining their potential to derogate, with the most popular family of approaches appealing to either: (i), the non-cognitive attitudes expressed by — or (ii), the propositions concerning such attitudes semantically or pragmatically communicated by — the speakers who use them. I begin by arguing that no such speaker-oriented approach can be correct. I then propose an alternative treatment of slurs, according to which they are semantically associated with both descriptive and directive content. On the view I defend, when speakers use slurs, they simultaneously propose to add an at-issue proposition to the conversational common ground and issue a not-at-issue directive to their interlocutors to adopt a derogatory perspective toward members of the targeted group. This proposal both avoids the problems faced by other accounts and opens up a novel way of thinking about the phenomenon of appropriation.

''Radically Insensitive Theists'' Religious Studies 55: 169-188 (2019).
[Official] [Self-Archived]

Abstract: Sceptical theists attempt to meet the challenge to theism posed by evidential arguments from evil by appealing to the limitations of human cognition. Drawing on an exchange between William Rowe and Michael Bergmann, I argue that consistent sceptical theists must be radically insensitive to certain kinds of evidence about prima facie evils – that is, that they must endorse the claim that not even evidence of extreme and pervasive suffering could justify disbelief in theism. I show that Bergmann’s attempt to respond to this problem does not succeed and argue that no alternative response is forthcoming, concluding that the threat of radical insensitivity constitutes a serious and underappreciated difficulty for sceptical theism.

''Uniformity Motivated'' Linguistics and Philosophy 41: 665-684 (2018).
[Official] [Self-Archived]

Abstract: Can rational communication proceed when interlocutors are uncertain which contents utterances contribute to discourse? An influential negative answer to this question is embodied in the Stalnakerian principle of uniformity, which requires speakers to produce only utterances that express the same content in every possibility treated as live for the purposes of the conversation. The principle of uniformity enjoys considerable intuitive plausibility and, moreover, seems to follow from platitudes about assertion; nevertheless, it has recently proven controversial. In what follows, I defend the principle by developing two arguments for it based on premises reflecting the central aims and assumptions of possibility-carving frameworks for modeling inquiry — that is, frameworks which describe the evolution of individuals’ attitudinal states in terms of set-theoretic operations defined over a domain of objects representing possibilities.

''De Ray: On the Boundaries of the Davidsonian Semantic Programme'' Mind 126: 697-714 (2017). (With Ernie Lepore.)
[Official] [Self-Archived]

Abstract: Greg Ray (2014) believes he has discovered a crucial oversight in Donald Davidson’s semantic programme, recognition of which paves the way for a novel approach to Davidsonian semantics. We disagree: Ray’s novel approach involves a tacit appeal to pre-existing semantic knowledge which vitiates its interest as a development of the Davidsonian programme.

 

Chapters

 

''Do Not Diagonalize'' In The Oxford Handbook of Contemporary Philosophy of Language (forthcoming).
[Self-Archived]

Abstract: Speakers assert in order to communicate information. It is natural, therefore, to hold that the content of an assertion is whatever information it communicates to its audience. In cases involving uncertainty about the semantic values of context-sensitive lexical items, moreover, it is natural to hold that the information an assertion communicates to its audience is whatever information audience members are in a position to recover from it by assuming that the proposition it semantically determines is true. This sort of picture corresponds to an influential and widely endorsed theory of assertoric content: diagonalism. I begin by arguing that, despite its intuitive appeal, diagonalism should be rejected because it conflicts with our intuitive judgments about the circumstances in which the contents of speakers' assertions would be true or false.  I then show that the failure of diagonalism requires us to either abandon a familiar way of thinking about information and rational assertion or hold that the content of an assertion is not always the information it communicates. I suggest that we choose the latter horn of this dilemma — assertoric content is better characterized in terms of the commitments speakers undertake than in terms of the information they communicate.

''Pronouns and Gender'' In The Oxford Handbook of Applied Philosophy of Language, pp. 265–292 (2024). (With Michael Glanzberg.)
[Official] [Self-Archived]

Abstract: This chapter introduces readers to the empirical questions at issue in debates over gendered pronouns and assesses the plausibility of various possible answers to these questions. It has two parts. The first is a general introduction to the linguistics and psychology of grammatical gender. The second focuses on the meanings of gendered pronouns in English. It begins with a discussion of some methodological limitations of empirical approaches to the topic and the normative implications of those limitations. It then argues against three simple theories of the semantics of gendered pronouns in English and proposes an alternative that fares better: the Gender-First View. Finally, it discusses the singular use of 'they' and its connection to nonbinary gender identities.

 

Shorter Pieces (Replies, Discussion, Etc.)

 

''Is Alignment Unsafe?'' Philosophy & Technology 37(110): 1-4 (2024).
[Official (Open Access)]

Abstract: Inchul Yum (2024) argues that the widespread adoption of language agent architectures would likely increase the risk posed by AI by simplifying the process of aligning artificial systems with human values and thereby making it easier for malicious actors to use them to cause a variety of harms. Yum takes this to be an example of a broader phenomenon: progress on the alignment problem is likely to be net safety-negative because it makes artificial systems easier for malicious actors to control. I offer some reasons for skepticism about this surprising and pessimistic conclusion.

 

Edited

 

AI Safety Special Issue of Philosophical Studies (forthcoming.) (With Dan Hendrycks.)
[Link]

Unstructured Content. Oxford University Press (2025). (With Peter van Elswyk, Andy Egan, and Dirk Kindermann.)
[Oxford] [Amazon]

Abstract: The original essays in this volume present new research on unstructured theories of content, which have traditionally played a central role in linguistics and philosophy of language. The volume explores a wide range of themes related to unstructured content, including both the continued controversy over whether unstructured theories individuate contents too coarsely and various applications of unstructured theories to topics like rationality, epistemic commitment, semantic expressivism, relevance, and propositional attitude ascriptions. It contains contributions from different theoretical perspectives, including both those sympathetic to unstructured theories of content and those who are skeptical, as well as from different methodological backgrounds, with philosophy, logic, and linguistics all represented. With contributions from leading scholars in philosophy and linguistics, this volume will be of interest to anyone working in logic, metaphysics, or the philosophy of mind.

The Structure of Truth. Oxford University Press (2020). (With Ernie Lepore.)
[Oxford] [Amazon] [Introduction (Self-Archived)]

Abstract: Donald Davidson was one of the most famous and influential philosophers of the twentieth century. The Structure of Truth presents his 1970 Locke Lectures in print for the first time. They comprise an invaluable historical document which illuminates how Davidson was thinking about the theory of meaning, the role of a truth theory therein, the ontological commitments of a truth theory, the notion of logical form, and so on, at a pivotal moment in the development of his thought. Unlike Davidson's previously published work, the lectures are written so as to be presented to an audience as a fully organized and coherent exposition of his program in the philosophy of language. Had they been widely available in the years following 1970, the reception of Davidson's work might have been very different. Given the systematic nature of their presentation of Davidson's semantic program, these lectures will be of interest to anyone working in the philosophy of language.

Reviewed in Mind by Ian Rumfitt. [Official] [Archived]

Reviewed in The Review of Metaphysics by Claudine Verheggen. [Official] [Archived]

 

Recent and Upcoming Presentations

 

What Remains of the Singularity Hypothesis? Workshop on the Cognitive and Linguistic Capacities of LLMs, University of Hong Kong. [December 2024]

A Case for AI Consciousness: Artificial Language Agents and Global Workspace Theory. UMD College Park. [October 2024]

Metalinguistic Comparison Is Covert Mixed Quotation. Workshop in Philosophy of Language, 25th World Congress of Philosophy, Rome. [August 2024]

Metalinguistic Comparison Is Covert Mixed Quotation. Point of View in Language Workshop, Uppsala University. [June 2024]

How to Solve the Gender Inclusion Problem. Filosofidagarna 2024, University of Gothenburg. [June 2024]

AI Wellbeing. Mind and Moral Status Group, Global Priorities Institute, University of Oxford, Digital. [May 2024]

A Case for AI Consciousness: Artificial Language Agents and Global Workspace Theory. Pacific APA Symposium, Portland. [March 2024]

What Remains of the Traditional AI X-Risk Argument? Workshop on Artificial Intelligence and Catastrophic Risk, Australian National University. [March 2024]

A Case for AI Consciousness: Artificial Language Agents and Global Workspace Theory. Macquarie University. [March 2024]

Comments on "Bigoted Beliefs and the Safety Condition" by Gus Turyn. Eastern APA Colloquium, New York. [January 2024]

AI Safety: The View from Philosophy. ITAM AI Futures Fellowship, Mexico City. [January 2024]