I'm Dillon Plunkett, a cognitive (neuro)scientist studying human and AI minds.

Research

I'm interested in complex thought, conscious awareness, and the relationship between the two. What kinds of thoughts are we capable of thinking and how does the human brain encode them? Which mental processes do we consciously experience and what determines the character of those experiences?

I'm also interested in artificial intelligence systems. I have all of the same questions about them. I also believe they will likely be either enormously beneficial or catastrophically harmful for life on Earth. Accordingly, my current research is focused on understanding and steering powerful AI systems.

Currently, I am an Anthropic Fellow researching model welfare with Kyle Fish. Previously, I worked with Jorge Morales in the Subjectivity Lab, where my research focused on the ability of AI systems to report on their own internal processes and on how the human mind represents and predicts changes. Before that, I did my PhD research in Joshua Greene's lab at Harvard. And before that, I did research in experimental epistemology, causal inference, and metareasoning with Tania Lombrozo and Tom Griffiths while working in the Concepts and Cognition and Computational Cognitive Science labs at UC Berkeley.

As an undergraduate, I studied philosophy and psychology at Harvard. My thesis work focused on another topic I find fascinating: the rational and moral significance of personal identity. Precisely what makes some future person me and why should I care more about that person than other people?

Publications


Full CV (pdf)