Research
My research aims to advance scientific understanding of AI (especially neural models like LLMs), and more broadly, the general principles of intelligence and intelligent behavior. I approach this goal across three interconnected levels:
- Behavioral level — analyzing how models and humans reason, generalize, and solve problems, including studies of alignment, limitations, and trustworthy reasoning.
- Mechanistic level — interpreting model internals to understand the circuits, representations, and algorithms that give rise to intelligent behavior and drive observable performance and failures.
- Social level — investigating how intelligence emerges and interacts in multi-agent systems and human–AI collaborations.
These three levels parallel psychology, neuroscience, and social science in their study of human intelligence and behavior.
If any of this resonates with your interests, feel free to reach out; I'd love to connect and collaborate!
Selected Publications
The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs
Pengrui Han*, Rafal D. Kocielnik*, Peiyang Song, Ramit Debnath, Dean Mobbs, Anima Anandkumar, and R. Michael Alvarez (* Equal Contribution)
International Conference on Machine Learning (ICML), 2026
NeurIPS LAW Workshop: Bridging Language, Agent, and World Models, 2025, Best Paper Honorable Mention
arXiv / project / code / media
LLMs say they have personalities, but they don’t act like it. Alignment today shapes language, not behavior. This linguistic–behavioral dissociation cautions against equating coherent self-reports with cognitive depth.
Large Language Model Reasoning Failures
Peiyang Song*, Pengrui Han*, and Noah Goodman (* Equal Contribution)
Transactions on Machine Learning Research (TMLR), 2026, Survey Certificate
arXiv / code / proceeding / media
We present the first comprehensive survey dedicated to reasoning failures in LLMs. By unifying fragmented research efforts, our survey offers a structured perspective on systemic weaknesses in LLM reasoning and guides future research toward stronger, more reliable, and more robust reasoning capabilities.
In-Context Learning May Not Elicit Trustworthy Reasoning: A-Not-B Errors in Pretrained Language Models
Pengrui Han*, Peiyang Song*, Haofei Yu, and Jiaxuan You (* Equal Contribution)
Findings of Empirical Methods in Natural Language Processing (EMNLP), 2024
arXiv / code / proceeding
Motivated by the crucial cognitive phenomenon of A-not-B errors, we present the first systematic evaluation of the surprisingly vulnerable inhibitory control abilities of LLMs. We reveal that this weakness undermines LLMs' trustworthy reasoning across diverse domains, and introduce various mitigations.
ChatGPT-Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs
Pengrui Han*, Rafal Kocielnik*, Adhithya Saravanan, Roy Jiang, Or Sharir, and Anima Anandkumar (* Equal Contribution)
Conference On Language Modeling (COLM), 2024
arXiv / code / proceeding
We propose a light and efficient pipeline that enables both domain and non-domain experts to quickly generate synthetic debiasing data to mitigate specific or general bias in their models with parameter-efficient fine-tuning.
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance
Guanyu Lin*, Tao Feng*, Pengrui Han*, Ge Liu, Jiaxuan You (* Equal Contribution)
System Demonstration Track of Empirical Methods in Natural Language Processing (EMNLP), 2024
Hugging Face Live Demo: Link
We present Paper Copilot, a self-evolving and efficient LLM system that provides personalized academic assistance.
Selected Awards
- TMLR Survey Certification (2026)
- NeurIPS LAW Workshop Best Paper Honorable Mention Award (2025)
- Phi Beta Kappa Honor Society (2025)
- Carleton College Chang-Lan Award (2024)
- Caltech SURF Award (2023)
- Carleton College Dean's List (2023)
Selected Media
- 'Not how you build a digital mind': How reasoning failures are preventing AI models from achieving human-level intelligence, Live Science, 2026
- Scientists Found AI’s Fatal Flaw—The Most Advanced Models Are Failing Basic Logic Tests, Popular Mechanics, 2026
- New Framework Simplifies the Complex Landscape of Agentic AI, VentureBeat, 2025
- This AI Paper Explains Why Most "Agentic AI" Systems Feel Impressive in Demos and then Completely Fall Apart in Real Use, MarkTechPost, 2025
- Researchers Discover "Personality Illusion" to Reveal a Profound Disconnect Between Language and Behavior in LLMs, MIT Technology Review China, 2025
Teaching
- CS 411: Database Systems, Teaching Assistant @ UIUC, Spring 2026
- CS 512: Data Mining Principles, Teaching Assistant @ UIUC, Fall 2025
- MATH 241: Ordinary Differential Equations, Teaching Assistant @ Carleton College, Fall 2024
- MATH 321: Real Analysis, Teaching Assistant @ Carleton College, Spring 2024
- MATH 232: Linear Algebra, Teaching Assistant @ Carleton College, Spring 2023
- MATH 232: Linear Algebra, Teaching Assistant @ Carleton College, Winter 2023
Academic Services
- Reviewer for conferences: ICLR, ICML, NeurIPS, ACL, COLM, COLING
- Reviewer for workshops: Re-Align, LLM-Cognition, BehaviorML, LTEDI, INTERPLAY, AI4Math, LatinX, Assessing World Models