B. Park | Paper Study 2025

Cognitive

Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs	COLM 2025
Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations	2025, url
On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts	Tenenbaum, 2025 url

Steering

Activation Steering in Generative Settings via Contrastive Causal Mediation Analysis

Knowledge Conflict

Conflict-Aware Soft Prompting for Retrieval-Augmented Generation (EMNLP 2025)	url
KCR: Resolving Long-Context Knowledge Conflicts via Reasoning in LLMs	url
An Unlearning-based Approach to Conflict-free Model Editing
Resolving Knowledge Conflicts in Large Language Models
Can I understand what I create? Self-Knowledge Evaluation of Large Language Models
Taming Knowledge Conflicts in Language Models (ICML 2025)

Neuron Interpretation for LLM

Unveiling the Pitfalls of Knowledge Editing for Large Language Models	(ICLR 2024)
The Origins of Representation Manifolds in Large Language Models	(2025)

LLM Study

Fluid Language Model Benchmarking	COLM 2025
Language Self-Play For Data-Free Training	(2025)

NMR Logic

An Efficient Reasoner for Description Logics of Typicality and Rational Closure	2017
Generics and Default Reasoning in Large Language Models	2025
Know your exceptions	2024

LLM Study

Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision
Scaling Agents via Continual Pre-training
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
AuPair: Golden Example Pairs for Code Repair (ICML 2025). Keywords: self-repair, fix quality matrix, submodular selection, in-context examples (AuPairs)	url