Cognitive
| Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs |
2025 COLM |
| Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations |
2025, url |
| On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts |
Tenenbaum, 2025 url |
Steering
| Activation Steering in Generative Settings via Contrastive Causal Mediation Analysis |
Knowledge Conflict
| Conflict-Aware Soft Prompting for Retrieval-Augmented Generation (EMNLP 2025) |
url |
| KCR: Resolving Long-Context Knowledge Conflicts via Reasoning in LLMs |
url |
| An Unlearning-based Approach to Conflict-free Model Editing |
|
| Resolving Knowledge Conflicts in Large Language Models |
|
| Can I understand what I create? Self-Knowledge Evaluation of Large Language Models |
|
| Taming Knowledge Conflicts in Language Models (ICML 2025) |
|
Neuron Interpretation for LLM
| Unveiling the Pitfalls of Knowledge Editing for Large Language Models |
(ICLR 2024) |
| The Origins of Representation Manifolds in Large Language Models |
(2025) |
LLM Study
|Fluid Language Model Benchmarking | COLM 2025 |
|Language Self-Play For Data-Free Training | (2025) |
NMR Logic
| An Efficient Reasoner for Description Logics of Typicality and Rational Closure? |
2017 |
| Generics and Default Reasoning in Large Language Models |
2025 |
| Know your exceptions |
2024 |
LLM Study
| Compute as Teacher: Turning Inference Compute Into Reference-Free Supervision |
|
| Scaling Agents via Continual Pre-training |
|
| DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning |
|
AuPair: Golden Example Pairs for Code Repair (ICML 2025) Self-repair, Fix Quality Matrix, Submodular selection, In-context examples (AuPairs) |
url |
| |
|