Blog

14 January 2026

Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations

NeurIPS'25

💡 Measures how well an LLM can perceive, evaluate, and regulate the states arising inside its own model, using a "neurofeedback" approach (adjusting the model's internal layers and vectors and measuring activation levels), and shows that this ability is limited.

이승환
14 January 2026

Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

ICLR'25

💡 Shows that the bottleneck in speculative decoding comes from verification based on alignment with the target model, and introduces Judge Decoding, a new verification scheme that uses the target model's embeddings to judge the correctness of each token.

최민영
14 January 2026

Interpreting the Repeated Token Phenomenon in Large Language Models

ICML'25

💡 When an LLM is made to repeat the same word over and over, at some point it can no longer repeat the word correctly and breaks down. This happens because the neurons that create the attention sink mistake the repeated tokens for the "first token of the sentence (BoS)", so attention piles up on them.