Blog

์—ผ๊ทœํ™˜
21 January 2026

Training a Generally Curious Agent

ICML'25

๐Ÿ’ก๋‚ด์žฌ์  ๋ณด์ƒ ์—†์ด๋„, LLM์ด ๋‹ค์–‘ํ•œ synthetic ์ƒํ˜ธ์ž‘์šฉ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ์ •๋ณด๋ฅผ ์Šค์Šค๋กœ ๋ชจ์œผ๊ณ , ๋‹จ๊ณ„๋ณ„๋กœ ํŒ๋‹จํ•˜๋ฉฐ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ๋ฐฐ์šฐ๊ฒŒ ํ•˜์ž!

21 January 2026

On LLM-Based Scientific Inductive Reasoning Beyond Equations

EMNLP'25

๐Ÿ’กํ˜„์žฌ LLM์€ โ€œ๋ฐฉ์ •์‹(์ˆ˜์‹)์œผ๋กœ ํ‘œํ˜„๋˜์ง€ ์•Š๋Š” ๊ณผํ•™์  ๊ทœ์น™โ€์„ ๊ด€์ฐฐ๋กœ๋ถ€ํ„ฐ ๊ท€๋‚ฉ์ ์œผ๋กœ ๋ฐœ๊ฒฌํ•˜๋Š” ๋ฐ ๊ทผ๋ณธ์ ์œผ๋กœ ์•ฝํ•˜๋‹ค.์ด๋ฅผ ๊ฒ€์ฆํ•˜๊ธฐ ์œ„ํ•ด ์ €์ž๋“ค์€ SIRBench-V1 ์ด๋ผ๋Š” ์ƒˆ๋กœ์šด ๋ฒค์น˜๋งˆํฌ๋ฅผ ๋งŒ๋“ค์—ˆ๊ณ , ์ตœ์‹  LLM๋“ค๋„ ๋Œ€๋ถ€๋ถ„ ๋‚ฎ์€ ์ •ํ™•๋„(๋ฝํ•ด์•ผ 45%) ์— ๋จธ๋ฌธ๋‹ค๋Š” ๊ฒƒ์„ ๋ณด์˜€๋‹ค.

์ด์Šนํ™˜
21 January 2026

MAP: Multi-Human-Value Alignment Palette

ICLR'25

๐Ÿ’ก๋‹ค์ค‘ ๊ฐ€์น˜ ์ •๋ ฌ์„ ๊ธฐ์กด์˜ ๊ฐ€์ค‘์น˜ ํŠœ๋‹ ๋ฐฉ์‹์ด ์•„๋‹ˆ๋ผ ์›ํ•˜๋Š” ์ˆ˜์ค€์˜ ๋ชฉํ‘œ(palette)๋ฅผ ๋จผ์ € ์ง€์ •ํ•˜๊ณ , ๊ทธ ๋ชฉํ‘œ๋ฅผ ๋งŒ์กฑํ•˜๋Š” ฮป๋ฅผ ์ž๋™์œผ๋กœ ์ฐพ์•„ Pareto ๊ฐœ์„ ์„ ๋ณด์žฅํ•˜๋Š” ์ •๋ ฌ๋กœ ๋ฐ”๊ฟ”๋ณด์ž!