cs.CL

Latest papers in this category

cs.CL 2026-06-17
Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

Progress in legal AI increasingly depends on access to authoritative legal text at scale. Yet one of the most consequential layers of American law remains largely absent from existing machine-readable...

Denis Peskoff, Joe Barrow, Christopher Vu et al.
cs.AI 2026-06-17
Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations t...

Siyi Gu, Jialin Chen, Sophia Zhou et al.
cs.CL 2026-06-17
Trade-offs in Medical LLM Adaptation: An Empirical Study in French QA

The development of large language models (LLMs) has led to an increased focus on their adaptation to specialized domains and languages, yet the effectiveness of domain adaptation strategies remains un...

Ikram Belmadani, Oumaima El Khettari, Carlos Ramisch et al.
cs.LG 2026-06-17
Structured Inference with Large Language Gibbs

The knowledge encoded in large language models (LLMs) can serve as a substrate for structured reasoning over variables describing a complex world, but accessing this knowledge in a probabilistically c...

Sanghyeok Choi, Henry Gouk, Esmeralda S. Whitammer
cs.LG 2026-06-17
STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Reinforcement Learning with Verifiable Rewards algorithms like GRPO have emerged as the dominant post-training paradigm for complex reasoning in LLMs, yet commonly suffer from policy entropy collapse ...

Haipeng Luo, Qingfeng Sun, Songli Wu et al.