cs.CL - arXiv 学术档案

cs.CL 2026-06-17

Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

Progress in legal AI increasingly depends on access to authoritative legal text at scale. Yet one of the most consequential layers of American law remains largely absent from existing machine-readable...

Denis Peskoff, Joe Barrow, Christopher Vu 等

详情 PDF

cs.AI 2026-06-17

Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations t...

Siyi Gu, Jialin Chen, Sophia Zhou 等

详情 PDF

cs.CL 2026-06-17

Trade-offs in Medical LLM Adaptation: An Empirical Study in French QA

The development of large language models (LLMs) has led to an increased focus on their adaptation to specialized domains and languages, yet the effectiveness of domain adaptation strategies remains un...

Ikram Belmadani, Oumaima El Khettari, Carlos Ramisch 等

详情 PDF

cs.LG 2026-06-17

Structured Inference with Large Language Gibbs

The knowledge encoded in large language models (LLMs) can serve as a substrate for structured reasoning over variables describing a complex world, but accessing this knowledge in a probabilistically c...

Sanghyeok Choi, Henry Gouk, Esmeralda S. Whitammer

详情 PDF

cs.LG 2026-06-17

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Reinforcement Learning with Verifiable Rewards algorithms like GRPO have emerged as the dominant post-training paradigm for complex reasoning in LLMs, yet commonly suffer from policy entropy collapse ...

Haipeng Luo, Qingfeng Sun, Songli Wu 等

详情 PDF