Denis Peskoff, Joe Barrow, Christopher Vu et al.
Progress in legal AI increasingly depends on access to authoritative legal text at scale. Yet one of the most consequential layers of American law remains largely absent from existing machine-readable...
V. Samuel Pérez-Díaz, Vinay L. Kashyap, Joshua D. Ingram et al.
We present a framework to cross-match sources from the Chandra Source Catalog (CSC v2.1) with optical sources from Gaia Data Release 3. Unlike purely spatial approaches, we use source properties such ...
Mohamed Nabail, Leo Cheng, Jingmin Wang et al.
Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on pa...
Siyi Gu, Jialin Chen, Sophia Zhou et al.
Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations t...
Michael Finkelson, Daniel Segal, Eitan Richardson et al.
Existing multi-speaker dialogue systems bind speakers to utterances through structured supervision: per-turn tags, multi-stream transcriptions, or learnable speaker embeddings. These systems operate w...
Anoushka Vyas, Aarushi Dhanuka, Sina Khoshfetrat Pakazad et al.
Production data integration is bottlenecked by repeated, lossy handoffs between data owners, engineers, and analysts who must collaboratively discover, structure, and query enterprise data. We present...
Amiri Hayes, Belinda Li, Jacob Andreas
A longstanding goal of research on interpretable deep learning is to replace opaque neural computations with human-meaningful symbolic descriptions. In this paper, we propose an approach for approxima...
Ruida Wang, Rui Pan, Pengcheng Wang et al.
Enhancing the formal math reasoning capabilities of Large Language Models (LLMs) has become a key focus in both mathematical and computer science communities in recent years. While significant progres...
Xizhuo, Zhang, Zekai Wang et al.
High-fidelity simulation of spatiotemporal dynamics is computationally prohibitive, necessitating efficient super-resolution techniques to reconstruct high-resolution data from coarse-grained inputs. ...
Christopher B. Womack, Shahine Bouabid, Andrei Sokolov et al.
As deep learning for physical systems continues to grow in popularity, efforts to improve generalizability have primarily focused on designing architectures that embed physical constraints. However, f...
Xin Ci Wong, Duygu Sarikaya, Kieran Zucker et al.
Glioma segmentation in multiparametric MRI is a critical component of treatment planning. A segmentation model that fails silently on treatment-critical sub-regions represents a patient safety risk th...