arXiv 最新论文档案

自动采集 arXiv 最新发表的计算机科学和机器学习论文,提供专业中文翻译、学术贡献分析和资源关联。

最新论文

12 篇
cs.AI 2026-06-18
Test Title

Author1

测试标题

Test abstract...

cs.CL 2026-06-17
Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

Denis Peskoff, Joe Barrow, Christopher Vu 等

Progress in legal AI increasingly depends on access to authoritative legal text at scale. Yet one of the most consequential layers of American law remains largely absent from existing machine-readable...

astro-ph.IM 2026-06-17
The Chandra-Gaia Catalog of Counterparts: Resolving ambiguous Gaia matches to X-ray sources in the Chandra Source Catalo...

V. Samuel Pérez-Díaz, Vinay L. Kashyap, Joshua D. Ingram 等

We present a framework to cross-match sources from the Chandra Source Catalog (CSC v2.1) with optical sources from Gaia Data Release 3. Unlike purely spatial approaches, we use source properties such ...

cs.LG 2026-06-17
UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

Mohamed Nabail, Leo Cheng, Jingmin Wang 等

Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on pa...

cs.AI 2026-06-17
Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

Siyi Gu, Jialin Chen, Sophia Zhou 等

Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations t...

cs.SD 2026-06-17
Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

Michael Finkelson, Daniel Segal, Eitan Richardson 等

Existing multi-speaker dialogue systems bind speakers to utterances through structured supervision: per-turn tags, multi-stream transcriptions, or learnable speaker embeddings. These systems operate w...

cs.MA 2026-06-17
Data Intelligence Agents: Interpreting, Modeling, and Querying Enterprise Data via Autonomous Coding Agents

Anoushka Vyas, Aarushi Dhanuka, Sina Khoshfetrat Pakazad 等

Production data integration is bottlenecked by repeated, lossy handoffs between data owners, engineers, and analysts who must collaboratively discover, structure, and query enterprise data. We present...

cs.LG 2026-06-17
Explaining Attention with Program Synthesis

Amiri Hayes, Belinda Li, Jacob Andreas

A longstanding goal of research on interpretable deep learning is to replace opaque neural computations with human-meaningful symbolic descriptions. In this paper, we propose an approach for approxima...

cs.LG 2026-06-17
Diffusion-Proof: Recipe for Formal Theorem Proving Beyond Auto-Regressive Generation

Ruida Wang, Rui Pan, Pengcheng Wang 等

Enhancing the formal math reasoning capabilities of Large Language Models (LLMs) has become a key focus in both mathematical and computer science communities in recent years. While significant progres...

cs.LG 2026-06-17
P-K-GCN: Physics-augmented Koopman-enhanced Graph Convolutional Network for Deep Spatiotemporal Super-resolution

Xizhuo, Zhang, Zekai Wang 等

High-fidelity simulation of spatiotemporal dynamics is computationally prohibitive, necessitating efficient super-resolution techniques to reconstruct high-resolution data from coarse-grained inputs. ...

physics.ao-ph 2026-06-17
Optimal scenario design for climate emulation

Christopher B. Womack, Shahine Bouabid, Andrei Sokolov 等

As deep learning for physical systems continues to grow in popularity, efforts to improve generalizability have primarily focused on designing architectures that embed physical constraints. However, f...

cs.CV 2026-06-17
Confidence is Not Reliability: Rethinking MC Dropout in Brain Tumour Segmentation

Xin Ci Wong, Duygu Sarikaya, Kieran Zucker 等

Glioma segmentation in multiparametric MRI is a critical component of treatment planning. A segmentation model that fails silently on treatment-critical sub-regions represents a patient safety risk th...