cs.AI - arXiv 学术档案

cs.AI 2026-06-18

Test Title

测试标题

Test abstract...

Author1

cs.LG 2026-06-17

UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on pa...

Mohamed Nabail, Leo Cheng, Jingmin Wang 等

详情 PDF

cs.AI 2026-06-17

Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations t...

Siyi Gu, Jialin Chen, Sophia Zhou 等

详情 PDF

cs.SD 2026-06-17

Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

Existing multi-speaker dialogue systems bind speakers to utterances through structured supervision: per-turn tags, multi-stream transcriptions, or learnable speaker embeddings. These systems operate w...

Michael Finkelson, Daniel Segal, Eitan Richardson 等

详情 PDF

cs.MA 2026-06-17

Data Intelligence Agents: Interpreting, Modeling, and Querying Enterprise Data via Autonomous Coding Agents

Production data integration is bottlenecked by repeated, lossy handoffs between data owners, engineers, and analysts who must collaboratively discover, structure, and query enterprise data. We present...

Anoushka Vyas, Aarushi Dhanuka, Sina Khoshfetrat Pakazad 等

详情 PDF

cs.LG 2026-06-17

Explaining Attention with Program Synthesis

A longstanding goal of research on interpretable deep learning is to replace opaque neural computations with human-meaningful symbolic descriptions. In this paper, we propose an approach for approxima...

Amiri Hayes, Belinda Li, Jacob Andreas

详情 PDF

cs.HC 2026-06-17

Correct Yourself, Keep My Trust: How Self-Correction and Social Connection Shape Credibility in Social Chatbots

When social chatbots make mistakes, and they do, how they recover determines whether users trust them again. Social chatbots are increasingly integrated into everyday life, yet they remain prone to ge...

Biswadeep Sen, Yi-Chieh Lee

详情 PDF

cs.AI 2026-06-17

NeSyCat Torch: A Differentiable Tensor Implementation of Categorical Semantics for Neurosymbolic Learning

Neurosymbolic semantics is fragmented: classical, fuzzy, probabilistic and neural systems each define truth by their own inductive rules. NeSyCat, extending ULLER, subsumes them under a single inducti...

Daniel Romero Schellhorn, Till Mossakowski, Björn Gehrke

详情 PDF

cs.CL 2026-06-17

Trade-offs in Medical LLM Adaptation: An Empirical Study in French QA

The development of large language models (LLMs) has led to an increased focus on their adaptation to specialized domains and languages, yet the effectiveness of domain adaptation strategies remains un...

Ikram Belmadani, Oumaima El Khettari, Carlos Ramisch 等

详情 PDF

cs.CV 2026-06-17

A Multi-Domain Benchmark for Detecting AI-Generated Text-Rich Images from GPT-Image-2

Text-rich images often contain privacy-sensitive, transactional, or decision-relevant information. As recent multimodal image generation models become increasingly capable of synthesizing realistic te...

Yijin Wang, Shuyi Wang, Wenhan Zhang 等

详情 PDF

cs.AI 2026-06-17

X+Slides: Benchmarking Audience-Conditioned Slide Generation

Automatically generating slide decks from source documents is an important application of large language models (LLMs). Existing benchmarks primarily assess slide completeness and technical depth, whi...

Haodong Chen, Xuanhe Zhou, Wei Zhou 等

详情 PDF

cs.CV 2026-06-17

OneCanvas: 3D Scene Understanding via Panoramic Reprojection

Existing approaches to 3D scene understanding in Vision-Language Models (VLMs) either rely on complex, model-specific geometry encoders or large training budgets in pursuit of spatial reasoning. Inste...

Bartłomiej Baranowski, Dave Zhenyu Chen, Matthias Nießner

详情 PDF

cs.HC 2026-06-17

A Taxonomy of Mental Health and Technology Needs for Alzheimer's and Dementia Caregivers

Family members caring for individuals with Alzheimer's disease and related dementias (AD/ADRD) provide the foundation of long-term care worldwide. In 2023, more than 11 million U.S. family and friends...

Keran Wang, Drishti Goel, Jiayue Melissa Shi 等

详情 PDF

cs.AI 2026-06-17

TxBench-PP: Analyzing AI Agent Performance on Small-Molecule Preclinical Pharmacology

Artificial intelligence (AI) agents promise to accelerate drug discovery by compressing interpretation and decision-making loops, but practical deployment requires trusted evaluation on realistic prog...

Hannah Le, Ramesh Ramasamy, Alex Urrutia 等

详情 PDF

cs.LG 2026-06-17

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Reinforcement Learning with Verifiable Rewards algorithms like GRPO have emerged as the dominant post-training paradigm for complex reasoning in LLMs, yet commonly suffer from policy entropy collapse ...

Haipeng Luo, Qingfeng Sun, Songli Wu 等

详情 PDF

cs.LG 2026-06-17

Mechanism-Guided Selective Unlearning for RLVR-Induced Reasoning

We propose MAST (Mechanism-Aligned Selective Targeting), a mechanism-guided method for unlearning RLVR-induced reasoning with substantially lower collateral damage than standard full-parameter updates...

Chenyu Zhou, Qiliang Jiang, Shuning Wu 等

详情 PDF

cs.LG 2026-06-17

Machine Unlearning for the XGBoost Model with Network Intrusion Datasets

Machine Unlearning (MU) has emerged as an important technique for removing specific data points from trained models without requiring full retraining. However, most existing MU research focuses on dee...

Diana Magalhães, Eva Maia, João Vitorino 等

详情 PDF

cs.LG 2026-06-17

Compute Efficiency and Serial Runtime Tradeoffs for Stochastic Momentum Methods

Stochastic momentum methods such as heavy ball (HB), Nesterov momentum, and variants of Accelerated SGD (ASGD) [Kidambi et al., 2018] are widely used in modern training, but their stochastic benefits ...

Depen Morwani, Alexandru Meterez, Pranav Nair 等

详情 PDF