stat.ML - arXiv 学术档案

cs.LG 2026-06-17

A Human-in-the-Loop Bayesian Optimization Framework for Constraint-Aware Bioprocess Development

This work presents an extension to Pareto Front Guided Sampling (PFGS), a Human-in-the-Loop (HitL) Bayesian Optimization (BO) framework in which Gaussian process (GP) surrogate-derived quantities are ...

Samuel Stricker, Claus Wirnsperger, Alessandro Butté 等

详情 PDF

stat.ML 2026-06-17

Generalised Eigenvalue Geometry of Semantic Adversarial Attacks

Recent empirical work shows that semantically equivalent paraphrases can fool financial sentiment classifiers: although a paraphrase remains close to the original under a strong reference embedding, i...

Martin Anthony, Kaveh Salehzadeh Nobari

详情 PDF

cs.LG 2026-06-17

Compute Efficiency and Serial Runtime Tradeoffs for Stochastic Momentum Methods

Stochastic momentum methods such as heavy ball (HB), Nesterov momentum, and variants of Accelerated SGD (ASGD) [Kidambi et al., 2018] are widely used in modern training, but their stochastic benefits ...

Depen Morwani, Alexandru Meterez, Pranav Nair 等

详情 PDF

stat.ML 2026-06-17

On Local Population-Risk Certificates

This paper develops local certificates for population-risk increments around a current model. For a local candidate set $\mathcal D$, the certificate is a two-sided confidence band for \(P({\ell_{θ+...

Mingzhi Song

详情 PDF

cs.LG 2026-06-17

INDEQS: Informed Neural controlled Differential EQuationS

Neural Controlled Differential Equations (NCDE) provide a powerful continuous-time framework for forecasting time series, but standard graph-based extensions typically learn spatial structure purely f...

Michael Detzel, Gabriel Nobis, Kristiyan Blagov 等

详情 PDF

stat.ME 2026-06-17

Wasserstein Policy Learning for Distributional Outcomes

Offline policy learning has received growing attention in causal inference. The primary objective is to learn a policy (individualized treatment rule) as a mapping from covariates to treatment that ma...

Yiyan Huang, Cheuk Hang Leung, Qi Wu 等

详情 PDF

cs.LG 2026-06-17

Smoothness-Based Derandomization of PAC-Bayes Bounds

We study PAC-Bayes derandomization for smooth loss functions. Our goal is to obtain generalization bounds that hold with high probability for deterministic predictors by exploiting smoothness properti...

Alexandre Lemire Paquin, Brahim Chaib-Draa, Philippe Giguère

详情 PDF

math.ST 2026-06-17

Optimal score function estimation via derivatives constraints

We consider the problem of score function estimation via empirical risk minimization. We first start with the question of inferring the score function of a probability measure $μ$ with density on the ...

Thomas Bonis, Thanh Mai Pham Ngoc, Viet Chi Tran

详情 PDF

stat.ML 2026-06-17

Quantifying and Auditing LLM Evaluation via Positive--Unlabeled Learning

Large Language Models (LLMs) are increasingly used as judges for scalable evaluation, yet such LLM--as--a--Judge systems exhibit systematic biases that are decoupled from semantic quality, most notabl...

Zilong Zhang, Yi-Ting Hung, Lei Ding 等

详情 PDF

stat.ML 2026-06-17

Sequential Kernel-based Conditional Independence Testing via Adaptive Betting

Testing conditional independence is fundamental yet intrinsically difficult: without additional assumptions, Type I error control is impossible in general. The "Model-X'' paradigm addresses this diffi...

Zheng He, Danica J. Sutherland

详情 PDF

stat.ML 2026-06-17

FOSC-X: An Extended Framework for Optimal Local Cuts and Non-Horizontal Cluster Selection from Clustering Hierarchies

Extracting a flat clustering solution from a hierarchy is a common task in practical cluster analysis and can be formulated as an optimisation problem. Existing approaches focus on finding a single op...

Connor Simpson, Ricardo J. G. B. Campello

详情 PDF

stat.ME 2026-06-17

Balanced Twins: Causal Inference on Time Series with Hidden Confounding

Accurately estimating treatment effects in time series is essential for evaluating interventions in real-world applications, especially when treatment assignment is biased by unobserved factors. In ma...

Ouali Maha, Ghattas Badih, Flachaire Emmanuel 等

详情 PDF

cs.LG 2026-06-17

Strategic Feature Selection

When algorithmic predictors inform resource allocation in high-stakes domains such as healthcare, these predictors must account for strategic manipulation of input features. The typical solution is to...

Jivat Neet Kaur, Pratik Patil, Divya Shanmugam 等

详情 PDF

stat.ML 2026-06-17

Kernel of Partition Paths: A Unified Representation for Tree Ensembles

A recent line of work has reframed individual decision trees as linear models on engineered features associated with their splits, opening routes for oracle inequalities and feature-importance reinter...

Nicolas Mahler

详情 PDF

cs.LG 2026-06-17

Online Distributional Prediction via Latent Cluster Geometry Under Drift and Corruption

Online learning in non-stationary streams is often formulated as tracking a point estimate, but many applications require predicting the full data-generating distribution. We study online distribution...

Navyansh Mahla, Prateek Chanda, Ganesh Ramakrishnan

详情 PDF

stat.ML 2026-06-17

TimeLAVA: Learning-Agnostic Data Valuation for Time Series

Data valuation quantifies the intrinsic quality of individual samples to enable principled data curation, quality control, and robust learning. For time series in critical domains such as healthcare, ...

Wenqin Liu, Weizhi Quan, Aoqi Zuo 等

详情 PDF