Showing 1–50 of 528 results for author: Zhang, Z

Searching in archive stat.
  1. arXiv:2507.07359  [pdf, ps, other]

    cs.LG cs.AI stat.ME stat.ML

    Goal-Oriented Sequential Bayesian Experimental Design for Causal Learning

    Authors: Zheyu Zhang, Jiayuan Dong, Jie Liu, Xun Huan

    Abstract: We present GO-CBED, a goal-oriented Bayesian framework for sequential causal experimental design. Unlike conventional approaches that select interventions aimed at inferring the full causal model, GO-CBED directly maximizes the expected information gain (EIG) on user-specified causal quantities of interest, enabling more targeted and efficient experimentation. The framework is both non-myopic, opt…

    Submitted 9 July, 2025; originally announced July 2025.

    Comments: 10 pages, 6 figures

  2. arXiv:2507.04044  [pdf, ps, other]

    stat.ME econ.EM

    A New and Efficient Debiased Estimation of General Treatment Models by Balanced Neural Networks Weighting

    Authors: Zeqi Wu, Meilin Wang, Wei Huang, Zheng Zhang

    Abstract: Estimation and inference of treatment effects under unconfounded treatment assignments often suffer from bias and the 'curse of dimensionality' due to the nonparametric estimation of nuisance parameters for high-dimensional confounders. Although debiased state-of-the-art methods have been proposed for binary treatments under particular treatment models, they can be unstable for small sample sizes.…

    Submitted 5 July, 2025; originally announced July 2025.

  3. arXiv:2506.23453  [pdf, ps, other]

    stat.ML cs.LG

    Minimax Optimal Two-Stage Algorithm For Moment Estimation Under Covariate Shift

    Authors: Zhen Zhang, Xin Liu, Shaoli Wang, Jiaye Teng

    Abstract: Covariate shift occurs when the distribution of input features differs between the training and testing phases. In covariate shift, estimating an unknown function's moment is a classical problem that remains under-explored, despite its common occurrence in real-world scenarios. In this paper, we investigate the minimax lower bound of the problem when the source and target distributions are known.…

    Submitted 29 June, 2025; originally announced June 2025.

  4. arXiv:2506.14899  [pdf, ps, other]

    stat.ML cs.LG

    Optimal Convergence Rates of Deep Neural Network Classifiers

    Authors: Zihan Zhang, Lei Shi, Ding-Xuan Zhou

    Abstract: In this paper, we study the binary classification problem on $[0,1]^d$ under the Tsybakov noise condition (with exponent $s \in [0,\infty]$) and the compositional assumption. This assumption requires the conditional class probability function of the data distribution to be the composition of $q+1$ vector-valued multivariate functions, where each component function is either a maximum value functio…

    Submitted 17 June, 2025; originally announced June 2025.

  5. arXiv:2506.12751  [pdf, ps, other]

    stat.ML cs.LG

    Single Index Bandits: Generalized Linear Contextual Bandits with Unknown Reward Functions

    Authors: Yue Kang, Mingshuo Liu, Bongsoo Yi, Jing Lyu, Zhi Zhang, Doudou Zhou, Yao Li

    Abstract: Generalized linear bandits have been extensively studied due to their broad applicability in real-world online decision-making problems. However, these methods typically assume that the expected reward function is known to the users, an assumption that is often unrealistic in practice. Misspecification of this link function can lead to the failure of all existing algorithms. In this work, we addre…

    Submitted 15 June, 2025; originally announced June 2025.

  6. arXiv:2506.07854  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Residual Reweighted Conformal Prediction for Graph Neural Networks

    Authors: Zheng Zhang, Jie Bao, Zhixin Zhou, Nicolo Colombo, Lixin Cheng, Rui Luo

    Abstract: Graph Neural Networks (GNNs) excel at modeling relational data but face significant challenges in high-stakes domains due to unquantified uncertainty. Conformal prediction (CP) offers statistical coverage guarantees, but existing methods often produce overly conservative prediction intervals that fail to account for graph heteroscedasticity and structural biases. While residual reweighting CP vari…

    Submitted 9 June, 2025; originally announced June 2025.

  7. arXiv:2506.07469  [pdf, ps, other]

    stat.ME econ.EM math.ST

    Individual Treatment Effect: Prediction Intervals and Sharp Bounds

    Authors: Zhehao Zhang, Thomas S. Richardson

    Abstract: Individual treatment effect (ITE) is often regarded as the ideal target of inference in causal analyses and has been the focus of several recent studies. In this paper, we describe the intrinsic limits regarding what can be learned concerning ITEs given data from large randomized experiments. We consider when a valid prediction interval for the ITE is informative and when it can be bounded away fr…

    Submitted 9 June, 2025; originally announced June 2025.

  8. arXiv:2506.06521  [pdf, ps, other]

    cs.LG stat.ML

    Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs

    Authors: Shulun Chen, Runlong Zhou, Zihan Zhang, Maryam Fazel, Simon S. Du

    Abstract: We consider the gap-dependent regret bounds for episodic MDPs. We show that the Monotonic Value Propagation (MVP) algorithm achieves a variance-aware gap-dependent regret bound of…

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 30 pages

  9. arXiv:2506.00933  [pdf, ps, other]

    stat.ML cs.LG

    Reconstruction and Prediction of Volterra Integral Equations Driven by Gaussian Noise

    Authors: Zhihao Xu, Saisai Ding, Zhikun Zhang, Xiangjun Wang

    Abstract: Integral equations are widely used in fields such as applied modeling, medical imaging, and system identification, providing a powerful framework for solving deterministic problems. While parameter identification for differential equations has been extensively studied, the focus on integral equations, particularly stochastic Volterra integral equations, remains limited. This research addresses the…

    Submitted 1 June, 2025; originally announced June 2025.

  10. arXiv:2506.00866  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Projection Pursuit Density Ratio Estimation

    Authors: Meilin Wang, Wei Huang, Mingming Gong, Zheng Zhang

    Abstract: Density ratio estimation (DRE) is a paramount task in machine learning, given its broad applications across multiple domains, such as covariate shift adaptation, causal inference, independence tests and beyond. Parametric methods for estimating the density ratio may lead to biased results if models are misspecified, while conventional non-parametric methods suffer from the curse of dimensionali…

    Submitted 1 June, 2025; originally announced June 2025.

  11. arXiv:2505.24078  [pdf, ps, other]

    stat.AP econ.GN

    Estimation of Gender Wage Gap in the University of North Carolina System

    Authors: Zihan Zhang, Jan Hannig

    Abstract: Gender pay equity remains an open challenge in academia despite decades of movements. Prior studies, however, have relied largely on descriptive regressions, leaving causal analysis underexplored. This study examines gender-based wage disparities among tenure-track faculty in the University of North Carolina system using both parametric and non-parametric causal inference methods. In particular, w…

    Submitted 29 May, 2025; originally announced May 2025.

  12. arXiv:2505.19043  [pdf, ps, other]

    cs.LG stat.ML

    Offline Clustering of Linear Bandits: Unlocking the Power of Clusters in Data-Limited Environments

    Authors: Jingyuan Liu, Zeyu Zhang, Xuchuang Wang, Xutong Liu, John C. S. Lui, Mohammad Hajiesmaili, Carlee Joe-Wong

    Abstract: Contextual linear multi-armed bandits are a learning framework for making a sequence of decisions, e.g., advertising recommendations for a sequence of arriving users. Recent works have shown that clustering these users based on the similarity of their learned preferences can significantly accelerate the learning. However, prior work has primarily focused on the online setting, which requires conti…

    Submitted 25 May, 2025; originally announced May 2025.

  13. arXiv:2505.15944  [pdf, other]

    stat.ME

    Optimal Treatment Allocations Accounting for Population Differences

    Authors: Wei Zhang, Zhiwei Zhang, Aiyi Liu

    Abstract: The treatment allocation mechanism in a randomized clinical trial can be optimized by maximizing the nonparametric efficiency bound for a specific measure of treatment effect. Optimal treatment allocations which may or may not depend on baseline covariates have been derived for a variety of effect measures focusing on the trial population, the patient population represented by the trial participan…

    Submitted 21 May, 2025; originally announced May 2025.

  14. arXiv:2505.09861  [pdf, other]

    cs.LG cs.AI cs.IR stat.ME

    LiDDA: Data Driven Attribution at LinkedIn

    Authors: John Bencina, Erkut Aykutlug, Yue Chen, Zerui Zhang, Stephanie Sorenson, Shao Tang, Changshuai Wei

    Abstract: Data Driven Attribution, which assigns conversion credits to marketing interactions based on causal patterns learned from data, is the foundation of modern marketing intelligence and vital to any marketing businesses and advertising platform. In this paper, we introduce a unified transformer-based attribution approach that can handle member-level data, aggregate-level data, and integration of exte…

    Submitted 21 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

  15. arXiv:2505.07825  [pdf, other]

    stat.ML cs.LG math.PR

    Diffusion-based supervised learning of generative models for efficient sampling of multimodal distributions

    Authors: Hoang Tran, Zezhong Zhang, Feng Bao, Dan Lu, Guannan Zhang

    Abstract: We propose a hybrid generative model for efficient sampling of high-dimensional, multimodal probability distributions for Bayesian inference. Traditional Monte Carlo methods, such as the Metropolis-Hastings and Langevin Monte Carlo sampling methods, are effective for sampling from single-mode distributions in high-dimensional spaces. However, these methods struggle to produce samples with the corr…

    Submitted 20 April, 2025; originally announced May 2025.

  16. arXiv:2505.05338  [pdf, other]

    stat.ME

    A Unified Approach to Covariate Adjustment for Survival Endpoints in Randomized Clinical Trials

    Authors: Zhiwei Zhang, Ya Wang, Dong Xi

    Abstract: Covariate adjustment aims to improve the statistical efficiency of randomized trials by incorporating information from baseline covariates. Popular methods for covariate adjustment include analysis of covariance for continuous endpoints and standardized logistic regression for binary endpoints. For survival endpoints, while some covariate adjustment methods have been developed for specific effect…

    Submitted 8 May, 2025; originally announced May 2025.

  17. arXiv:2504.17322  [pdf, ps, other]

    stat.ME

    Testing Conditional Independence via Density Ratio Regression

    Authors: Chunrong Ai, Zixuan Xu, Zheng Zhang

    Abstract: This paper develops a conditional independence (CI) test from a conditional density ratio (CDR) for weakly dependent data. The main contribution is presenting a closed-form expression for the estimated conditional density ratio function with good finite-sample performance. The key idea is exploiting the linear sieve combined with the quadratic norm. Matsushita et al. (2022) exploited the linear si…

    Submitted 24 April, 2025; originally announced April 2025.

  18. arXiv:2504.12288  [pdf, other]

    stat.ME

    The underlap coefficient as a measure of a biomarker's discriminatory ability

    Authors: Zhaoxi Zhang, Vanda Inacio, Miguel de Carvalho

    Abstract: The first step in evaluating a potential diagnostic biomarker is to examine the variation in its values across different disease groups. In a three-class disease setting, the volume under the receiver operating characteristic surface and the three-class Youden index are commonly used summary measures of a biomarker's discriminatory ability. However, these measures rely on a stochastic ordering ass…

    Submitted 17 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  19. arXiv:2504.07307  [pdf, ps, other]

    cs.LG stat.ML

    Follow-the-Perturbed-Leader Approaches Best-of-Both-Worlds for the m-Set Semi-Bandit Problems

    Authors: Jingxin Zhan, Yuchen Xin, Chenjie Sun, Zhihua Zhang

    Abstract: We consider a common case of the combinatorial semi-bandit problem, the $m$-set semi-bandit, where the learner selects exactly $m$ arms out of $d$ total arms. In the adversarial setting, the best regret bound, known to be $\mathcal{O}(\sqrt{nmd})$ for time horizon $n$, is achieved by the well-known Follow-the-Regularized-Leader (FTRL) policy. However, this requires explicitly computing the arm-…

    Submitted 7 July, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

  20. arXiv:2504.02631  [pdf, other]

    stat.CO

    Feature splitting parallel algorithm for Dantzig selectors

    Authors: Xiaofei Wu, Yue Chao, Rongmei Liang, Shi Tang, Zhiming Zhang

    Abstract: The Dantzig selector is a widely used and effective method for variable selection in ultra-high-dimensional data. Feature splitting is an efficient processing technique that involves dividing these ultra-high-dimensional variable datasets into manageable subsets that can be stored and processed more easily on a single machine. This paper proposes a variable splitting parallel algorithm for solving…

    Submitted 3 April, 2025; originally announced April 2025.

  21. arXiv:2504.01761  [pdf, other]

    stat.ME econ.EM

    Non-parametric Quantile Regression and Uniform Inference with Unknown Error Distribution

    Authors: Haoze Hou, Wei Huang, Zheng Zhang

    Abstract: This paper studies non-parametric estimation and uniform inference for the conditional quantile regression function (CQRF) with covariates exposed to measurement errors. We consider the case where the distribution of the measurement error is unknown and allowed to be either ordinary or super smooth. We estimate the density of the measurement error from the repeated measurements and propose the de…

    Submitted 2 April, 2025; originally announced April 2025.

  22. arXiv:2503.19218  [pdf, other]

    cs.LG stat.ML

    Analytic DAG Constraints for Differentiable DAG Learning

    Authors: Zhen Zhang, Ignavier Ng, Dong Gong, Yuhang Liu, Mingming Gong, Biwei Huang, Kun Zhang, Anton van den Hengel, Javen Qinfeng Shi

    Abstract: Recovering the underlying Directed Acyclic Graph (DAG) structures from observational data presents a formidable challenge, partly due to the combinatorial nature of the DAG-constrained optimization problem. Recently, researchers have identified gradient vanishing as one of the primary obstacles in differentiable DAG learning and have proposed several DAG constraints to mitigate this issue. By deve…

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: Accepted to ICLR 2025

    Journal ref: ICLR 2025

  23. arXiv:2503.15830  [pdf, other]

    stat.ME stat.AP stat.CO

    Alignment of Continuous Brain Connectivity

    Authors: Martin Cole, Yang Xiang, Will Consagra, Anuj Srivastava, Xing Qiu, Zhengwu Zhang

    Abstract: Brain networks are typically represented by adjacency matrices, where each node corresponds to a brain region. In traditional brain network analysis, nodes are assumed to be matched across individuals, but the methods used for node matching often overlook the underlying connectivity information. This oversight can result in inaccurate node alignment, leading to inflated edge variability and reduce…

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 12 pages of main text and 8 pages of supplemental material. 10 figures in main text and 3 figures in supplemental material

  24. arXiv:2503.09310  [pdf, other]

    stat.ME math.ST

    Competing-risk Weibull survival model with multiple causes

    Authors: Kai Wang, Yuqin Mu, Shenyi Zhang, Zhengjun Zhang, Chengxiu Ling

    Abstract: The failure of a system can result from the simultaneous effects of multiple causes, where assigning a specific cause may be inappropriate or unavailable. Examples include contributing causes of death in epidemiology and the aetiology of neurodegenerative diseases like Alzheimer's. We propose a parametric Weibull accelerated failure time model for multiple causes, incorporating a data-driven, indi…

    Submitted 12 March, 2025; originally announced March 2025.

  25. arXiv:2502.15310  [pdf, other]

    stat.ME

    Max-Linear Tail Regression

    Authors: Liujun Chen, Deyuan Li, Zhengjun Zhang

    Abstract: The relationship between a response variable and its covariates can vary significantly, especially in scenarios where covariates take on extremely high or low values. This paper introduces a max-linear tail regression model specifically designed to capture such extreme relationships. To estimate the regression coefficients within this framework, we propose a novel M-estimator based on extreme valu…

    Submitted 21 February, 2025; originally announced February 2025.

  26. arXiv:2502.14208  [pdf, ps, other]

    cs.LG math.OC stat.ML

    A Non-Asymptotic Theory of Seminorm Lyapunov Stability: From Deterministic to Stochastic Iterative Algorithms

    Authors: Zaiwei Chen, Sheng Zhang, Zhe Zhang, Shaan Ul Haque, Siva Theja Maguluri

    Abstract: We study the problem of solving fixed-point equations for seminorm-contractive operators and establish foundational results on the non-asymptotic behavior of iterative algorithms in both deterministic and stochastic settings. Specifically, in the deterministic setting, we prove a fixed-point theorem for seminorm-contractive operators, showing that iterates converge geometrically to the kernel of t…

    Submitted 19 February, 2025; originally announced February 2025.

  27. arXiv:2502.14172  [pdf, other]

    stat.ML cs.LG

    A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation

    Authors: Yang Peng, Kaicheng Jin, Liangyu Zhang, Zhihua Zhang

    Abstract: In this paper, we study the finite-sample statistical rates of distributional temporal difference (TD) learning with linear function approximation. The aim of distributional TD learning is to estimate the return distribution of a discounted Markov decision process for a given policy π. Previous works on statistical analysis of distributional TD learning mainly focus on the tabular case. In contras…

    Submitted 13 May, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

  28. arXiv:2502.13117  [pdf, other]

    stat.AP cs.AI

    Performance Evaluation of Large Language Models in Statistical Programming

    Authors: Xinyi Song, Kexin Xie, Lina Lee, Ruizhe Chen, Jared M. Clark, Hao He, Haoran He, Jie Min, Xinlei Zhang, Simin Zheng, Zhiyang Zhang, Xinwei Deng, Yili Hong

    Abstract: The programming capabilities of large language models (LLMs) have revolutionized automatic code generation and opened new avenues for automatic statistical analysis. However, the validity and quality of this generated code need to be systematically evaluated before it can be widely adopted. Despite their growing prominence, a comprehensive evaluation of statistical code generated by LLMs remai…

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 27 pages, 8 figures

  29. arXiv:2502.06719  [pdf, ps, other]

    stat.ML cs.LG math.OC math.PR math.ST

    Gaussian Approximation and Multiplier Bootstrap for Stochastic Gradient Descent

    Authors: Marina Sheshukova, Sergey Samsonov, Denis Belomestny, Eric Moulines, Qi-Man Shao, Zhuo-Song Zhang, Alexey Naumov

    Abstract: In this paper, we establish non-asymptotic convergence rates in the central limit theorem for Polyak-Ruppert-averaged iterates of stochastic gradient descent (SGD). Our analysis builds on the result of the Gaussian approximation for nonlinear statistics of independent random variables of Shao and Zhang (2022). Using this result, we prove the non-asymptotic validity of the multiplier bootstrap for…

    Submitted 10 February, 2025; originally announced February 2025.

    MSC Class: 60F05; 62L20; 93E35

  30. arXiv:2502.04543  [pdf, ps, other]

    stat.ML cs.LG

    Sparsity-Based Interpolation of External, Internal and Swap Regret

    Authors: Zhou Lu, Y. Jennifer Sun, Zhiyu Zhang

    Abstract: Focusing on the expert problem in online learning, this paper studies the interpolation of several performance metrics via $φ$-regret minimization, which measures the total loss of an algorithm by its regret with respect to an arbitrary action modification rule $φ$. With $d$ experts and $T\gg d$ rounds in total, we present a single algorithm achieving the instance-adaptive $φ$-regret bound \begin{…

    Submitted 17 June, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: COLT 2025. Equal contribution, alphabetical order

  31. arXiv:2502.00983  [pdf, other]

    cs.LG stat.ML

    CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation

    Authors: Zhengzhe Zhang, Wenjia Meng, Haoliang Sun, Gang Pan

    Abstract: Context-based offline meta-reinforcement learning (OMRL) methods have achieved appealing success by leveraging pre-collected offline datasets to develop task representations that guide policy learning. However, current context-based OMRL methods often introduce spurious correlations, where task components are incorrectly correlated due to confounders. These correlations can degrade policy performa…

    Submitted 2 February, 2025; originally announced February 2025.

  32. arXiv:2502.00639  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer

    Authors: Tao Ren, Zishi Zhang, Zehao Li, Jingyang Jiang, Shentao Qin, Guanghao Li, Yan Li, Yi Zheng, Xinping Li, Min Zhan, Yijie Peng

    Abstract: The probabilistic diffusion model (DM), generating content by inferencing through a recursive chain structure, has emerged as a powerful framework for visual generation. After pre-training on enormous unlabeled data, the model needs to be properly aligned to meet requirements for downstream applications. How to efficiently align the foundation DM is a crucial task. Contemporary methods are either…

    Submitted 24 March, 2025; v1 submitted 1 February, 2025; originally announced February 2025.

  33. arXiv:2501.17358  [pdf, other]

    stat.ME

    Outcome Regression Methods for Analyzing Hybrid Control Studies: Balancing Bias and Variability

    Authors: Zhiwei Zhang, Jialuo Liu, Wei Liu

    Abstract: There is growing interest in a hybrid control design in which a randomized controlled trial is augmented with an external control arm from a previous trial or real world data. Existing methods for analyzing hybrid control studies include various downweighting and propensity score methods as well as methods that combine downweighting with propensity score stratification. In this article, we describ…

    Submitted 28 January, 2025; originally announced January 2025.

  34. arXiv:2501.14860  [pdf, other]

    math.ST stat.ME

    The typicality principle and its implications for statistics and data science

    Authors: Yiran Jiang, Zeyu Zhang, Ryan Martin, Chuanhai Liu

    Abstract: A central focus of data science is the transformation of empirical evidence into knowledge. As such, the key insights and scientific attitudes of deep thinkers like Fisher, Popper, and Tukey are expected to inspire exciting new advances in machine learning and artificial intelligence in years to come. Along these lines, the present paper advances a novel typicality principle which states, ro…

    Submitted 24 January, 2025; originally announced January 2025.

  35. arXiv:2501.11323  [pdf]

    cs.LG eess.SP physics.app-ph stat.ML

    Physics-Informed Machine Learning for Efficient Reconfigurable Intelligent Surface Design

    Authors: Zhen Zhang, Jun Hui Qiu, Jun Wei Zhang, Hui Dong Li, Dong Tang, Qiang Cheng, Wei Lin

    Abstract: A reconfigurable intelligent surface (RIS) is a two-dimensional periodic structure integrated with a large number of reflective elements, which can manipulate electromagnetic waves in a digital way, offering great potential for wireless communication and radar detection applications. However, conventional RIS designs rely heavily on extensive full-wave EM simulations that are extremely time-consumin…

    Submitted 20 January, 2025; originally announced January 2025.

  36. arXiv:2501.11127  [pdf, other]

    math.OC cs.LG stat.ML

    A Regularized Online Newton Method for Stochastic Convex Bandits with Linear Vanishing Noise

    Authors: Jingxin Zhan, Yuchen Xin, Kaicheng Jin, Zhihua Zhang

    Abstract: We study a stochastic convex bandit problem where the subgaussian noise parameter is assumed to decrease linearly as the learner selects actions closer and closer to the minimizer of the convex loss function. Accordingly, we propose a Regularized Online Newton Method (RONM) for solving the problem, based on the Online Newton Method (ONM) of arXiv:2406.06506. Our RONM reaches a polylogarithmic regr…

    Submitted 19 January, 2025; originally announced January 2025.

  37. arXiv:2501.07035  [pdf, other]

    stat.CO

    Parallel ADMM Algorithm with Gaussian Back Substitution for High-Dimensional Quantile Regression and Classification

    Authors: Xiaofei Wu, Dingzi Guo, Rongmei Liang, Zhimin Zhang

    Abstract: In the field of high-dimensional data analysis, modeling methods based on the quantile loss function are highly regarded due to their ability to provide a comprehensive statistical perspective and effective handling of heterogeneous data. In recent years, many studies have focused on using the parallel alternating direction method of multipliers (P-ADMM) to solve high-dimensional quantile regression a…

    Submitted 12 January, 2025; originally announced January 2025.

  38. arXiv:2501.05012  [pdf, other]

    stat.ME q-bio.QM

    SyNPar: Synthetic Null Data Parallelism for High-Power False Discovery Rate Control in High-Dimensional Variable Selection

    Authors: Changhu Wang, Ziheng Zhang, Jingyi Jessica Li

    Abstract: Balancing false discovery rate (FDR) and statistical power to ensure reliable discoveries is a key challenge in high-dimensional variable selection. Although several FDR control methods have been proposed, most involve perturbing the original data, either by concatenating knockoff variables or splitting the data into two halves, both of which can lead to a loss of power. In this paper, we introduc…

    Submitted 9 January, 2025; originally announced January 2025.

  39. arXiv:2501.03501  [pdf, other]

    stat.AP

    Modeling Cell Type Developmental Trajectory using Multinomial Unbalanced Optimal Transport

    Authors: Junhao Zhu, Kevin Zhang, Zhaolei Zhang, Dehan Kong

    Abstract: Single-cell trajectory analysis aims to reconstruct the biological developmental processes of cells as they evolve over time, leveraging temporal correlations in gene expression. During cellular development, gene expression patterns typically change and vary across different cell types. A significant challenge in this analysis is that RNA sequencing destroys the cell, making it impossible to track…

    Submitted 6 January, 2025; originally announced January 2025.

  40. arXiv:2501.02994  [pdf, other]

    stat.ML cs.LG

    NeuroPMD: Neural Fields for Density Estimation on Product Manifolds

    Authors: William Consagra, Zhiling Gu, Zhengwu Zhang

    Abstract: We propose a novel deep neural network methodology for density estimation on product Riemannian manifold domains. In our approach, the network directly parameterizes the unknown density function and is trained using a penalized maximum likelihood framework, with a penalty term formed using manifold differential operators. The network architecture and estimation algorithm are carefully designed to…

    Submitted 6 January, 2025; originally announced January 2025.

  41. arXiv:2412.20355  [pdf, other]

    stat.ML cs.LG

    Confidence Interval Construction and Conditional Variance Estimation with Dense ReLU Networks

    Authors: Carlos Misael Madrid Padilla, Oscar Hernan Madrid Padilla, Yik Lun Kei, Zhi Zhang, Yanzhen Chen

    Abstract: This paper addresses the problems of conditional variance estimation and confidence interval construction in nonparametric regression using dense networks with the Rectified Linear Unit (ReLU) activation function. We present a residual-based framework for conditional variance estimation, deriving nonasymptotic bounds for variance estimation under both heteroscedastic and homoscedastic settings. We…

    Submitted 31 December, 2024; v1 submitted 29 December, 2024; originally announced December 2024.

  42. arXiv:2412.17070  [pdf, ps, other]

    math.PR math.OC stat.ML

    Decoupled Functional Central Limit Theorems for Two-Time-Scale Stochastic Approximation

    Authors: Yuze Han, Xiang Li, Jiadong Liang, Zhihua Zhang

    Abstract: In two-time-scale stochastic approximation (SA), two iterates are updated at different rates, governed by distinct step sizes, with each update influencing the other. Previous studies have demonstrated that the convergence rates of the error terms for these updates depend solely on their respective step sizes, a property known as decoupled convergence. However, a functional version of this decoupl…

    Submitted 14 January, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

  43. arXiv:2412.14660  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG stat.ML

    Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models

    Authors: Zijun Chen, Wenbo Hu, Guande He, Zhijie Deng, Zheng Zhang, Richang Hong

    Abstract: Multimodal large language models (MLLMs) combine visual and textual data for tasks such as image captioning and visual question answering. Proper uncertainty calibration is crucial, yet challenging, for reliable use in areas like healthcare and autonomous driving. This paper investigates representative MLLMs, focusing on their calibration across various scenarios, including before and after visual…

    Submitted 25 December, 2024; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: Accepted to COLING 2025

  44. arXiv:2411.19789  [pdf, other]

    stat.ME

    Adjusting auxiliary variables under approximate neighborhood interference

    Authors: Xin Lu, Yuhao Wang, Zhiheng Zhang

    Abstract: Randomized experiments are the gold standard for causal inference. However, traditional assumptions, such as the Stable Unit Treatment Value Assumption (SUTVA), often fail in real-world settings where interference between units is present. Network interference, in particular, has garnered significant attention. Structural models, like the linear-in-means model, are commonly used to describe interf…

    Submitted 29 November, 2024; originally announced November 2024.

    Comments: 46 pages

  45. arXiv:2411.17668  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    Anytime Acceleration of Gradient Descent

    Authors: Zihan Zhang, Jason D. Lee, Simon S. Du, Yuxin Chen

    Abstract: This work investigates stepsize-based acceleration of gradient descent with {\em anytime} convergence guarantees. For smooth (non-strongly) convex optimization, we propose a stepsize schedule that allows gradient descent to achieve convergence guarantees of $O(T^{-1.119})$ for any stopping time $T$, where the stepsize schedule is predetermined without prior knowledge of the stopping time. This res…

    Submitted 8 December, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: v2: We improve the convergence rate from $O(T^{-1.03})$ to $O(T^{-1.119})$ through more precise computations

  46. arXiv:2411.17472  [pdf, other]

    cs.CV cs.LG stat.ML

    Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

    Authors: Eric Hanchen Jiang, Yasi Zhang, Zhi Zhang, Yixin Wan, Andrew Lizarraga, Shufan Li, Ying Nian Wu

    Abstract: Text-to-image (T2I) diffusion models have revolutionized generative modeling by producing high-fidelity, diverse, and visually realistic images from textual prompts. Despite these advances, existing models struggle with complex prompts involving multiple objects and attributes, often misaligning modifiers with their corresponding nouns or neglecting certain elements. Recent attention-based methods…

    Submitted 25 November, 2024; originally announced November 2024.

  47. arXiv:2411.09961  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Dense ReLU Neural Networks for Temporal-spatial Model

    Authors: Carlos Misael Madrid Padilla, Zhi Zhang, Xiaokai Luo, Daren Wang, Oscar Hernan Madrid Padilla

    Abstract: In this paper, we focus on fully connected deep neural networks utilizing the Rectified Linear Unit (ReLU) activation function for nonparametric estimation. We derive non-asymptotic bounds that lead to convergence rates, addressing both temporal and spatial dependence in the observed measurements. By accounting for dependencies across time and space, our models better reflect the complexities of r…

    Submitted 9 June, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

  48. arXiv:2411.09128  [pdf, ps, other]

    cs.IT stat.AP

    Performance Analysis of uRLLC in scalable Cell-free Radio Access Network System

    Authors: Ziyang Zhang, Dongming Wang, Yunxiang Guo, Yang Cao, Xiaohu You

    Abstract: As a critical component of beyond fifth-generation (B5G) and sixth-generation (6G) mobile communication systems, ultra-reliable low-latency communication (uRLLC) imposes stringent requirements on latency and reliability. In recent years, with the improvement of mobile communication networks, centralized and distributed processing schemes for cell-free massive multiple-input multiple-output (CF-mMIMO…

    Submitted 12 December, 2024; v1 submitted 13 November, 2024; originally announced November 2024.

  49. arXiv:2411.02465  [pdf, other]

    cs.LG cs.AI stat.ML

    See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers

    Authors: Jiaxin Zhuang, Leon Yan, Zhenwei Zhang, Ruiqi Wang, Jiawei Zhang, Yuantao Gu

    Abstract: Time series anomaly detection (TSAD) is becoming increasingly vital due to the rapid growth of time series data across various sectors. Anomalies in web service data, for example, can signal critical incidents such as system failures or server malfunctions, necessitating timely detection and response. However, most existing TSAD methodologies rely heavily on manual feature engineering or require e…

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: Under review

  50. arXiv:2410.21105  [pdf, ps, other]

    econ.EM stat.ML

    Difference-in-Differences with Time-varying Continuous Treatments using Double/Debiased Machine Learning

    Authors: Michel F. C. Haddad, Martin Huber, Lucas Z. Zhang

    Abstract: We propose a difference-in-differences (DiD) method for a time-varying continuous treatment and multiple time periods. Our framework assesses the average treatment effect on the treated (ATET) when comparing two non-zero treatment doses. The identification is based on a conditional parallel trend assumption imposed on the mean potential outcome under the lower dose, given observed covariates and p…

    Submitted 28 October, 2024; originally announced October 2024.