Skip to main content

Showing 1–50 of 452 results for author: Zhang, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.04026  [pdf, other

    stat.ML cs.LG

    Federated Control in Markov Decision Processes

    Authors: Hao Jin, Yang Peng, Liangyu Zhang, Zhihua Zhang

    Abstract: We study problems of federated control in Markov Decision Processes. To solve an MDP with large state space, multiple learning agents are introduced to collaboratively learn its optimal policy without communication of locally collected experience. In our settings, these agents have limited capabilities, which means they are restricted within different regions of the overall state space during the… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  2. arXiv:2405.03236  [pdf, other

    cs.LG stat.ML

    Federated Reinforcement Learning with Constraint Heterogeneity

    Authors: Hao Jin, Liangyu Zhang, Zhihua Zhang

    Abstract: We study a Federated Reinforcement Learning (FedRL) problem with constraint heterogeneity. In our setting, we aim to solve a reinforcement learning problem with multiple constraints while $N$ training agents are located in $N$ different environments with limited access to the constraint signals and they are expected to collaboratively learn a policy satisfying all constraint signals. Such learning… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  3. arXiv:2404.16287  [pdf, other

    stat.ML cs.CR cs.LG math.ST stat.ME

    Differentially Private Federated Learning: Servers Trustworthiness, Estimation, and Statistical Inference

    Authors: Zhe Zhang, Ryumei Nakada, Linjun Zhang

    Abstract: Differentially private federated learning is crucial for maintaining privacy in distributed environments. This paper investigates the challenges of high-dimensional estimation and inference under the constraints of differential privacy. First, we study scenarios involving an untrusted central server, demonstrating the inherent difficulties of accurate estimation in high-dimensional problems. Our f… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 56 pages, 3 figures

  4. arXiv:2404.14786  [pdf, other

    cs.AI cs.LG stat.ME

    LLM-Enhanced Causal Discovery in Temporal Domain from Interventional Data

    Authors: Peiwen Li, Xin Wang, Zeyang Zhang, Yuan Meng, Fang Shen, Yue Li, Jialong Wang, Yang Li, Wenweu Zhu

    Abstract: In the field of Artificial Intelligence for Information Technology Operations, causal discovery is pivotal for operation and maintenance of graph construction, facilitating downstream industrial tasks such as root cause analysis. Temporal causal discovery, as an emerging method, aims to identify temporal causal relationships between variables directly from observations by utilizing interventional… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  5. arXiv:2404.08803  [pdf, other

    math.PR stat.ML

    Random walks on simplicial complexes

    Authors: Thomas Bonis, Laurent Decreusefond, Viet Chi Tran, Zhihan Iris Zhang

    Abstract: The notion of Laplacian of a graph can be generalized to simplicial complexes and hypergraphs, and contains information on the topology of these structures. Even for a graph, the consideration of associated simplicial complexes is interesting to understand its shape. Whereas the Laplacian of a graph has a simple probabilistic interpretation as the generator of a continuous time Markov chain on the… ▽ More

    Submitted 7 May, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    MSC Class: 60D05

  6. arXiv:2404.06676  [pdf

    cs.LG eess.SP stat.AP

    Topological Feature Search Method for Multichannel EEG: Application in ADHD classification

    Authors: Tianming Cai, Guoying Zhao, Junbin Zang, Chen Zong, Zhidong Zhang, Chenyang Xue

    Abstract: In recent years, the preliminary diagnosis of Attention Deficit Hyperactivity Disorder (ADHD) using electroencephalography (EEG) has garnered attention from researchers. EEG, known for its expediency and efficiency, plays a pivotal role in the diagnosis and treatment of ADHD. However, the non-stationarity of EEG signals and inter-subject variability pose challenges to the diagnostic and classifica… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  7. arXiv:2404.04471  [pdf, ps, other

    stat.ME math.ST

    Estimation and Inference in Ultrahigh Dimensional Partially Linear Single-Index Models

    Authors: Shijie Cui, Xu Guo, Zhe Zhang

    Abstract: This paper is concerned with estimation and inference for ultrahigh dimensional partially linear single-index models. The presence of high dimensional nuisance parameter and nuisance unknown function makes the estimation and inference problem very challenging. In this paper, we first propose a profile partial penalized least squares estimator and establish the sparsity, consistency and asymptotic… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  8. arXiv:2404.03804  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    TransformerLSR: Attentive Joint Model of Longitudinal Data, Survival, and Recurrent Events with Concurrent Latent Structure

    Authors: Zhiyue Zhang, Yao Zhao, Yanxun Xu

    Abstract: In applications such as biomedical studies, epidemiology, and social sciences, recurrent events often co-occur with longitudinal measurements and a terminal event, such as death. Therefore, jointly modeling longitudinal measurements, recurrent events, and survival data while accounting for their dependencies is critical. While joint models for the three components exist in statistical literature,… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  9. arXiv:2404.03160  [pdf, other

    stat.AP

    Simultaneous clustering and estimation of additive shape invariant models for recurrent event data

    Authors: Zitong Zhang, Shizhe Chen

    Abstract: Technological advancements have enabled the recording of spiking activities from large neuron ensembles, presenting an exciting yet challenging opportunity for statistical analysis. This project considers the challenges from a common type of neuroscience experiments, where randomized interventions are applied over the course of each trial. The objective is to identify groups of neurons with unique… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  10. arXiv:2404.00912  [pdf, other

    math.ST stat.CO stat.ME stat.ML

    Inference in Randomized Least Squares and PCA via Normality of Quadratic Forms

    Authors: Leda Wang, Zhixiang Zhang, Edgar Dobriban

    Abstract: Randomized algorithms can be used to speed up the analysis of large datasets. In this paper, we develop a unified methodology for statistical inference via randomized sketching or projections in two of the most fundamental problems in multivariate statistical analysis: least squares and PCA. The methodology applies to fixed datasets -- i.e., is data-conditional -- and the only randomness is due to… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  11. arXiv:2404.00776  [pdf, other

    cs.LG cs.DB stat.ML

    PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning

    Authors: Weihua Hu, Yiwen Yuan, Zecheng Zhang, Akihiro Nitta, Kaidi Cao, Vid Kocijan, Jure Leskovec, Matthias Fey

    Abstract: We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data. PyTorch Frame makes tabular deep learning easy by providing a PyTorch-based data structure to handle complex tabular data, introducing a model abstraction to enable modular implementation of tabular models, and allowing external foundation models to be incorporated to handle complex columns (e.g.,… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: https://github.com/pyg-team/pytorch-frame

  12. arXiv:2403.18127  [pdf, ps, other

    cs.LG math.ST stat.ML

    A Correction of Pseudo Log-Likelihood Method

    Authors: Shi Feng, Nuoya Xiong, Zhijie Zhang, Wei Chen

    Abstract: Pseudo log-likelihood is a type of maximum likelihood estimation (MLE) method used in various fields including contextual bandits, influence maximization of social networks, and causal bandits. However, in previous literature \citep{li2017provably, zhang2022online, xiong2022combinatorial, feng2023combinatorial1, feng2023combinatorial2}, the log-likelihood function may not be bounded, which may res… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 7 pages

  13. arXiv:2403.16059  [pdf, other

    stat.ML cs.LG math.OC

    Manifold Regularization Classification Model Based On Improved Diffusion Map

    Authors: Hongfu Guo, Wencheng Zou, Zeyu Zhang, Shuishan Zhang, Ruitong Wang, Jintao Zhang

    Abstract: Manifold regularization model is a semi-supervised learning model that leverages the geometric structure of a dataset, comprising a small number of labeled samples and a large number of unlabeled samples, to generate classifiers. However, the original manifold norm limits the performance of models to local regions. To address this limitation, this paper proposes an approach to improve manifold reg… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 20 pages, 24figures

  14. arXiv:2403.15711  [pdf, other

    cs.LG stat.ME stat.ML

    Identifiable Latent Neural Causal Models

    Authors: Yuhang Liu, Zhen Zhang, Dong Gong, Mingming Gong, Biwei Huang, Anton van den Hengel, Kun Zhang, Javen Qinfeng Shi

    Abstract: Causal representation learning seeks to uncover latent, high-level causal representations from low-level observed data. It is particularly good at predictions under unseen distribution shifts, because these shifts can generally be interpreted as consequences of interventions. Hence leveraging {seen} distribution shifts becomes a natural strategy to help identifying causal representations, which in… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  15. arXiv:2403.12250  [pdf, other

    stat.ME stat.AP stat.CO

    Bayesian Optimization Sequential Surrogate (BOSS) Algorithm: Fast Bayesian Inference for a Broad Class of Bayesian Hierarchical Models

    Authors: Dayi Li, Ziang Zhang

    Abstract: Approximate Bayesian inference based on Laplace approximation and quadrature methods have become increasingly popular for their efficiency at fitting latent Gaussian models (LGM), which encompass popular models such as Bayesian generalized linear models, survival models, and spatio-temporal models. However, many useful models fall under the LGM framework only if some conditioning parameters are fi… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: The authors contributed equally to this work. The names are listed alphabetically

  16. arXiv:2403.05811  [pdf, ps, other

    stat.ML cs.LG

    Near Minimax-Optimal Distributional Temporal Difference Algorithms and The Freedman Inequality in Hilbert Spaces

    Authors: Yang Peng, Liangyu Zhang, Zhihua Zhang

    Abstract: Distributional reinforcement learning (DRL) has achieved empirical success in various domains. One of the core tasks in the field of DRL is distributional policy evaluation, which involves estimating the return distribution $η^π$ for a given policy $π$. The distributional temporal difference (TD) algorithm has been accordingly proposed, which is an extension of the temporal difference algorithm in… ▽ More

    Submitted 14 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  17. arXiv:2402.09397  [pdf, other

    math.ST stat.CO

    On the Assessment of Bootstrap Intervals for Samples of Fixed Size

    Authors: Weizhen Wang, Chongxiu Yu, Zhongzhan Zhang

    Abstract: A reasonable confidence interval should have a confidence coefficient no less than the given nominal level and a small expected length to reliably and accurately estimate the parameter of interest, and the bootstrap interval is considered to be an efficient interval estimation technique. In this paper, we offer a first attempt at computing the coverage probability and expected length of a parametr… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  18. arXiv:2402.08182  [pdf, other

    cs.LG stat.ML

    Variational Continual Test-Time Adaptation

    Authors: Fan Lyu, Kaile Du, Yuyang Li, Hanyu Zhao, Zhang Zhang, Guangcan Liu, Liang Wang

    Abstract: The prior drift is crucial in Continual Test-Time Adaptation (CTTA) methods that only use unlabeled test data, as it can cause significant error propagation. In this paper, we introduce VCoTTA, a variational Bayesian approach to measure uncertainties in CTTA. At the source stage, we transform a pre-trained deterministic model into a Bayesian Neural Network (BNN) via a variational warm-up strategy,… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  19. arXiv:2402.07465  [pdf, other

    cs.LG cs.AI math.DS math.NA stat.ML

    Score-Based Physics-Informed Neural Networks for High-Dimensional Fokker-Planck Equations

    Authors: Zheyuan Hu, Zhongqiang Zhang, George Em Karniadakis, Kenji Kawaguchi

    Abstract: The Fokker-Planck (FP) equation is a foundational PDE in stochastic processes. However, curse of dimensionality (CoD) poses challenge when dealing with high-dimensional FP PDEs. Although Monte Carlo and vanilla Physics-Informed Neural Networks (PINNs) have shown the potential to tackle CoD, both methods exhibit numerical errors in high dimensions when dealing with the probability density function… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 22 pages

    MSC Class: 14J60

  20. arXiv:2402.06223  [pdf, other

    cs.LG cs.CV stat.ML

    Revealing Multimodal Contrastive Representation Learning through Latent Partial Causal Models

    Authors: Yuhang Liu, Zhen Zhang, Dong Gong, Biwei Huang, Mingming Gong, Anton van den Hengel, Kun Zhang, Javen Qinfeng Shi

    Abstract: Multimodal contrastive representation learning methods have proven successful across a range of domains, partly due to their ability to generate meaningful shared representations of complex phenomena. To enhance the depth of analysis and understanding of these acquired representations, we introduce a unified causal model specifically designed for multimodal data. By examining this model, we show t… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  21. arXiv:2402.04489  [pdf, other

    cs.LG cs.CR cs.CY stat.ME

    De-amplifying Bias from Differential Privacy in Language Model Fine-tuning

    Authors: Sanjari Srivastava, Piotr Mardziel, Zhikhun Zhang, Archana Ahlawat, Anupam Datta, John C Mitchell

    Abstract: Fairness and privacy are two important values machine learning (ML) practitioners often seek to operationalize in models. Fairness aims to reduce model bias for social/demographic sub-groups. Privacy via differential privacy (DP) mechanisms, on the other hand, limits the impact of any individual's training data on the resulting model. The trade-offs between privacy and fairness goals of trustworth… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  22. arXiv:2402.02720  [pdf, other

    cs.LG stat.ML

    Discounted Adaptive Online Prediction

    Authors: Zhiyu Zhang, David Bombara, Heng Yang

    Abstract: Online learning is not always about memorizing everything. Since the future can be statistically very different from the past, a critical challenge is to gracefully forget the history while new data comes in. To formalize this intuition, we revisit the classical notion of discounted regret using recently developed techniques in adaptive online learning. Our main result is a new algorithm that adap… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  23. arXiv:2402.02196  [pdf, other

    stat.ME cs.LG

    Sample-Efficient Clustering and Conquer Procedures for Parallel Large-Scale Ranking and Selection

    Authors: Zishi Zhang, Yijie Peng

    Abstract: We propose novel "clustering and conquer" procedures for the parallel large-scale ranking and selection (R&S) problem, which leverage correlation information for clustering to break the bottleneck of sample efficiency. In parallel computing environments, correlation-based clustering can achieve an $\mathcal{O}(p)$ sample complexity reduction rate, which is the optimal reduction rate theoretically… ▽ More

    Submitted 12 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  24. arXiv:2401.16421  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

    Authors: Zhenyu He, Guhao Feng, Shengjie Luo, Kai Yang, Di He, Jingjing Xu, Zhi Zhang, Hongxia Yang, Liwei Wang

    Abstract: In this work, we leverage the intrinsic segmentation of language sequences and design a new positional encoding method called Bilevel Positional Encoding (BiPE). For each position, our BiPE blends an intra-segment encoding and an inter-segment encoding. The intra-segment encoding identifies the locations within a segment and helps the model capture the semantic information therein via absolute pos… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 17 pages, 7 figures, 8 tables; Working in Progress

  25. arXiv:2401.14910  [pdf, other

    stat.ME stat.AP

    Modeling Extreme Events: Univariate and Multivariate Data-Driven Approaches

    Authors: Gloria Buriticá, Manuel Hentschel, Olivier C. Pasche, Frank Röttger, Zhongwei Zhang

    Abstract: Modern inference in extreme value theory faces numerous complications, such as missing data, hidden covariates or design problems. Some of those complications were exemplified in the EVA 2023 data challenge. The challenge comprises multiple individual problems which cover a variety of univariate and multivariate settings. This note presents the contribution of team genEVA in said competition, with… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  26. arXiv:2401.11380  [pdf, other

    cs.LG math.ST stat.ME stat.ML

    MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning

    Authors: Mao Hong, Zhiyue Zhang, Yue Wu, Yanxun Xu

    Abstract: Model-based offline reinforcement learning methods (RL) have achieved state-of-the-art performance in many decision-making problems thanks to their sample efficiency and generalizability. Despite these advancements, existing model-based offline RL approaches either focus on theoretical studies without developing practical algorithms or rely on a restricted parametric policy space, thus not fully l… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  27. arXiv:2401.11352  [pdf, other

    stat.ME

    Geometric Insights and Empirical Observations on Covariate Adjustment and Stratified Randomization in Randomized Clinical Trials

    Authors: Zhiwei Zhang

    Abstract: The statistical efficiency of randomized clinical trials can be improved by incorporating information from baseline covariates (i.e., pre-treatment patient characteristics). This can be done in the design stage using a covariate-adaptive randomization scheme such as stratified (permutated block) randomization, or in the analysis stage through covariate adjustment. This article provides a geometric… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  28. arXiv:2401.03893  [pdf, other

    math.OC stat.ML

    Finite-Time Decoupled Convergence in Nonlinear Two-Time-Scale Stochastic Approximation

    Authors: Yuze Han, Xiang Li, Zhihua Zhang

    Abstract: In two-time-scale stochastic approximation (SA), two iterates are updated at varying speeds using different step sizes, with each update influencing the other. Previous studies in linear two-time-scale SA have found that the convergence rates of the mean-square errors for these updates are dependent solely on their respective step sizes, leading to what is referred to as decoupled convergence. How… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  29. arXiv:2312.14416  [pdf, other

    stat.ME

    Joint Semi-Symmetric Tensor PCA for Integrating Multi-modal Populations of Networks

    Authors: Jiaming Liu, Lili Zheng, Zhengwu Zhang, Genevera I. Allen

    Abstract: Multi-modal populations of networks arise in many scenarios including in large-scale multi-modal neuroimaging studies that capture both functional and structural neuroimaging data for thousands of subjects. A major research question in such studies is how functional and structural brain connectivity are related and how they vary across the population. we develop a novel PCA-type framework for inte… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  30. arXiv:2312.05590  [pdf, other

    math.OC stat.ME

    Gradient Tracking for High Dimensional Federated Optimization

    Authors: Jiadong Liang, Yang Peng, Zhihua Zhang

    Abstract: In this paper, we study the (decentralized) distributed optimization problem with high-dimensional sparse structure. Building upon the FedDA algorithm, we propose a (Decentralized) FedDA-GT algorithm, which combines the \textbf{gradient tracking} technique. It is able to eliminate the heterogeneity among different clients' objective functions while ensuring a dimension-free convergence rate. Compa… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  31. arXiv:2312.05134  [pdf, other

    cs.LG stat.ML

    Optimal Multi-Distribution Learning

    Authors: Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee

    Abstract: Multi-distribution learning (MDL), which seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions, has emerged as a unified framework in response to the evolving demand for robustness, fairness, multi-group collaboration, etc. Achieving data-efficient MDL necessitates adaptive sampling, also called on-demand sampling, throughout the learning process.… ▽ More

    Submitted 20 January, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  32. arXiv:2311.17143  [pdf, other

    astro-ph.IM astro-ph.HE cs.LG stat.ML

    Predicting the Age of Astronomical Transients from Real-Time Multivariate Time Series

    Authors: Hali Huang, Daniel Muthukrishna, Prajna Nair, Zimi Zhang, Michael Fausnaugh, Torsha Majumder, Ryan J. Foley, George R. Ricker

    Abstract: Astronomical transients, such as supernovae and other rare stellar explosions, have been instrumental in some of the most significant discoveries in astronomy. New astronomical sky surveys will soon record unprecedented numbers of transients as sparsely and irregularly sampled multivariate time series. To improve our understanding of the physical mechanisms of transients and their progenitor syste… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures. Accepted at the NeurIPS 2023 Machine Learning and the Physical Sciences workshop

  33. arXiv:2311.12319  [pdf, other

    stat.ML math.ST

    A unified consensus-based parallel ADMM algorithm for high-dimensional regression with combined regularizations

    Authors: Xiaofei Wu, Zhimin Zhang, Zhenyu Cui

    Abstract: The parallel alternating direction method of multipliers (ADMM) algorithm is widely recognized for its effectiveness in handling large-scale datasets stored in a distributed manner, making it a popular choice for solving statistical learning models. However, there is currently limited research on parallel algorithms specifically designed for high-dimensional regression with combined (composite) re… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  34. arXiv:2311.08434  [pdf, other

    cs.LG cs.AI stat.ML

    Uplift Modeling based on Graph Neural Network Combined with Causal Knowledge

    Authors: Haowen Wang, Xinyan Ye, Yangze Zhou, Zhiyi Zhang, Longhan Zhang, Jing Jiang

    Abstract: Uplift modeling is a fundamental component of marketing effect modeling, which is commonly employed to evaluate the effects of treatments on outcomes. Through uplift modeling, we can identify the treatment with the greatest benefit. On the other side, we can identify clients who are likely to make favorable decisions in response to a certain treatment. In the past, uplift modeling approaches relie… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 6 pages, 6 figures

  35. arXiv:2311.05061  [pdf, other

    cs.LG stat.ML

    Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics

    Authors: Soo Min Kwon, Zekai Zhang, Dogyoon Song, Laura Balzano, Qing Qu

    Abstract: Overparameterized models have proven to be powerful tools for solving various machine learning tasks. However, overparameterization often leads to a substantial increase in computational and memory costs, which in turn requires extensive resources to train. In this work, we present a novel approach for compressing overparameterized models, developed through studying their learning dynamics. We obs… ▽ More

    Submitted 11 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  36. arXiv:2310.12139  [pdf, ps, other

    math.OC stat.CO

    Optimal and parameter-free gradient minimization methods for convex and nonconvex optimization

    Authors: Guanghui Lan, Yuyuan Ouyang, Zhe Zhang

    Abstract: We propose novel optimal and parameter-free algorithms for computing an approximate solution with small (projected) gradient norm. Specifically, for computing an approximate solution such that the norm of its (projected) gradient does not exceed $\varepsilon$, we obtain the following results: a) for the convex case, the total number of gradient evaluations is bounded by… ▽ More

    Submitted 29 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  37. arXiv:2309.17262  [pdf, other

    stat.ML cs.LG

    Estimation and Inference in Distributional Reinforcement Learning

    Authors: Liangyu Zhang, Yang Peng, Jiadong Liang, Wenhao Yang, Zhihua Zhang

    Abstract: In this paper, we study distributional reinforcement learning from the perspective of statistical efficiency. We investigate distributional policy evaluation, aiming to estimate the complete distribution of the random return (denoted $η^π$) attained by a given policy $π$. We use the certainty-equivalence method to construct our estimator $\hatη^π$, given a generative model is available. We s… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  38. arXiv:2309.16409  [pdf, other

    stat.ML cs.LG

    Constructing Synthetic Treatment Groups without the Mean Exchangeability Assumption

    Authors: Yuhang Zhang, Yue Liu, Zhihua Zhang

    Abstract: The purpose of this work is to transport the information from multiple randomized controlled trials to the target population where we only have the control group data. Previous works rely critically on the mean exchangeability assumption. However, as pointed out by many current studies, the mean exchangeability assumption might be violated. Motivated by the synthetic control method, we construct a… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  39. arXiv:2309.16044  [pdf, ps, other

    cs.LG stat.ML

    Improving Adaptive Online Learning Using Refined Discretization

    Authors: Zhiyu Zhang, Heng Yang, Ashok Cutkosky, Ioannis Ch. Paschalidis

    Abstract: We study unconstrained Online Linear Optimization with Lipschitz losses. Motivated by the pursuit of instance optimality, we propose a new algorithm that simultaneously achieves ($i$) the AdaGrad-style second order gradient adaptivity; and ($ii$) the comparator norm adaptivity also known as "parameter freeness" in the literature. In particular, - our algorithm does not employ the impractical dou… ▽ More

    Submitted 22 February, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: ALT 2024

  40. arXiv:2309.00983  [pdf, other

    stat.ML cs.LG math.OC

    An Ensemble Score Filter for Tracking High-Dimensional Nonlinear Dynamical Systems

    Authors: Feng Bao, Zezhong Zhang, Guannan Zhang

    Abstract: We propose an ensemble score filter (EnSF) for solving high-dimensional nonlinear filtering problems with superior accuracy. A major drawback of existing filtering methods, e.g., particle filters or ensemble Kalman filters, is the low accuracy in handling high-dimensional and highly nonlinear problems. EnSF attacks this challenge by exploiting the score-based diffusion model, defined in a pseudo-t… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.09282

  41. arXiv:2308.13068  [pdf, other

    cs.LG cs.AI cs.PF stat.CO stat.ML

    Multivariate Time Series Anomaly Detection: Fancy Algorithms and Flawed Evaluation Methodology

    Authors: Mohamed El Amine Sehili, Zonghua Zhang

    Abstract: Multivariate Time Series (MVTS) anomaly detection is a long-standing and challenging research topic that has attracted tremendous research effort from both industry and academia recently. However, a careful study of the literature makes us realize that 1) the community is active but not as organized as other sibling machine learning communities such as Computer Vision (CV) and Natural Language Pro… ▽ More

    Submitted 1 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 17 pages, 7 figures, accepted at TPCTC 2023

    ACM Class: G.3; I.2.6; I.2.m

  42. arXiv:2308.05738  [pdf, other

    stat.CO q-bio.NC stat.AP stat.ME

    Continuous and Atlas-free Analysis of Brain Structural Connectivity

    Authors: William Consagra, Martin Cole, Xing Qiu, Zhengwu Zhang

    Abstract: Brain structural networks are often represented as discrete adjacency matrices with elements summarizing the connectivity between pairs of regions of interest (ROIs). These ROIs are typically determined a-priori using a brain atlas. The choice of atlas is often arbitrary and can lead to a loss of important connectivity information at the sub-ROI level. This work introduces an atlas-free framework… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  43. arXiv:2307.16792  [pdf, ps, other

    stat.ML cs.LG

    Classification with Deep Neural Networks and Logistic Loss

    Authors: Zihan Zhang, Lei Shi, Ding-Xuan Zhou

    Abstract: Deep neural networks (DNNs) trained with the logistic loss (i.e., the cross entropy loss) have made impressive advancements in various binary classification tasks. However, generalization analysis for binary classification with DNNs and logistic loss remains scarce. The unboundedness of the target function for the logistic loss is the main obstacle to deriving satisfactory generalization bounds. I… ▽ More

    Submitted 21 April, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  44. arXiv:2307.11255  [pdf, other

    stat.ME math.ST stat.CO

    A Framework for Statistical Inference via Randomized Algorithms

    Authors: Zhixiang Zhang, Sokbae Lee, Edgar Dobriban

    Abstract: Randomized algorithms, such as randomized sketching or projections, are a promising approach to ease the computational burden in analyzing large datasets. However, randomized algorithms also produce non-deterministic outputs, leading to the problem of evaluating their accuracy. In this paper, we develop a statistical inference framework for quantifying the uncertainty of the outputs of randomized… ▽ More

    Submitted 28 September, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

  45. arXiv:2307.08921  [pdf, other

    cs.LG stat.ML

    Optimistic Estimate Uncovers the Potential of Nonlinear Models

    Authors: Yaoyu Zhang, Zhongwang Zhang, Leyang Zhang, Zhiwei Bai, Tao Luo, Zhi-Qin John Xu

    Abstract: We propose an optimistic estimate to evaluate the best possible fitting performance of nonlinear models. It yields an optimistic sample size that quantifies the smallest possible sample size to fit/recover a target function using a nonlinear model. We estimate the optimistic sample sizes for matrix factorization models, deep models, and deep neural networks (DNNs) with fully-connected or convoluti… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  46. arXiv:2307.01668  [pdf, other

    cs.LG cs.CV stat.ML

    Training Energy-Based Models with Diffusion Contrastive Divergences

    Authors: Weijian Luo, Hao Jiang, Tianyang Hu, Jiacheng Sun, Zhenguo Li, Zhihua Zhang

    Abstract: Energy-Based Models (EBMs) have been widely used for generative modeling. Contrastive Divergence (CD), a prevailing training objective for EBMs, requires sampling from the EBM with Markov Chain Monte Carlo methods (MCMCs), which leads to an irreconcilable trade-off between the computational burden and the validity of the CD. Running MCMCs till convergence is computationally intensive. On the other… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  47. arXiv:2306.14859  [pdf, other

    cs.LG stat.ML

    Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories

    Authors: Zixuan Zhang, Minshuo Chen, Mengdi Wang, Wenjing Liao, Tuo Zhao

    Abstract: Existing theories on deep nonparametric regression have shown that when the input data lie on a low-dimensional manifold, deep neural networks can adapt to the intrinsic data structures. In real world applications, such an assumption of data lying exactly on a low dimensional manifold is stringent. This paper introduces a relaxed assumption that the input data are concentrated around a subset of… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  48. arXiv:2306.12925  [pdf, other

    cs.CL cs.AI cs.SD eess.AS stat.ML

    AudioPaLM: A Large Language Model That Can Speak and Listen

    Authors: Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats , et al. (5 additional authors not shown)

    Abstract: We introduce AudioPaLM, a large language model for speech understanding and generation. AudioPaLM fuses text-based and speech-based language models, PaLM-2 [Anil et al., 2023] and AudioLM [Borsos et al., 2022], into a unified multimodal architecture that can process and generate text and speech with applications including speech recognition and speech-to-speech translation. AudioPaLM inherits the… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Technical report

  49. arXiv:2306.04952  [pdf, other

    stat.ML cs.LG

    Entropy-based Training Methods for Scalable Neural Implicit Sampler

    Authors: Weijian Luo, Boya Zhang, Zhihua Zhang

    Abstract: Efficiently sampling from un-normalized target distributions is a fundamental problem in scientific computing and machine learning. Traditional approaches like Markov Chain Monte Carlo (MCMC) guarantee asymptotically unbiased samples from such distributions but suffer from computational inefficiency, particularly when dealing with high-dimensional targets, as they require numerous iterations to ge… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  50. arXiv:2305.16539  [pdf, other

    math.ST cs.IT math.PR stat.ME

    On the existence of powerful p-values and e-values for composite hypotheses

    Authors: Zhenyuan Zhang, Aaditya Ramdas, Ruodu Wang

    Abstract: Given a composite null $\mathcal P$ and composite alternative $\mathcal Q$, when and how can we construct a p-value whose distribution is exactly uniform under the null, and stochastically smaller than uniform under the alternative? Similarly, when and how can we construct an e-value whose expectation exactly equals one under the null, but its expected logarithm under the alternative is positive?… ▽ More

    Submitted 15 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 39 pages, 7 figures