Search | arXiv e-print repository

Some remarks on the uncolored versions of the original CFI-graphs

Authors: Yijia Chen, Jörg Flum, Mingjun Liu

Abstract: The CFI-graphs, named after Cai, Fürer, and Immerman, are central to the study of the graph isomorphism testing and of first-order logic with counting. They are colored graphs, and the coloring plays a role in many of their applications. As usual, it is not hard to remove the coloring by some extra graph gadgets, but at the cost of blowing up the size of the graphs and changing some parameters of… ▽ More The CFI-graphs, named after Cai, Fürer, and Immerman, are central to the study of the graph isomorphism testing and of first-order logic with counting. They are colored graphs, and the coloring plays a role in many of their applications. As usual, it is not hard to remove the coloring by some extra graph gadgets, but at the cost of blowing up the size of the graphs and changing some parameters of them as well. This might lead to suboptimal combinatorial bounds important to their applications. Since then for some uncolored variants of the CFI-graphs it has been shown that they serve the same purposes. We show that this already applies to the graphs obtained from the original CFI-graphs by forgetting the colors. Moreover, we will see that there is a first-order formula $\varphi(x,y)$ expressing in almost all uncolored CFI-graphs that $x$ and $y$ have the same color in the corresponding colored graphs. △ Less

Submitted 2 July, 2025; originally announced July 2025.

Comments: 46 pages

arXiv:2506.08723 [pdf, other]

Wasserstein and Convex Gaussian Approximations for Non-stationary Time Series of Diverging Dimensionality

Authors: Miaoshiqi Liu, Jun Yang, Zhou Zhou

Abstract: In high-dimensional time series analysis, Gaussian approximation (GA) schemes under various distance measures or on various collections of subsets of the Euclidean space play a fundamental role in a wide range of statistical inference problems. To date, most GA results for high-dimensional time series are established on hyper-rectangles and their equivalence. In this paper, by considering the 2-Wa… ▽ More In high-dimensional time series analysis, Gaussian approximation (GA) schemes under various distance measures or on various collections of subsets of the Euclidean space play a fundamental role in a wide range of statistical inference problems. To date, most GA results for high-dimensional time series are established on hyper-rectangles and their equivalence. In this paper, by considering the 2-Wasserstein distance and the collection of all convex sets, we establish a general GA theory for a broad class of high-dimensional non-stationary (HDNS) time series, extending the scope of problems that can be addressed in HDNS time series analysis. For HDNS time series of sufficiently weak dependence and light tail, the GA rates established in this paper are either nearly optimal with respect to the dimensionality and time series length, or they are nearly identical to the corresponding best-known GA rates established for independent data. A multiplier bootstrap procedure is utilized and theoretically justified to implement our GA theory. We demonstrate by two previously undiscussed time series applications the use of the GA theory and the bootstrap procedure as unified tools for a wide range of statistical inference problems in HDNS time series analysis. △ Less

Submitted 10 June, 2025; originally announced June 2025.

Comments: 70 pages

arXiv:2506.00801 [pdf, ps, other]

Adversarial Reinforcement Learning: A Duality-Based Approach To Solving Optimal Control Problems

Authors: Nan Chen, Mengzhou Liu, Xiaoyan Wang, Nanyi Zhang

Abstract: We propose an adversarial deep reinforcement learning (ADRL) algorithm for high-dimensional stochastic control problems. Inspired by the information relaxation duality, ADRL reformulates the control problem as a min-max optimization between policies and adversarial penalties, enforcing non-anticipativity while preserving optimality. Numerical experiments demonstrate ADRL's superior performance to… ▽ More We propose an adversarial deep reinforcement learning (ADRL) algorithm for high-dimensional stochastic control problems. Inspired by the information relaxation duality, ADRL reformulates the control problem as a min-max optimization between policies and adversarial penalties, enforcing non-anticipativity while preserving optimality. Numerical experiments demonstrate ADRL's superior performance to yield tight dual gaps. Our results highlight the potential of ADRL as a robust computational framework for high-dimensional stochastic control in simulation-based optimization contexts. △ Less

Submitted 1 July, 2025; v1 submitted 31 May, 2025; originally announced June 2025.

Comments: Accepted by the 2025 Winter Simulation Conference

arXiv:2505.17678 [pdf, ps, other]

Optimal control of variable-exponent subdiffusion

Authors: Yiqun Li, Mengmeng Liu, Wenlin Qiu, Xiangcheng Zheng

Abstract: This work investigates the optimal control of the variable-exponent subdiffusion, which extends the work [Gunzburger and Wang, {\it SIAM J. Control Optim.} 2019] to the variable-exponent case to account for the multiscale and crossover diffusion behavior. To resolve the difficulties caused by the leading variable-exponent operator, we adopt the convolution method to reformulate the model into an e… ▽ More This work investigates the optimal control of the variable-exponent subdiffusion, which extends the work [Gunzburger and Wang, {\it SIAM J. Control Optim.} 2019] to the variable-exponent case to account for the multiscale and crossover diffusion behavior. To resolve the difficulties caused by the leading variable-exponent operator, we adopt the convolution method to reformulate the model into an equivalent but more tractable form, and then prove the well-posedness and weighted regularity of the optimal control. As the convolution kernels in reformulated models are indefinite-sign, non-positive-definite, and non-monotonic, we adopt the discrete convolution kernel approach in numerical analysis to show the $O(τ(1+|\lnτ|)+h^2)$ accuracy of the schemes for state and adjoint equations. Numerical experiments are performed to substantiate the theoretical findings. △ Less

Submitted 30 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

MSC Class: 35R11; 49K20; 65M12; 65M60

arXiv:2505.14278 [pdf, ps, other]

Critical mass for finite-time chemotactic collapse in the critical dimension via comparison

Authors: Xuan Mao, Meng Liu, Yuxiang Li

Abstract: We study the Neumann initial-boundary value problem for the parabolic-elliptic chemotaxis system, proposed by Jäger and Luckhaus (1992). We confirm that their comparison methods can be simplified and refined, applicable to seek the critical mass $8π$ concerning finite-time blowup in the unit disk. As an application, we deal with a parabolic-elliptic-parabolic chemotaxis model involving indirect si… ▽ More We study the Neumann initial-boundary value problem for the parabolic-elliptic chemotaxis system, proposed by Jäger and Luckhaus (1992). We confirm that their comparison methods can be simplified and refined, applicable to seek the critical mass $8π$ concerning finite-time blowup in the unit disk. As an application, we deal with a parabolic-elliptic-parabolic chemotaxis model involving indirect signal production in the unit ball of $\mathbb R^4$, proposed by Tao and Winkler (2025). Within the framework of radially symmetric solutions, we prove that if initial mass is less than $64π^2$, then solution is globally bounded; for any $m$ exceeding $64π^2$, there exist nonnegative initial data with prescribed mass $m$ such that the corresponding classical solutions exhibit a formation of Dirac-delta type singularity in finite time, termed a chemotactic collapse. △ Less

Submitted 20 May, 2025; originally announced May 2025.

arXiv:2505.07578 [pdf, ps, other]

Positive normalized solutions of Schrödinger equations with Sobolev critical growth in bounded domains

Authors: Xiaojun Chang, Manting Liu, Duokui Yan

Abstract: This paper investigates the existence of positive normalized solutions to the Sobolev critical Schrödinger equation: \begin{equation*} \left\{ \begin{aligned} &-Δu +λu =|u|^{2^*-2}u \quad &\mbox{in}& \ Ω,\\ &\int_Ω|u|^{2}dx=c, \quad u=0 \quad &\mbox{on}& \ \partialΩ, \end{aligned} \right. \end{equation*} where $Ω\subset\mathbb{R}^{N}$ ($N\geq3$) is a bounded smooth domain, $2^*=\frac{2N}{N-2}$,… ▽ More This paper investigates the existence of positive normalized solutions to the Sobolev critical Schrödinger equation: \begin{equation*} \left\{ \begin{aligned} &-Δu +λu =|u|^{2^*-2}u \quad &\mbox{in}& \ Ω,\\ &\int_Ω|u|^{2}dx=c, \quad u=0 \quad &\mbox{on}& \ \partialΩ, \end{aligned} \right. \end{equation*} where $Ω\subset\mathbb{R}^{N}$ ($N\geq3$) is a bounded smooth domain, $2^*=\frac{2N}{N-2}$, $λ\in \mathbb{R}$ is a Lagrange multiplier, and $c>0$ is a prescribed constant. By introducing a novel blow-up analysis for Sobolev subcritical approximation solutions with uniformly bounded Morse index and fixed mass, we establish the existence of mountain pass type positive normalized solutions for $N\ge 3$. This resolves an open problem posed in [Pierotti, Verzini and Yu, SIAM J. Math. Anal. 2025]. △ Less

Submitted 12 May, 2025; originally announced May 2025.

MSC Class: 35B33; 35J20; 35J60; 35Q55

arXiv:2505.04599 [pdf, ps, other]

Complexity Lower Bounds of Adaptive Gradient Algorithms for Non-convex Stochastic Optimization under Relaxed Smoothness

Authors: Michael Crawshaw, Mingrui Liu

Abstract: Recent results in non-convex stochastic optimization demonstrate the convergence of popular adaptive algorithms (e.g., AdaGrad) under the $(L_0, L_1)$-smoothness condition, but the rate of convergence is a higher-order polynomial in terms of problem parameters like the smoothness constants. The complexity guaranteed by such algorithms to find an $ε$-stationary point may be significantly larger tha… ▽ More Recent results in non-convex stochastic optimization demonstrate the convergence of popular adaptive algorithms (e.g., AdaGrad) under the $(L_0, L_1)$-smoothness condition, but the rate of convergence is a higher-order polynomial in terms of problem parameters like the smoothness constants. The complexity guaranteed by such algorithms to find an $ε$-stationary point may be significantly larger than the optimal complexity of $Θ\left( ΔL σ^2 ε^{-4} \right)$ achieved by SGD in the $L$-smooth setting, where $Δ$ is the initial optimality gap, $σ^2$ is the variance of stochastic gradient. However, it is currently not known whether these higher-order dependencies can be tightened. To answer this question, we investigate complexity lower bounds for several adaptive optimization algorithms in the $(L_0, L_1)$-smooth setting, with a focus on the dependence in terms of problem parameters $Δ, L_0, L_1$. We provide complexity bounds for three variations of AdaGrad, which show at least a quadratic dependence on problem parameters $Δ, L_0, L_1$. Notably, we show that the decorrelated variant of AdaGrad-Norm requires at least $Ω\left( Δ^2 L_1^2 σ^2 ε^{-4} \right)$ stochastic gradient queries to find an $ε$-stationary point. We also provide a lower bound for SGD with a broad class of adaptive stepsizes. Our results show that, for certain adaptive algorithms, the $(L_0, L_1)$-smooth setting is fundamentally more difficult than the standard smooth setting, in terms of the initial optimality gap and the smoothness constants. △ Less

Submitted 7 May, 2025; originally announced May 2025.

Comments: ICLR 2025

arXiv:2505.00940 [pdf, other]

StablePCA: Learning Shared Representations across Multiple Sources via Minimax Optimization

Authors: Zhenyu Wang, Molei Liu, Jing Lei, Francis Bach, Zijian Guo

Abstract: When synthesizing multisource high-dimensional data, a key objective is to extract low-dimensional feature representations that effectively approximate the original features across different sources. Such general feature extraction facilitates the discovery of transferable knowledge, mitigates systematic biases such as batch effects, and promotes fairness. In this paper, we propose Stable Principa… ▽ More When synthesizing multisource high-dimensional data, a key objective is to extract low-dimensional feature representations that effectively approximate the original features across different sources. Such general feature extraction facilitates the discovery of transferable knowledge, mitigates systematic biases such as batch effects, and promotes fairness. In this paper, we propose Stable Principal Component Analysis (StablePCA), a novel method for group distributionally robust learning of latent representations from high-dimensional multi-source data. A primary challenge in generalizing PCA to the multi-source regime lies in the nonconvexity of the fixed rank constraint, rendering the minimax optimization nonconvex. To address this challenge, we employ the Fantope relaxation, reformulating the problem as a convex minimax optimization, with the objective defined as the maximum loss across sources. To solve the relaxed formulation, we devise an optimistic-gradient Mirror Prox algorithm with explicit closed-form updates. Theoretically, we establish the global convergence of the Mirror Prox algorithm, with the convergence rate provided from the optimization perspective. Furthermore, we offer practical criteria to assess how closely the solution approximates the original nonconvex formulation. Through extensive numerical experiments, we demonstrate StablePCA's high accuracy and efficiency in extracting robust low-dimensional representations across various finite-sample scenarios. △ Less

Submitted 1 May, 2025; originally announced May 2025.

arXiv:2504.15696 [pdf, ps, other]

Remodeling Conjecture with Descendants

Authors: Bohan Fang, Chiu-Chu Melissa Liu, Song Yu, Zhengyu Zong

Abstract: We formulate and prove the Remodeling Conjecture with descendants, which is a version of all-genus equivariant descendant mirror symmetry for semi-projective toric Calabi-Yau 3-orbifolds. We consider the $K$-group of equivariant coherent sheaves on the toric Calabi-Yau 3-orbifold with support bounded in a direction, and prove that it is isomorphic to a certain integral relative first homology grou… ▽ More We formulate and prove the Remodeling Conjecture with descendants, which is a version of all-genus equivariant descendant mirror symmetry for semi-projective toric Calabi-Yau 3-orbifolds. We consider the $K$-group of equivariant coherent sheaves on the toric Calabi-Yau 3-orbifold with support bounded in a direction, and prove that it is isomorphic to a certain integral relative first homology group of the equivariant mirror curve. We establish a correspondence between all-genus equivariant descendant Gromov-Witten invariants with $K$-theoretic framings and oscillatory integrals (Laplace transforms) of the Chekhov-Eynard-Orantin topological recursion invariants along relative 1-cycles on the equivariant mirror curve. Our genus-zero correspondence is an equivariant Hodge-theoretic mirror symmetry with integral structures. In the non-equivariant setting, we prove a conjecture of Hosono which equates central charges of compactly supported coherent sheaves with period integrals of integral 3-cycles on the Hori-Vafa mirror 3-fold. △ Less

Submitted 22 April, 2025; originally announced April 2025.

Comments: 71 pages, 4 figures

MSC Class: 14N35; 14J33

arXiv:2504.14595 [pdf, other]

Critical Ising model, Multiple SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$ and $β$-Jacobi Ensemble

Authors: Mingchang Liu

Abstract: Fix $N\ge 1$ and suppose that $(Ω;x_1,\ldots, x_{N}; x_{N+1}, x_{N+2})$ is a polygon, i.e. $Ω$ is a simply connected domain with locally connected boundary and $x_1,\ldots,x_{N+2}$ are $N+2$ different points located counterclockwisely on $\partialΩ$. Fix $κ\in (0,4)$. In this paper, we will give two different constructions of multiple $N$-SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$ on… ▽ More Fix $N\ge 1$ and suppose that $(Ω;x_1,\ldots, x_{N}; x_{N+1}, x_{N+2})$ is a polygon, i.e. $Ω$ is a simply connected domain with locally connected boundary and $x_1,\ldots,x_{N+2}$ are $N+2$ different points located counterclockwisely on $\partialΩ$. Fix $κ\in (0,4)$. In this paper, we will give two different constructions of multiple $N$-SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$ on $(Ω;x_1,\ldots,x_{N}; x_{N+1},x_{N+2})$ and prove that they give the same law on random curves. Then, by establishing the uniqueness of multiple $N$-SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$, we can obtain the joint law of the hitting points of multiple $N$-SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$ with odd (resp. even) indices on $(x_{N+1}x_{N+2})$. After shrinking $x_1,\ldots,x_N$ to one point, the law of hitting points with odd (resp. even) indices converge to $β$-Jacobi ensemble with the conjectured relation $β=\frac{8}κ$. We will establish a direct connection between SLE-type curves and $β$-Jacobi ensemble. As an application, we consider critical Ising model on a discrete polygon $(Ω^δ_δ;x^δ_1,\ldots,x^δ_{N}; x^δ_{N+1},x^δ_{N+2})$ with alternating boundary $(x^δ_{N+2}x^δ_{N+1})$ and free boundary $(x^δ_{N+1}x^δ_{N+2})$. Motivated by the partition function of multiple $N$-SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$, we derive the scaling limit of the probability of the event that the interface $γ_j^δ$ starting from $x^δ_j$ ends at $(x^δ_{N+1}x^δ_{N+2})$ for all $1\le j\le N$. Moreover, we prove that given this event, the interface $(γ_1^δ,\ldots,γ_N^δ)$ converges to multiple $N$-SLE$_κ\left(\frac{κ-6}{2},\frac{κ-6}{2}\right)$ with $κ=3$. △ Less

Submitted 20 April, 2025; originally announced April 2025.

Comments: 29 pages, 5 figures

arXiv:2504.10549 [pdf, ps, other]

On Outer Pressure Problem of Compressible Navier-Stokes System with Degenerate Heat-Conductivity in Unbounded Domains

Authors: Manyu Liu, Yanfang Peng, Zhilun Peng

Abstract: The compressible Navier-Stokes system with the constant viscosity and the nonlinear heat conductivity which is proportional to a positive power of the temperature and may be degenerate is considered. Under the outer pressure boundary conditions in one-dimensional unbounded spatial domains, the global existence of the strong solutions is obtained after proving that both the specific volume and temp… ▽ More The compressible Navier-Stokes system with the constant viscosity and the nonlinear heat conductivity which is proportional to a positive power of the temperature and may be degenerate is considered. Under the outer pressure boundary conditions in one-dimensional unbounded spatial domains, the global existence of the strong solutions is obtained after proving that both the specific volume and temperature are bounded from below and above independently of time and space. Moreover, the asymptotically stability of global solutions is established as time tends to infinity. △ Less

Submitted 14 April, 2025; originally announced April 2025.

arXiv:2503.20948 [pdf, ps, other]

Global SYZ mirror symmetry and homological mirror symmetry for principally polarized abelian varieties

Authors: Haniya Azam, Catherine Cannizzo, Heather Lee, Chiu-Chu Melissa Liu

Abstract: For any positive integer $g$, we introduce the moduli space $\mathcal{A}^F_g =[\mathcal{H}_g/P_g(\mathbb{Z})]$ parametrizing $g$-dimensional principally polarized abelian varieties $V_τ$ together with a Strominger-Yau-Zalsow (SYZ) fibration, where $τ\in \mathcal{H}_g$ is the genus-$g$ Seigel upper half space and $P_g(\mathbb{Z}) \subset \mathrm{Sp}(2g,\mathbb{Z})$ is the integral Siegel parabolic… ▽ More For any positive integer $g$, we introduce the moduli space $\mathcal{A}^F_g =[\mathcal{H}_g/P_g(\mathbb{Z})]$ parametrizing $g$-dimensional principally polarized abelian varieties $V_τ$ together with a Strominger-Yau-Zalsow (SYZ) fibration, where $τ\in \mathcal{H}_g$ is the genus-$g$ Seigel upper half space and $P_g(\mathbb{Z}) \subset \mathrm{Sp}(2g,\mathbb{Z})$ is the integral Siegel parabolic subgroup. We study global SYZ mirror symmetry over the global moduli $\mathcal{H}_g$ and $\mathcal{A}^F_g$, relating the B-model on $V_τ$ and the A-model on its mirror, a compact $2g$-dimensional torus $\mathbb{T}^{2g}$ equipped with a complexified symplectic form. For each $V_τ$, we establish a homological mirror symmetry (HMS) result at the cohomological level over $\mathbb{C}$. This implies core HMS at the cohomological level over $\mathbb{C}$ and a graded $\mathbb{C}$-algebra isomorphism known as Seidel's mirror map. We study global HMS where Floer cohomology groups $HF^*(\hat{\ell}, \hat{\ell}')$ form coherent sheaves over a complex manifold parametrizing triples $(τ, \hat{\ell}, \hat{\ell}')$ where $τ\in \mathcal{H}_g$ defines a complexified symplectic form $ω_τ$ on $\mathbb{T}^{2g}$ and $\hat{\ell}$, $\hat{\ell} '$ are affine Lagrangian branes in $(\mathbb{T}^{2g}, ω_τ)$. △ Less

Submitted 26 March, 2025; originally announced March 2025.

MSC Class: 53D37 (primary) 14J33 (secondary)

arXiv:2503.12439 [pdf, ps, other]

Finite-time blowup in a fully parabolic chemotaxis model involving indirect signal production

Authors: Xuan Mao, Meng Liu, Yuxiang Li

Abstract: This paper is concerned with a parabolic-parabolic-parabolic chemotaxis system with indirect signal production, modelling the impact of phenotypic heterogeneity on population aggregation \begin{equation*} \begin{cases} u_t = Δu - \nabla\cdot(u\nabla v),\\ v_t = Δv - v + w,\\ w_t = Δw - w + u, \end{cases} \end{equation*} posed on a ball in $\mathbb R^n$ with $n\geq5$, subject to homogeneous Neu… ▽ More This paper is concerned with a parabolic-parabolic-parabolic chemotaxis system with indirect signal production, modelling the impact of phenotypic heterogeneity on population aggregation \begin{equation*} \begin{cases} u_t = Δu - \nabla\cdot(u\nabla v),\\ v_t = Δv - v + w,\\ w_t = Δw - w + u, \end{cases} \end{equation*} posed on a ball in $\mathbb R^n$ with $n\geq5$, subject to homogeneous Neumann boundary conditions. The system has a four-dimensional critical mass phenomenon concerning blowup in finite or infinite time according to the seminal works of Fujie and Senba [J. Differential Equations, 263 (2017), 88--148; 266 (2019), 942--976]. We prove that for any prescribed mass $m > 0$, there exist radially symmetric and nonnegative initial data $(u_0,v_0,w_0)\in C^0(\overlineΩ)\times C^2(\overlineΩ)\times C^2(\overlineΩ)$ with $\int_Ωu_0 = m$ such that the corresponding classical solutions blow up in finite time. The key ingredient is a novel integral inequality for the cross-term integral $\int_Ωuv$ constructed via a Lyapunov functional. △ Less

Submitted 16 March, 2025; originally announced March 2025.

Comments: 20 pages

arXiv:2503.10272 [pdf, ps, other]

Symmetry and classification of positive solutions of some weighted elliptic equations

Authors: Kui Li, Mengyao Liu, Jianfeng Wu

Abstract: We study the weighted elliptic equation \begin{equation} -div(|x|^{-2a}\nabla u)=|x|^{-bp}|u|^{p-2}u~~~\mbox{in}~\mathbb{R}^N ~~~~~~~~~~~~~~~~~~~~(0.1)\end{equation} with $N\geq 2$, which arises from the Caffarelli-Kohn-Nirenberg inequalities. Under the assumptions of finite energy and $a_1+a_2=N-2$, for nonnegative solutions we prove the equivalence between equation (0.1) with $a=a_1$ and equatio… ▽ More We study the weighted elliptic equation \begin{equation} -div(|x|^{-2a}\nabla u)=|x|^{-bp}|u|^{p-2}u~~~\mbox{in}~\mathbb{R}^N ~~~~~~~~~~~~~~~~~~~~(0.1)\end{equation} with $N\geq 2$, which arises from the Caffarelli-Kohn-Nirenberg inequalities. Under the assumptions of finite energy and $a_1+a_2=N-2$, for nonnegative solutions we prove the equivalence between equation (0.1) with $a=a_1$ and equation (0.1) with $a=a_2$. Without finite energy assumptions, for $2\leq p<2^*$ we give the optimal parameter range in which nonnegative solutions of (0.1) in $\mathbf{L}^\infty_{Loc}(\mathbb{R}^N)$ must be radially symmetric, and give a complete classification for these solutions in this range. △ Less

Submitted 13 March, 2025; originally announced March 2025.

arXiv:2503.03908 [pdf, other]

On the Convergence of Adam-Type Algorithm for Bilevel Optimization under Unbounded Smoothness

Authors: Xiaochuan Gong, Jie Hao, Mingrui Liu

Abstract: Adam has become one of the most popular optimizers for training modern deep neural networks, such as transformers. However, its applicability is largely restricted to single-level optimization problems. In this paper, we aim to extend vanilla Adam to tackle bilevel optimization problems, which have important applications in machine learning, such as meta-learning. In particular, we study stochasti… ▽ More Adam has become one of the most popular optimizers for training modern deep neural networks, such as transformers. However, its applicability is largely restricted to single-level optimization problems. In this paper, we aim to extend vanilla Adam to tackle bilevel optimization problems, which have important applications in machine learning, such as meta-learning. In particular, we study stochastic bilevel optimization problems where the lower-level function is strongly convex and the upper-level objective is nonconvex with potentially unbounded smoothness. This unbounded smooth objective function covers a broad class of neural networks, including transformers, which may exhibit non-Lipschitz gradients. In this work, we introduce AdamBO, a single-loop Adam-type method that achieves $\widetilde{O}(ε^{-4})$ oracle complexity to find $ε$-stationary points, where the oracle calls involve stochastic gradient or Hessian/Jacobian-vector product evaluations. The key to our analysis is a novel randomness decoupling lemma that provides refined control over the lower-level variable. We conduct extensive experiments on various machine learning tasks involving bilevel formulations with recurrent neural networks (RNNs) and transformers, demonstrating the effectiveness of our proposed Adam-type algorithm. △ Less

Submitted 5 March, 2025; originally announced March 2025.

Comments: 49 pages, 5 figures

arXiv:2503.03188 [pdf, ps, other]

The Hille-Yosida theorem for $C$-semigroups on a complete random normed module

Authors: Xia Zhang, Leilei Wei, Ming Liu

Abstract: In this paper, we first introduce the notion of the Laplace transform for an abstract-valued function on a complete random normed module $\mathcal{S}$. Then, utilizing the countable concatenation property of the $(\varepsilon, λ)-$topology on $\mathcal{S}$, we prove the differentiability, Post-Widder inversion formula and uniqueness of such a Laplace transform. Second, based on the above work, we… ▽ More In this paper, we first introduce the notion of the Laplace transform for an abstract-valued function on a complete random normed module $\mathcal{S}$. Then, utilizing the countable concatenation property of the $(\varepsilon, λ)-$topology on $\mathcal{S}$, we prove the differentiability, Post-Widder inversion formula and uniqueness of such a Laplace transform. Second, based on the above work, we establish the Hille-Yosida theorem for an exponentially bounded $C$-semigroup on $\mathcal{S}$, considering both the dense and nondense cases of the range of $C$, respectively, which extends and improves several important results. Besides, an example constructed in this paper exhibits that the domain of the generator $A$ of an exponentially bounded $C$-semigroup may not be dense on a nontrivial complete random normed module. Finally, we also apply such a Laplace transform to abstract Cauchy problems in the random setting. △ Less

Submitted 5 March, 2025; originally announced March 2025.

Comments: 27 pages

MSC Class: 46H25; 45R05

arXiv:2503.03096 [pdf, ps, other]

$C$-existence families, $C$-semigroups and their associated abstract Cauchy problems in complete random normed modules

Authors: Xia Zhang, Leilei Wei, Ming Liu

Abstract: In this paper, we first introduce the notion of a (mild) $C$-existence family in complete random normed modules, then we prove that a (mild) $C$-existence family can guarantee the existence of the (mild) solutions of the associated abstract Cauchy problem in the random setting. Second, we investigate several important properties peculiar to locally almost surely bounded $C$-semigroups in complete… ▽ More In this paper, we first introduce the notion of a (mild) $C$-existence family in complete random normed modules, then we prove that a (mild) $C$-existence family can guarantee the existence of the (mild) solutions of the associated abstract Cauchy problem in the random setting. Second, we investigate several important properties peculiar to locally almost surely bounded $C$-semigroups in complete random normed modules, which are not involved in the classical theory of $C$-semigroups. Finally, based on the above work, some relations among $C$-existence families, $C$-semigroups and their associated abstract Cauchy problems in complete random normed modules are established, which extend and improve some known results. Besides, an application to a type of stochastic differential equations is also given. △ Less

Submitted 12 April, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

Comments: 21 pages

MSC Class: 46H25; 45R05; 46N30

arXiv:2502.21211 [pdf, ps, other]

Parabolic presentations of Yangian in types $B$ and $C$

Authors: Zhihua Chang, Naihuan Jing, Ming Liu, Haitao Ma

Abstract: We establish a parabolic presentation of the extended Yangian $\X(\mathfrak{g}_{N})$ associated with the Lie algebras $\mathfrak{g}_{N}$ of type $B$ and $C$, parameterized by a symmetric composition $ν$ of $N$. By formulating a block matrix version of the RTT presentation of $\X(\mathfrak{g}_{N})$, we systematically derive the generators and relations through the Gauss decomposition of the generat… ▽ More We establish a parabolic presentation of the extended Yangian $\X(\mathfrak{g}_{N})$ associated with the Lie algebras $\mathfrak{g}_{N}$ of type $B$ and $C$, parameterized by a symmetric composition $ν$ of $N$. By formulating a block matrix version of the RTT presentation of $\X(\mathfrak{g}_{N})$, we systematically derive the generators and relations through the Gauss decomposition of the generator matrix in $ν$-block form. Furthermore, leveraging this parabolic presentation, we obtain a novel formula for the center of $\X(\mathfrak{g}_{N})$, offering new insights into its structure. △ Less

Submitted 8 March, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

Comments: 50 pp; updated references

MSC Class: Primary: 17B37; secondary: 81R51

arXiv:2501.00853 [pdf, ps, other]

A dual representation theorem on the conditional Orlicz space generated from a random normed module

Authors: Xia Zhang, Ke Qian, Ming Liu

Abstract: In this paper, we first introduce the notion of a random Orlicz function, and further present the conditional Orlicz space generated from a random normed module. Second, we prove the denseness of the Orlicz heart of a random normed module $E$ in $E$ with respect to the $(\varepsilon, λ)$-topology. Finally, based on the above work, we establish a dual representation theorem on the conditional Orlic… ▽ More In this paper, we first introduce the notion of a random Orlicz function, and further present the conditional Orlicz space generated from a random normed module. Second, we prove the denseness of the Orlicz heart of a random normed module $E$ in $E$ with respect to the $(\varepsilon, λ)$-topology. Finally, based on the above work, we establish a dual representation theorem on the conditional Orlicz space generated from a random normed module, which extends and improves some known results. △ Less

Submitted 1 January, 2025; originally announced January 2025.

Comments: 11 pages

MSC Class: 46H25; 46E30; 46B20

arXiv:2412.20017 [pdf, other]

A Nearly Optimal Single Loop Algorithm for Stochastic Bilevel Optimization under Unbounded Smoothness

Authors: Xiaochuan Gong, Jie Hao, Mingrui Liu

Abstract: This paper studies the problem of stochastic bilevel optimization where the upper-level function is nonconvex with potentially unbounded smoothness and the lower-level function is strongly convex. This problem is motivated by meta-learning applied to sequential data, such as text classification using recurrent neural networks, where the smoothness constant of the upper-level loss function scales l… ▽ More This paper studies the problem of stochastic bilevel optimization where the upper-level function is nonconvex with potentially unbounded smoothness and the lower-level function is strongly convex. This problem is motivated by meta-learning applied to sequential data, such as text classification using recurrent neural networks, where the smoothness constant of the upper-level loss function scales linearly with the gradient norm and can be potentially unbounded. Existing algorithm crucially relies on the nested loop design, which requires significant tuning efforts and is not practical. In this paper, we address this issue by proposing a Single Loop bIlevel oPtimizer (SLIP). The proposed algorithm first updates the lower-level variable by a few steps of stochastic gradient descent, and then simultaneously updates the upper-level variable by normalized stochastic gradient descent with momentum and the lower-level variable by stochastic gradient descent. Under standard assumptions, we show that our algorithm finds an $ε$-stationary point within $\widetilde{O}(1/ε^4)$\footnote{Here $\widetilde{O}(\cdot)$ compresses logarithmic factors of $1/ε$ and $1/δ$, where $δ\in(0,1)$ denotes the failure probability.} oracle calls of stochastic gradient or Hessian-vector product, both in expectation and with high probability. This complexity result is nearly optimal up to logarithmic factors without mean-square smoothness of the stochastic gradient oracle. Our proof relies on (i) a refined characterization and control of the lower-level variable and (ii) establishing a novel connection between bilevel optimization and stochastic optimization under distributional drift. Our experiments on various tasks show that our algorithm significantly outperforms strong baselines in bilevel optimization. △ Less

Submitted 27 December, 2024; originally announced December 2024.

Comments: ICML 2024

arXiv:2412.11626 [pdf, ps, other]

Quasi-geodesics in integrable and non-integrable exclusion processes

Authors: Patrik L. Ferrari, Min Liu

Abstract: Backwards geodesics for TASEP were introduced in [Fer18]. We consider flat initial conditions and show that under proper scaling its end-point converges to maximizer argument of the Airy$_2$ process minus a parabola. We generalize its definition to generic non-integrable models including ASEP and speed changed ASEP (call it quasi-geodesics). We numerically verify that its end-point is universal, w… ▽ More Backwards geodesics for TASEP were introduced in [Fer18]. We consider flat initial conditions and show that under proper scaling its end-point converges to maximizer argument of the Airy$_2$ process minus a parabola. We generalize its definition to generic non-integrable models including ASEP and speed changed ASEP (call it quasi-geodesics). We numerically verify that its end-point is universal, where the scaling coefficients are analytically computed through the KPZ scaling theory. △ Less

Submitted 16 December, 2024; originally announced December 2024.

arXiv:2411.19904 [pdf, other]

doi 10.1007/s41980-025-00990-4

Normed modules, integral sequences, and integrals with variable upper limits

Authors: Miantao Liu, Yu-Zhe Liu, Shengda Liu

Abstract: This paper provides a new categorification of the Lebesgue integral with variable upper limits by using normed modules over finite-dimensional $\Bbbk$-algebras $\mathitΛ$ and the category $\mathscr{A}^p_{\mathitΛ}$ associated with $\mathitΛ$. The integration process is redefined through the introduction of an integral partially ordered set and an abstract integral with variable upper limits. Final… ▽ More This paper provides a new categorification of the Lebesgue integral with variable upper limits by using normed modules over finite-dimensional $\Bbbk$-algebras $\mathitΛ$ and the category $\mathscr{A}^p_{\mathitΛ}$ associated with $\mathitΛ$. The integration process is redefined through the introduction of an integral partially ordered set and an abstract integral with variable upper limits. Finally, we present two important applications: (1) the categorification of basic elementary functions, including (anti-)trigonometric and logarithmic functions, and (2) a new approach for characterizing the global dimensions of gentle algebras. △ Less

Submitted 30 April, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

Comments: 28 pages; 2 figures

MSC Class: 16D10; 16G10; 46H25

arXiv:2411.13913 [pdf, ps, other]

Generalizing subdiffusive Black-Scholes model by variable exponent: Model transformation and numerical approximation

Authors: Meihui Zhang, Mengmeng Liu, Wenlin Qiu, Xiangcheng Zheng

Abstract: This work generalizes the subdiffusive Black-Scholes model by introducing the variable exponent in order to provide adequate descriptions for the option pricing, where the variable exponent may account for the variation of the memory property. In addition to standard nonlinear-to-linear transformation, we apply a further spatial-temporal transformation to convert the model to a more tractable form… ▽ More This work generalizes the subdiffusive Black-Scholes model by introducing the variable exponent in order to provide adequate descriptions for the option pricing, where the variable exponent may account for the variation of the memory property. In addition to standard nonlinear-to-linear transformation, we apply a further spatial-temporal transformation to convert the model to a more tractable form in order to circumvent the difficulties caused by the ``non-positive, non-monotonic'' variable-exponent memory kernel. An interesting phenomenon is that the spatial transformation not only eliminates the advection term but naturally turns the original noncoercive spatial operator into a coercive one due to the specific structure of the Black-Scholes model, which thus avoids imposing constraints on coefficients. Then we perform numerical analysis for both the semi-discrete and fully discrete schemes to support numerical simulation. Numerical experiments are carried out to substantiate the theoretical results. △ Less

Submitted 21 November, 2024; originally announced November 2024.

arXiv:2411.10151 [pdf, other]

Single-Frequency Self-Alignment RF Resonant Beam for Information and Power Transfer

Authors: Qingwei Jiang, Mingqing Liu, Mengyuan Xu, Wen Fang, Mingliang Xiong, Qingwen Liu, Shengli Zhou

Abstract: Due to power attenuation, improving transmission efficiency in the radio-frequency (RF) band remains a significant challenge, which hinders advancements in various fields of the Internet of Things (IoT), such as wireless power transfer (WPT) and wireless communication. Array design and retro-directive beamforming (RD-BF) techniques offer simple and effective ways to enhance transmission efficiency… ▽ More Due to power attenuation, improving transmission efficiency in the radio-frequency (RF) band remains a significant challenge, which hinders advancements in various fields of the Internet of Things (IoT), such as wireless power transfer (WPT) and wireless communication. Array design and retro-directive beamforming (RD-BF) techniques offer simple and effective ways to enhance transmission efficiency. However, when the target is an array or in the near field, the RD-BF system (RD-BFS) cannot radiate more energy to the target due to phase irregularities in the target region, resulting in challenges in achieving higher efficiency. To address this issue, we propose the RF-based resonant beam system (RF-RBS), which adaptively optimizes phase and power distribution between transmitting and receiving arrays by leveraging the resonance mechanism to achieve higher transmission efficiency. We analyze the system structure and develop an analytical model to evaluate power flow and resonance establishment. Numerical analysis demonstrates that the proposed RF-RBS achieves self-alignment without beam control and provides higher transmission efficiency compared to RD-BFS, with improvements of up to 16%. This self-alignment capability allows the system to effectively transfer power and information across varying distances and offsets. The numerical results indicate the capability to transmit watt-level power and achieve 21 bps/Hz of downlink spectral efficiency in indoor settings, highlighting the advantages of RF-RBS in information and power transfer for mobile applications. △ Less

Submitted 24 January, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

arXiv:2411.07908 [pdf, ps, other]

Asymptotically sharp bounds for cancellative and union-free hypergraphs

Authors: Miao Liu, Chong Shangguan, Chenyang Zhang

Abstract: An $r$-graph is called $t$-cancellative if for arbitrary $t+2$ distinct edges $A_1,\ldots,A_t,B,C$, it holds that $(\cup_{i=1}^t A_i)\cup B\neq (\cup_{i=1}^t A_i)\cup C$; it is called $t$-union-free if for arbitrary two distinct subsets $\mathcal{A},\mathcal{B}$, each consisting of at most $t$ edges, it holds that $\cup_{A\in\mathcal{A}} A\neq \cup_{B\in\mathcal{B}} B$. Let $C_t(n,r)$ and… ▽ More An $r$-graph is called $t$-cancellative if for arbitrary $t+2$ distinct edges $A_1,\ldots,A_t,B,C$, it holds that $(\cup_{i=1}^t A_i)\cup B\neq (\cup_{i=1}^t A_i)\cup C$; it is called $t$-union-free if for arbitrary two distinct subsets $\mathcal{A},\mathcal{B}$, each consisting of at most $t$ edges, it holds that $\cup_{A\in\mathcal{A}} A\neq \cup_{B\in\mathcal{B}} B$. Let $C_t(n,r)$ and $U_t(n,r)$ denote the maximum number of edges that can be contained in an $n$-vertex $t$-cancellative and $t$-union-free $r$-graph, respectively. The study of $C_t(n,r)$ and $U_t(n,r)$ has a long history, dating back to the classic works of Erdős and Katona, and Erdős and Moser in the 1970s. In 2020, Shangguan and Tamo showed that $C_{2(t-1)}(n,tk)=Θ(n^k)$ and $U_{t+1}(n,tk)=Θ(n^k)$ for all $t\ge 2$ and $k\ge 2$. In this paper, we determine the asymptotics of these two functions up to a lower order term, by showing that for all $t\ge 2$ and $k\ge 2$, \begin{align*} \text{$\lim_{n\rightarrow\infty}\frac{C_{2(t-1)}(n,tk)}{n^k}=\lim_{n\rightarrow\infty}\frac{U_{t+1}(n,tk)}{n^k}=\frac{1}{k!}\cdot \frac{1}{\binom{tk-1}{k-1}}$.} \end{align*} Previously, it was only known by a result of Füredi in 2012 that $\lim_{n\rightarrow\infty}\frac{C_{2}(n,4)}{n^2}=\frac{1}{6}$. To prove the lower bounds of the limits, we utilize a powerful framework developed recently by Delcourt and Postle, and independently by Glock, Joos, Kim, Kühn, and Lichev, which shows the existence of near-optimal hypergraph packings avoiding certain small configurations, and to prove the upper bounds, we apply a novel counting argument that connects $C_{2(t-1)}(n,tk)$ to a classic result of Kleitman and Frankl on a special case of the famous Erdős Matching Conjecture. △ Less

Submitted 12 November, 2024; originally announced November 2024.

Comments: 21 pages

arXiv:2410.16506 [pdf, other]

ReLU neural network approximation to piecewise constant functions

Authors: Zhiqiang Cai, Junpyo Choi, Min Liu

Abstract: This paper studies the approximation property of ReLU neural networks (NNs) to piecewise constant functions with unknown interfaces in bounded regions in $\mathbb{R}^d$. Under the assumption that the discontinuity interface $Γ$ may be approximated by a connected series of hyperplanes with a prescribed accuracy $\varepsilon >0$, we show that a three-layer ReLU NN is sufficient to accurately approxi… ▽ More This paper studies the approximation property of ReLU neural networks (NNs) to piecewise constant functions with unknown interfaces in bounded regions in $\mathbb{R}^d$. Under the assumption that the discontinuity interface $Γ$ may be approximated by a connected series of hyperplanes with a prescribed accuracy $\varepsilon >0$, we show that a three-layer ReLU NN is sufficient to accurately approximate any piecewise constant function and establish its error bound. Moreover, if the discontinuity interface is convex, an analytical formula of the ReLU NN approximation with exact weights and biases is provided. △ Less

Submitted 21 October, 2024; originally announced October 2024.

Comments: 17 pages, 8 figures, submitted to the journal

MSC Class: 68T07; 41A25; 41A46

arXiv:2410.13617 [pdf, ps, other]

Weyl group symmetries of the toric variety associated with Weyl chambers

Authors: Tao Gui, Hongsheng Hu, Minhua Liu

Abstract: For any crystallographic root system, let $W$ be the associated Weyl group, and let $\mathit{WP}$ be the weight polytope (also known as the $W$-permutohedron) associated with an arbitrary strongly dominant weight. The action of $W$ on $\mathit{WP}$ induces an action on the toric variety $X(\mathit{WP})$ associated with the normal fan of $\mathit{WP}$, and hence an action on the rational cohomology… ▽ More For any crystallographic root system, let $W$ be the associated Weyl group, and let $\mathit{WP}$ be the weight polytope (also known as the $W$-permutohedron) associated with an arbitrary strongly dominant weight. The action of $W$ on $\mathit{WP}$ induces an action on the toric variety $X(\mathit{WP})$ associated with the normal fan of $\mathit{WP}$, and hence an action on the rational cohomology ring $H^*\left(X(\mathit{WP})\right)$. Let $P$ be the corresponding dominant weight polytope, which is a fundamental region of the $W$-action on $\mathit{WP}$. We give a type uniform algebraic proof that the fixed subring $H^*\left(X(\mathit{WP})\right)^{W}$ is isomorphic to the cohomology ring $H^*\left(X(P)\right)$ of the toric variety $X(P)$ associated with the normal fan of $P$. Notably, our proof applies to all finite (not necessarily crystallographic) Coxeter groups, answering a question of Horiguchi--Masuda--Shareshian--Song about non-crystallographic root systems. △ Less

Submitted 17 October, 2024; originally announced October 2024.

Comments: 14 pages, comments are welcome!

MSC Class: Primary 13A50; Secondary 14M25; 17B22; 52B05

arXiv:2410.08506 [pdf, ps, other]

doi 10.1016/j.geomphys.2025.105535

Spectral forms and de-Rham Hodge operator

Authors: Jian Wang, Yong Wang, Mingyu Liu

Abstract: Motivated by the trilinear functional of differential one-forms, spectral triple and spectral torsion for the Hodge-Dirac operator, we introduce a multilinear functional of differential one-forms for a finitely summable regular spectral triple with a noncommutative residue, which generalize the spectral torsion defined by Dabrowski-Sitarz-Zalecki. The main results of this paper recover two forms,… ▽ More Motivated by the trilinear functional of differential one-forms, spectral triple and spectral torsion for the Hodge-Dirac operator, we introduce a multilinear functional of differential one-forms for a finitely summable regular spectral triple with a noncommutative residue, which generalize the spectral torsion defined by Dabrowski-Sitarz-Zalecki. The main results of this paper recover two forms, torsion of the linear connection and four forms by the noncommutative residue and perturbed de-Rham Hodge operators, and provides an explicit computation of generalized spectral torsion associated with the perturbed de-Rham Hodge Dirac triple. △ Less

Submitted 14 October, 2024; v1 submitted 11 October, 2024; originally announced October 2024.

Comments: arXiv admin note: text overlap with arXiv:2408.07149

Journal ref: Journal of Geometry and Physics .Volume 214, August 2025, 105535

arXiv:2410.06105 [pdf, other]

Passive inverse obstacle scattering problems for the Helmholtz equation

Authors: Thorsten Hohage, Meng Liu

Abstract: Passive imaging involves recording waves generated by uncontrolled, random sources and utilizing correlations of such waves to image the medium through which they propagate. In this paper, we focus on passive inverse obstacle scattering problems governed by the Helmholtz equation in $\mathbb{R}^d\;(d=2,3)$. The random source is modelled by a Gaussian random process. Uniqueness results are establis… ▽ More Passive imaging involves recording waves generated by uncontrolled, random sources and utilizing correlations of such waves to image the medium through which they propagate. In this paper, we focus on passive inverse obstacle scattering problems governed by the Helmholtz equation in $\mathbb{R}^d\;(d=2,3)$. The random source is modelled by a Gaussian random process. Uniqueness results are established for the inverse problems to determine the source strength or shape and location of an obstacle, or both of them simultaneously from near-field correlation measurements. Finally, we present efficient methods for numerical reconstructions. △ Less

Submitted 8 October, 2024; originally announced October 2024.

arXiv:2409.19212 [pdf, other]

An Accelerated Algorithm for Stochastic Bilevel Optimization under Unbounded Smoothness

Authors: Xiaochuan Gong, Jie Hao, Mingrui Liu

Abstract: This paper investigates a class of stochastic bilevel optimization problems where the upper-level function is nonconvex with potentially unbounded smoothness and the lower-level problem is strongly convex. These problems have significant applications in sequential data learning, such as text classification using recurrent neural networks. The unbounded smoothness is characterized by the smoothness… ▽ More This paper investigates a class of stochastic bilevel optimization problems where the upper-level function is nonconvex with potentially unbounded smoothness and the lower-level problem is strongly convex. These problems have significant applications in sequential data learning, such as text classification using recurrent neural networks. The unbounded smoothness is characterized by the smoothness constant of the upper-level function scaling linearly with the gradient norm, lacking a uniform upper bound. Existing state-of-the-art algorithms require $\widetilde{O}(1/ε^4)$ oracle calls of stochastic gradient or Hessian/Jacobian-vector product to find an $ε$-stationary point. However, it remains unclear if we can further improve the convergence rate when the assumptions for the function in the population level also hold for each random realization almost surely. To address this issue, we propose a new Accelerated Bilevel Optimization algorithm named AccBO. The algorithm updates the upper-level variable by normalized stochastic gradient descent with recursive momentum and the lower-level variable by the stochastic Nesterov accelerated gradient descent algorithm with averaging. We prove that our algorithm achieves an oracle complexity of $\widetilde{O}(1/ε^3)$ to find an $ε$-stationary point, when the lower-level stochastic gradient's variance is $O(ε)$. Our proof relies on a novel lemma characterizing the dynamics of stochastic Nesterov accelerated gradient descent algorithm under distribution drift with high probability for the lower-level variable, which is of independent interest and also plays a crucial role in analyzing the hypergradient estimation error over time. Experimental results on various tasks confirm that our proposed algorithm achieves the predicted theoretical acceleration and significantly outperforms baselines in bilevel optimization. △ Less

Submitted 15 January, 2025; v1 submitted 27 September, 2024; originally announced September 2024.

Comments: Accepted by NeurIPS 2024. The code is available at https://github.com/MingruiLiu-ML-Lab/Accelerated-Bilevel-Optimization-Unbounded-Smoothness

arXiv:2409.16836 [pdf, ps, other]

Strong holomorphic Morse inequalities on non-compact complex manifolds with optimal fundamental estimate

Authors: Manli Liu, Guokuan Shao, Wenxuan Wang

Abstract: In this paper, we establish strong holomorphic Morse inequalities on non-compact manifolds under the condition of optimal fundamental estimates. We show that optimal fundamental estimates are satisfied and then strong holomorphic Morse inequalities hold true in various settings. In this paper, we establish strong holomorphic Morse inequalities on non-compact manifolds under the condition of optimal fundamental estimates. We show that optimal fundamental estimates are satisfied and then strong holomorphic Morse inequalities hold true in various settings. △ Less

Submitted 25 September, 2024; v1 submitted 25 September, 2024; originally announced September 2024.

MSC Class: Primary: 32A25; 53C55; 32W05; 32L10

arXiv:2409.16683 [pdf, other]

Robust Max Statistics for High-Dimensional Inference

Authors: Mingshuo Liu, Miles E. Lopes

Abstract: Although much progress has been made in the theory and application of bootstrap approximations for max statistics in high dimensions, the literature has largely been restricted to cases involving light-tailed data. To address this issue, we propose an approach to inference based on robust max statistics, and we show that their distributions can be accurately approximated via bootstrapping when the… ▽ More Although much progress has been made in the theory and application of bootstrap approximations for max statistics in high dimensions, the literature has largely been restricted to cases involving light-tailed data. To address this issue, we propose an approach to inference based on robust max statistics, and we show that their distributions can be accurately approximated via bootstrapping when the data are both high-dimensional and heavy-tailed. In particular, the data are assumed to satisfy an extended version of the well-established $L^{4}$-$L^2$ moment equivalence condition, as well as a weak variance decay condition. In this setting, we show that near-parametric rates of bootstrap approximation can be achieved in the Kolmogorov metric, independently of the data dimension. Moreover, this theoretical result is complemented by favorable empirical results involving both synthetic data and an application to financial data. △ Less

Submitted 25 September, 2024; originally announced September 2024.

arXiv:2409.14499 [pdf, other]

A Review of Scalable and Privacy-Preserving Multi-Agent Frameworks for Distributed Energy Resources

Authors: Xiang Huo, Hao Huang, Katherine R. Davis, H. Vincent Poor, Mingxi Liu

Abstract: Distributed energy resources (DERs) are gaining prominence due to their advantages in improving energy efficiency, reducing carbon emissions, and enhancing grid resilience. Despite the increasing deployment, the potential of DERs has yet to be fully explored and exploited. A fundamental question restrains the management of numerous DERs in large-scale power systems, "How should DER data be securel… ▽ More Distributed energy resources (DERs) are gaining prominence due to their advantages in improving energy efficiency, reducing carbon emissions, and enhancing grid resilience. Despite the increasing deployment, the potential of DERs has yet to be fully explored and exploited. A fundamental question restrains the management of numerous DERs in large-scale power systems, "How should DER data be securely processed and DER operations be efficiently optimized?" To address this question, this paper considers two critical issues, namely privacy for processing DER data and scalability in optimizing DER operations, then surveys existing and emerging solutions from a multi-agent framework perspective. In the context of scalability, this paper reviews state-of-the-art research that relies on parallel control, optimization, and learning within distributed and/or decentralized information exchange structures, while in the context of privacy, it identifies privacy preservation measures that can be synthesized into the aforementioned scalable structures. Despite research advances in these areas, challenges remain because these highly interdisciplinary studies blend a wide variety of scalable computing architectures and privacy preservation techniques from different fields, making them difficult to adapt in practice. To mitigate this issue, this paper provides a holistic review of trending strategies that orchestrate privacy and scalability for large-scale power system operations from a multi-agent perspective, particularly for DER control problems. Furthermore, this review extrapolates new approaches for future scalable, privacy-aware, and cybersecure pathways to unlock the full potential of DERs through controlling, optimizing, and learning generic multi-agent-based cyber-physical systems. △ Less

Submitted 11 November, 2024; v1 submitted 22 September, 2024; originally announced September 2024.

arXiv:2409.10091 [pdf, ps, other]

Multidimensional analogues of the refined versions of Bohr inequalities involving Schwarz mappings

Authors: Shanshan Jia, Ming-Sheng Liu, Saminathan Ponnusamy

Abstract: Our first aim of this article is to establish several new versions of refined Bohr inequalities for bounded analytic functions in the unit disk involving Schwarz functions. Secondly, %as applications of these results, we obtain several new multidimensional analogues of the refined Bohr inequalities for bounded holomorphic mappings on the unit ball in a complex Banach space involving higher dimensi… ▽ More Our first aim of this article is to establish several new versions of refined Bohr inequalities for bounded analytic functions in the unit disk involving Schwarz functions. Secondly, %as applications of these results, we obtain several new multidimensional analogues of the refined Bohr inequalities for bounded holomorphic mappings on the unit ball in a complex Banach space involving higher dimensional Schwarz mappings. All the results are proved to be sharp. △ Less

Submitted 16 September, 2024; originally announced September 2024.

Comments: 25 pages; It is with a journal

MSC Class: Primary: 30A10; 30C45; 30C62; Secondary: 30C75

arXiv:2409.09624 [pdf, ps, other]

Advancements in Log-P-Analytic Functions: Landau-Type Theorems and Their Refinements

Authors: Hanghang Zhao, Ming-Sheng Liu, Kit Ian Kou

Abstract: This work begins by introducing the groundbreaking concept of log-p-analytic functions. Following this introduction, we proceed to delineate four distinct formulations of Landau-type theorems, specifically crafted for the domain of poly-analytic functions. Among these, two theorems are distinguished by their exactitude, and a third theorem offers a refinement to the existing work of Abdulhadi and… ▽ More This work begins by introducing the groundbreaking concept of log-p-analytic functions. Following this introduction, we proceed to delineate four distinct formulations of Landau-type theorems, specifically crafted for the domain of poly-analytic functions. Among these, two theorems are distinguished by their exactitude, and a third theorem offers a refinement to the existing work of Abdulhadi and Hajj. Concluding the paper, we present four specialized versions of Landau-type theorems applicable to a subset of bounded log-p-analytic functions, resulting in the derivation of two precise outcomes. △ Less

Submitted 15 September, 2024; originally announced September 2024.

Comments: 18 pages, this article is with a journal since June 2024

arXiv:2409.07745 [pdf, other]

Generalized Independence Test for Modern Data

Authors: Mingshuo Liu, Doudou Zhou, Hao Chen

Abstract: The test of independence is a crucial component of modern data analysis. However, traditional methods often struggle with the complex dependency structures found in high-dimensional data. To overcome this challenge, we introduce a novel test statistic that captures intricate relationships using similarity and dissimilarity information derived from the data. The statistic exhibits strong power acro… ▽ More The test of independence is a crucial component of modern data analysis. However, traditional methods often struggle with the complex dependency structures found in high-dimensional data. To overcome this challenge, we introduce a novel test statistic that captures intricate relationships using similarity and dissimilarity information derived from the data. The statistic exhibits strong power across a broad range of alternatives for high-dimensional data, as demonstrated in extensive simulation studies. Under mild conditions, we show that the new test statistic converges to the $χ^2_4$ distribution under the permutation null distribution, ensuring straightforward type I error control. Furthermore, our research advances the moment method in proving the joint asymptotic normality of multiple double-indexed permutation statistics. We showcase the practical utility of this new test with an application to the Genotype-Tissue Expression dataset, where it effectively measures associations between human tissues. △ Less

Submitted 12 September, 2024; originally announced September 2024.

arXiv:2409.06279 [pdf, ps, other]

The Radon-Nikod$\acute{Y}$m property of $\mathbb{L}$-Banach spaces and the dual representation theorem of $\mathbb{L}$-Bochner function spaces

Authors: Xia Zhang, Xiangle Yan, Ming Liu

Abstract: In this paper, we first introduce $\mathbb{L}$-$μ$-measurable functions and $\mathbb{L}$-Bochner integrable functions on a finite measure space $(S,\mathcal{F},μ),$ and give an $\mathbb{L}$-valued analogue of the canonical $L^{p}(Ω,\mathcal{F},μ).$ Then we investigate the completeness of such an $\mathbb{L}$-valued analogue and propose the Radon-Nikod$\acute{y}$m property of $\mathbb{L}$-Banach sp… ▽ More In this paper, we first introduce $\mathbb{L}$-$μ$-measurable functions and $\mathbb{L}$-Bochner integrable functions on a finite measure space $(S,\mathcal{F},μ),$ and give an $\mathbb{L}$-valued analogue of the canonical $L^{p}(Ω,\mathcal{F},μ).$ Then we investigate the completeness of such an $\mathbb{L}$-valued analogue and propose the Radon-Nikod$\acute{y}$m property of $\mathbb{L}$-Banach spaces. Meanwhile, an example constructed in this paper shows that there do exist an $\mathbb{L}$-Banach space which fails to possess the Radon-Nikod$\acute{y}$m property. Finally, based on above work, we establish the dual representation theorem of $\mathbb{L}$-Bochner integrable function spaces, which extends and improves the corresponding classical result. △ Less

Submitted 10 September, 2024; originally announced September 2024.

MSC Class: Primary 46B22; 46B10; Secondary 46E30

arXiv:2409.06175 [pdf, other]

Involution matrix loci and orbit harmonics

Authors: Moxuan J. Liu, Yichen Ma, Brendon Rhoades, Hai Zhu

Abstract: Let $\mathrm{Mat}_{n \times n}(\mathbb{C})$ be the affine space of $n \times n$ complex matrices with coordinate ring $\mathbb{C}[\mathbf{x}_{n \times n}]$. We define graded quotients of $\mathbb{C}[\mathbf{x}_{n \times n}]$ which carry an action of the symmetric group $\mathfrak{S}_n$ by simultaneous permutation of rows and columns. These quotient rings are obtained by applying the orbit harmonic… ▽ More Let $\mathrm{Mat}_{n \times n}(\mathbb{C})$ be the affine space of $n \times n$ complex matrices with coordinate ring $\mathbb{C}[\mathbf{x}_{n \times n}]$. We define graded quotients of $\mathbb{C}[\mathbf{x}_{n \times n}]$ which carry an action of the symmetric group $\mathfrak{S}_n$ by simultaneous permutation of rows and columns. These quotient rings are obtained by applying the orbit harmonics method to matrix loci corresponding to all involutions in $\mathfrak{S}_n$ and the conjugacy classes of involutions in $\mathfrak{S}_n$ with a given number of fixed points. In the case of perfect matchings on $\{1, \dots, n\}$ with $n$ even, the Hilbert series of our quotient ring is related to Tracy-Widom distributions and its graded Frobenius image gives a refinement of the plethysm $s_{n/2}[s_2]$. △ Less

Submitted 9 September, 2024; originally announced September 2024.

Comments: 33 pages

arXiv:2408.09855 [pdf, ps, other]

doi 10.1007/s00220-025-05273-x

The $q$-immanants and higher quantum Capelli identities

Authors: Naihuan Jing, Ming Liu, Alexander Molev

Abstract: We construct polynomials ${\mathbb{S}}_μ(z)$ parameterized by Young diagrams $μ$, whose coefficients are central elements of the quantized enveloping algebra ${\rm U}_q({\mathfrak{gl}}_n)$. Their constant terms coincide with the central elements provided by the general construction of Drinfeld and Reshetikhin. For another special value of $z$, we get $q$-analogues of Okounkov's quantum immanants f… ▽ More We construct polynomials ${\mathbb{S}}_μ(z)$ parameterized by Young diagrams $μ$, whose coefficients are central elements of the quantized enveloping algebra ${\rm U}_q({\mathfrak{gl}}_n)$. Their constant terms coincide with the central elements provided by the general construction of Drinfeld and Reshetikhin. For another special value of $z$, we get $q$-analogues of Okounkov's quantum immanants for ${\mathfrak{gl}}_n$. We show that the Harish-Chandra image of ${\mathbb{S}}_μ(z)$ is a factorial Schur polynomial. We also prove quantum analogues of the higher Capelli identities and derive Newton-type identities. △ Less

Submitted 30 September, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

Comments: 19 pages, more detailed proofs are given, references extended

Journal ref: Commun. Math. Phys. 406 (2025) paper 99

arXiv:2408.05259 [pdf, ps, other]

Unicity problem on meromorphic mappings of complete Kahler manifolds

Authors: Xianjing Dong, Mengyue Liu

Abstract: Nevanlinna's unicity theorems have always held an important position in value distribution theory. The main purpose of this paper is to generalize the classical Nevanlinna's unicity theorems to non-compact complete Kahler manifolds with nonpositive sectional curvature or nonnegative Ricci curvature. Nevanlinna's unicity theorems have always held an important position in value distribution theory. The main purpose of this paper is to generalize the classical Nevanlinna's unicity theorems to non-compact complete Kahler manifolds with nonpositive sectional curvature or nonnegative Ricci curvature. △ Less

Submitted 9 August, 2024; originally announced August 2024.

Comments: 14 pages. arXiv admin note: substantial text overlap with arXiv:2308.16520, arXiv:2301.01295

MSC Class: 32H30; 30D35

arXiv:2408.04469 [pdf, other]

Achieving Robust Data-driven Contextual Decision Making in a Data Augmentation Way

Authors: Zhaoen Li, Maoqi Liu, Zhi-Hai Zhang

Abstract: This paper focuses on the contextual optimization problem where a decision is subject to some uncertain parameters and covariates that have some predictive power on those parameters are available before the decision is made. More specifically, we focus on solving the Wasserstein-distance-based distributionally robust optimization (DRO) model for the problem, which maximizes the worst-case expected… ▽ More This paper focuses on the contextual optimization problem where a decision is subject to some uncertain parameters and covariates that have some predictive power on those parameters are available before the decision is made. More specifically, we focus on solving the Wasserstein-distance-based distributionally robust optimization (DRO) model for the problem, which maximizes the worst-case expected objective over an uncertainty set including all distributions closed enough to a nominal distribution with respect to the Wasserstein distance. We develop a stochastic gradient descent algorithm based on the idea of data augmentation to solve the model efficiently. The algorithm iteratively a) does a bootstrapping sample from the nominal distribution; b) perturbs the adversarially and c) updates decisions. Accordingly, the computational time of the algorithm is only determined by the number of iterations and the complexity of computing the gradient of a single sample. Except for efficiently solving the model, the algorithm provide additional advantages that the proposed algorithm can cope with any nominal distributions and therefore is extendable to solve the problem in an online setting. We also prove that the algorithm converges to the optimal solution of the DRO model at a rate of a $O(1/\sqrt{T})$, where $T$ is the number of iterations of bootstrapping. Consequently, the performance guarantee of the algorithm is that of the DRO model plus $O(1/\sqrt{T})$. Through extensive numerical experiments, we demonstrate the superior performance of the proposed algorithm to several benchmarks. △ Less

Submitted 9 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

arXiv:2407.05564 [pdf, ps, other]

A Re-solving Heuristic for Dynamic Assortment Optimization with Knapsack Constraints

Authors: Xi Chen, Mo Liu, Yining Wang, Yuan Zhou

Abstract: In this paper, we consider a multi-stage dynamic assortment optimization problem with multi-nomial choice modeling (MNL) under resource knapsack constraints. Given the current resource inventory levels, the retailer makes an assortment decision at each period, and the goal of the retailer is to maximize the total profit from purchases. With the exact optimal dynamic assortment solution being compu… ▽ More In this paper, we consider a multi-stage dynamic assortment optimization problem with multi-nomial choice modeling (MNL) under resource knapsack constraints. Given the current resource inventory levels, the retailer makes an assortment decision at each period, and the goal of the retailer is to maximize the total profit from purchases. With the exact optimal dynamic assortment solution being computationally intractable, a practical strategy is to adopt the re-solving technique that periodically re-optimizes deterministic linear programs (LP) arising from fluid approximation. However, the fractional structure of MNL makes the fluid approximation in assortment optimization highly non-linear, which brings new technical challenges. To address this challenge, we propose a new epoch-based re-solving algorithm that effectively transforms the denominator of the objective into the constraint. Theoretically, we prove that the regret (i.e., the gap between the resolving policy and the optimal objective of the fluid approximation) scales logarithmically with the length of time horizon and resource capacities. △ Less

Submitted 7 July, 2024; originally announced July 2024.

arXiv:2407.03141 [pdf, other]

Optimal Unimodular Matching

Authors: Nathanaël Enriquez, Mike Liu, Laurent Ménard, Vianney Perchet

Abstract: We consider sequences of finite weighted random graphs that converge locally to unimodular i.i.d. weighted random trees. When the weights are atomless, we prove that the matchings of maximal weight converge locally to a matching on the limiting tree. For this purpose, we introduce and study unimodular matchings on weighted unimodular random trees as well as a notion of optimality for these objects… ▽ More We consider sequences of finite weighted random graphs that converge locally to unimodular i.i.d. weighted random trees. When the weights are atomless, we prove that the matchings of maximal weight converge locally to a matching on the limiting tree. For this purpose, we introduce and study unimodular matchings on weighted unimodular random trees as well as a notion of optimality for these objects. In this context, we prove that, in law, there is a unique optimal unimodular matching for a given unimodular tree. We then prove that this law is the local limit of the sequence of matchings of maximal weight. Along the way, we also show that this law is characterised by an equation derived from a message passing algorithm. △ Less

Submitted 28 March, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

Comments: 58 pages, 19 figures. Improved overall presentation of the paper and added references

MSC Class: 05C70; 05C82; 60C05; 60K35

arXiv:2406.06522 [pdf, other]

Multiple SLEs for $κ\in (0,8)$: Coulomb gas integrals and pure partition functions

Authors: Yu Feng, Mingchang Liu, Eveliina Peltola, Hao Wu

Abstract: In this article, we give an explicit relationship of SLE partition functions with Coulomb gas formalism of conformal field theory. We first construct a family of SLE${}_κ$ partition functions as Coulomb gas integrals and derive their various properties. In accordance with an interpretation as probabilistic correlations in loop $O(n)$ models, they are always positive when $κ\in (8/3,8)$, while they… ▽ More In this article, we give an explicit relationship of SLE partition functions with Coulomb gas formalism of conformal field theory. We first construct a family of SLE${}_κ$ partition functions as Coulomb gas integrals and derive their various properties. In accordance with an interpretation as probabilistic correlations in loop $O(n)$ models, they are always positive when $κ\in (8/3,8)$, while they may have zeroes for $κ\leq 8/3$. They also admit a Fröbenius series expansion that matches with the algebraic content from CFT. Moreover, we check that at the first level of fusion, they have logarithmic asymptotic behavior when $κ=8/3$ and $κ=8$, in accordance with logarithmic minimal models $M(2,1)$ and $M(2,3)$, respectively. Second, we construct SLE${}_κ$ pure partition functions and show that they are continuous in $κ\in (0,8)$ and they decay to zero as a polynomial of $(8-κ)$ when $κ\to 8$. We explicitly relate the Coulomb gas integrals and pure partition functions together in terms of the meander matrix. As a by-product, our results yield a construction of global non-simple multiple chordal SLE${}_κ$ measures ($κ\in (4,8)$) uniquely determined by their re-sampling property. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 104 pages

arXiv:2404.16794 [pdf, other]

Structure-Preserving Oscillation-Eliminating Discontinuous Galerkin Schemes for Ideal MHD Equations: Locally Divergence-Free and Positivity-Preserving

Authors: Mengqing Liu, Kailiang Wu

Abstract: Numerically simulating magnetohydrodynamics (MHD) poses notable challenges, including the suppression of spurious oscillations near discontinuities (e.g., shocks) and preservation of essential physical structures (e.g., the divergence-free constraint of magnetic field and the positivity of density and pressure). This paper develops structure-preserving oscillation-eliminating discontinuous Galerki… ▽ More Numerically simulating magnetohydrodynamics (MHD) poses notable challenges, including the suppression of spurious oscillations near discontinuities (e.g., shocks) and preservation of essential physical structures (e.g., the divergence-free constraint of magnetic field and the positivity of density and pressure). This paper develops structure-preserving oscillation-eliminating discontinuous Galerkin (OEDG) schemes for ideal MHD. The schemes leverage a locally divergence-free (LDF) oscillation-eliminating (OE) procedure to suppress spurious oscillations while retaining the LDF property of magnetic field and many desirable attributes of original DG schemes, such as conservation, local compactness, and optimal convergence rates. The OE procedure is based on the solution operator of a novel damping equation, a linear system of ordinary differential equations that are exactly solvable without any discretization. The OE procedure is performed after each Runge-Kutta stage and does not impact DG spatial discretization, facilitating its easy integration into existing DG codes as an independent module. Moreover, this paper presents a rigorous positivity-preserving (PP) analysis of the LDF OEDG schemes on Cartesian meshes, utilizing the optimal convex decomposition technique and the geometric quasi-linearization (GQL) approach. Efficient PP LDF OEDG schemes are derived by incorporating appropriate discretization of Godunov-Powell source terms into only the discrete equations of cell averages, under a condition achievable through a simple PP limiter. Several one- and two-dimensional MHD tests verify the accuracy, effectiveness, and robustness of the proposed structure-preserving OEDG schemes. △ Less

Submitted 2 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

Comments: 55 pages

arXiv:2404.16324 [pdf, other]

Improved impedance inversion by the iterated graph Laplacian

Authors: Davide Bianchi, Florian Bossmann, Wenlong Wang, Mingming Liu

Abstract: We introduce a data-adaptive inversion method that integrates classical or deep learning-based approaches with iterative graph Laplacian regularization, specifically targeting acoustic impedance inversion - a critical task in seismic exploration. Our method initiates from an impedance estimate derived using either traditional inversion techniques or neural network-based methods. This initial estim… ▽ More We introduce a data-adaptive inversion method that integrates classical or deep learning-based approaches with iterative graph Laplacian regularization, specifically targeting acoustic impedance inversion - a critical task in seismic exploration. Our method initiates from an impedance estimate derived using either traditional inversion techniques or neural network-based methods. This initial estimate guides the construction of a graph Laplacian operator, effectively capturing structural characteristics of the impedance profile. Utilizing a Tikhonov-inspired variational framework with this graph-informed prior, our approach iteratively updates and refines the impedance estimate while continuously recalibrating the graph Laplacian. This iterative refinement shows rapid convergence, increased accuracy, and enhanced robustness to noise compared to initial reconstructions alone. Extensive validation performed on synthetic and real seismic datasets across varying noise levels confirms the effectiveness of our method. Performance evaluations include four initial inversion methods: two classical techniques and two neural networks - previously established in the literature. △ Less

Submitted 15 April, 2025; v1 submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16284 [pdf, ps, other]

$L^p$-regularity of a geometrically nonlinear system in supercritical dimensions

Authors: Chang-Yu Guo, Chang-Lin Xiang, Ming-Lun Liu

Abstract: In a recent work, Gastel and Neff introduced an interesting system from a geometrically nonlinear flat cosserat micropolar model and established interior regularity in the critical dimension. Inspired by their work on this flat Cosserat model, in this article, we establish both interior regularity and sharp $L^p$ regularity for their system in supercritical dimensions. In a recent work, Gastel and Neff introduced an interesting system from a geometrically nonlinear flat cosserat micropolar model and established interior regularity in the critical dimension. Inspired by their work on this flat Cosserat model, in this article, we establish both interior regularity and sharp $L^p$ regularity for their system in supercritical dimensions. △ Less

Submitted 13 October, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

Comments: Sci. China Math., to appear 2024

MSC Class: 35B65; 35J47; 35G50

arXiv:2404.09372 [pdf, other]

Word-length curve counting on the once-punctured torus

Authors: David Fisac, Mingkun Liu

Abstract: We classify closed curves on a once-punctured torus with a single self-intersection from a combinatorial perspective. We determine the number of closed curves with given word-length and with zero, one, and arbitrary self-intersections. We classify closed curves on a once-punctured torus with a single self-intersection from a combinatorial perspective. We determine the number of closed curves with given word-length and with zero, one, and arbitrary self-intersections. △ Less

Submitted 23 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

Comments: 24 pages, 7 figures

MSC Class: Primary: 57K20. Secondary: 05A05; 68R15

arXiv:2404.07666 [pdf, ps, other]

On the elliptic harmonic mappings and sense-preserving harmonic mappings

Authors: Ming-Sheng Liu, Hao XU

Abstract: In this paper, we first establish two versions of Landau-Bloch type theorem for $(K,K')$-elliptic harmonic mappings with a bounded minimum distortion. Next, we provide several coefficient estimates and a conjecture for $(K,K')$-elliptic harmonic mappings. Then, we establish three new versions of Landau-Bloch type theorem for sense-preserving harmonic mappings. Finally, we establish two sharp versi… ▽ More In this paper, we first establish two versions of Landau-Bloch type theorem for $(K,K')$-elliptic harmonic mappings with a bounded minimum distortion. Next, we provide several coefficient estimates and a conjecture for $(K,K')$-elliptic harmonic mappings. Then, we establish three new versions of Landau-Bloch type theorem for sense-preserving harmonic mappings. Finally, we establish two sharp versions of Landau-Bloch type theorem for certain harmonic mappings. These results are sharp in some given cases and improve the related results of different authors. △ Less

Submitted 11 April, 2024; originally announced April 2024.

Comments: 17 pages, this article is with a journal since Oct. 2023

MSC Class: Primary 30C50; 31A05; Secondary 32A18; 30C62; 33E05

arXiv:2404.05064 [pdf, other]

A Structure-Guided Gauss-Newton Method for Shallow ReLU Neural Network

Authors: Zhiqiang Cai, Tong Ding, Min Liu, Xinyu Liu, Jianlin Xia

Abstract: In this paper, we propose a structure-guided Gauss-Newton (SgGN) method for solving least squares problems using a shallow ReLU neural network. The method effectively takes advantage of both the least squares structure and the neural network structure of the objective function. By categorizing the weights and biases of the hidden and output layers of the network as nonlinear and linear parameters,… ▽ More In this paper, we propose a structure-guided Gauss-Newton (SgGN) method for solving least squares problems using a shallow ReLU neural network. The method effectively takes advantage of both the least squares structure and the neural network structure of the objective function. By categorizing the weights and biases of the hidden and output layers of the network as nonlinear and linear parameters, respectively, the method iterates back and forth between the nonlinear and linear parameters. The nonlinear parameters are updated by a damped Gauss-Newton method and the linear ones are updated by a linear solver. Moreover, at the Gauss-Newton step, a special form of the Gauss-Newton matrix is derived for the shallow ReLU neural network and is used for efficient iterations. It is shown that the corresponding mass and Gauss-Newton matrices in the respective linear and nonlinear steps are symmetric and positive definite under reasonable assumptions. Thus, the SgGN method naturally produces an effective search direction without the need of additional techniques like shifting in the Levenberg-Marquardt method to achieve invertibility of the Gauss-Newton matrix. The convergence and accuracy of the method are demonstrated numerically for several challenging function approximation problems, especially those with discontinuities or sharp transition layers that pose significant challenges for commonly used training algorithms in machine learning. △ Less

Submitted 7 April, 2024; originally announced April 2024.

MSC Class: 65D15; 65K10

Showing 1–50 of 321 results for author: Liu, M