Showing 1–50 of 75 results for author: Nguyen, H L

  1. arXiv:2404.10201  [pdf, other]

    cs.DS cs.CR cs.IT cs.LG

    Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages

    Authors: Hilal Asi, Vitaly Feldman, Jelani Nelson, Huy L. Nguyen, Kunal Talwar, Samson Zhou

    Abstract: We study the problem of private vector mean estimation in the shuffle model of privacy where $n$ users each have a unit vector $v^{(i)} \in\mathbb{R}^d$. We propose a new multi-message protocol that achieves the optimal error using $\tilde{\mathcal{O}}\left(\min(n\varepsilon^2,d)\right)$ messages per user. Moreover, we show that any (unbiased) protocol that achieves optimal error requires each use…

    Submitted 25 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Fixed author ordering

  2. arXiv:2312.07535  [pdf, other]

    cs.DS cs.LG

    Improved Frequency Estimation Algorithms with and without Predictions

    Authors: Anders Aamand, Justin Y. Chen, Huy Lê Nguyen, Sandeep Silwal, Ali Vakilian

    Abstract: Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketch…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  3. arXiv:2309.12668  [pdf, other]

    cs.RO

    UWA360CAM: A 360$^{\circ}$ 24/7 Real-Time Streaming Camera System for Underwater Applications

    Authors: Quan-Dung Pham, Yipeng Zhu, Tan-Sang Ha, K. H. Long Nguyen, Binh-Son Hua, Sai-Kit Yeung

    Abstract: An omnidirectional camera is a cost-effective and information-rich sensor highly suitable for many marine applications and the ocean scientific community, encompassing several domains such as augmented reality, mapping, motion estimation, visual surveillance, and simultaneous localization and mapping. However, designing and constructing such a high-quality 360$^{\circ}$ real-time streaming camera sys…

    Submitted 30 September, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  4. arXiv:2307.09069  [pdf, other]

    cs.CR

    Mitigating Intersection Attacks in Anonymous Microblogging

    Authors: Sarah Abdelwahab Gaballah, Thanh Hoang Long Nguyen, Lamya Abdullah, Ephraim Zimmer, Max Mühlhäuser

    Abstract: Anonymous microblogging systems are known to be vulnerable to intersection attacks due to network churn. An adversary that monitors all communications can leverage the churn to learn who is publishing what with increasing confidence over time. In this paper, we propose a protocol for mitigating intersection attacks in anonymous microblogging systems by grouping users into anonymity sets based on s…

    Submitted 18 July, 2023; originally announced July 2023.

  5. arXiv:2306.07298  [pdf, other]

    cs.HC cs.AI

    Referring to Screen Texts with Voice Assistants

    Authors: Shruti Bhargava, Anand Dhoot, Ing-Marie Jonsson, Hoang Long Nguyen, Alkesh Patel, Hong Yu, Vincent Renkens

    Abstract: Voice assistants help users make phone calls, send messages, create events, navigate, and do a lot more. However, assistants have limited capacity to understand their users' context. In this work, we aim to take a step in this direction. Our work dives into a new experience for users to refer to phone numbers, addresses, email addresses, URLs, and dates on their phone screens. Our focus lies in re…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: 7 pages, Accepted to ACL Industry Track 2023

  6. arXiv:2306.04444  [pdf, other]

    cs.LG cs.CR stat.ML

    Fast Optimal Locally Private Mean Estimation via Random Projections

    Authors: Hilal Asi, Vitaly Feldman, Jelani Nelson, Huy L. Nguyen, Kunal Talwar

    Abstract: We study the problem of locally private mean estimation of high-dimensional vectors in the Euclidean ball. Existing algorithms for this problem either incur sub-optimal error or have high communication and/or run-time complexity. We propose a new algorithmic framework, ProjUnit, for private mean estimation that yields algorithms that are computationally efficient, have low communication complexity…

    Submitted 26 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Added the correct github link

  7. arXiv:2305.16013  [pdf, other]

    cs.DS cs.LG

    Online and Streaming Algorithms for Constrained $k$-Submodular Maximization

    Authors: Fabian Spaeh, Alina Ene, Huy L. Nguyen

    Abstract: Constrained $k$-submodular maximization is a general framework that captures many discrete optimization problems such as ad allocation, influence maximization, personalized recommendation, and many others. In many of these applications, datasets are large or decisions need to be made in an online manner, which motivates the development of efficient streaming and online algorithms. In this work, we…

    Submitted 25 May, 2023; originally announced May 2023.

  8. arXiv:2303.14582  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Identification of Negative Transfers in Multitask Learning Using Surrogate Models

    Authors: Dongyue Li, Huy L. Nguyen, Hongyang R. Zhang

    Abstract: Multitask learning is widely used in practice to train a low-resource target task by augmenting it with multiple related source tasks. Yet, naively combining all the source tasks with a target task does not always improve the prediction performance for the target task due to negative transfers. Thus, a critical problem in multitask learning is identifying subsets of source tasks that would benefit…

    Submitted 27 December, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

    Comments: 30 pages. Appeared in TMLR'23

  9. arXiv:2303.01453  [pdf, other]

    cs.DS cs.LG

    Improved Space Bounds for Learning with Experts

    Authors: Anders Aamand, Justin Y. Chen, Huy Lê Nguyen, Sandeep Silwal

    Abstract: We give improved tradeoffs between space and regret for the online learning with expert advice problem over $T$ days with $n$ experts. Given a space budget of $n^\delta$ for $\delta\in (0,1)$, we provide an algorithm achieving regret $\tilde{O}(n^2 T^{1/(1+\delta)})$, improving upon the regret bound $\tilde{O}(n^2 T^{2/(2+\delta)})$ in the recent work of [PZ23]. The improvement is particularly salient in the regime…

    Submitted 2 March, 2023; originally announced March 2023.

  10. arXiv:2302.14843  [pdf, ps, other]

    math.OC cs.DS cs.LG

    High Probability Convergence of Stochastic Gradient Methods

    Authors: Zijian Liu, Ta Duy Nguyen, Thien Hang Nguyen, Alina Ene, Huy Lê Nguyen

    Abstract: In this work, we describe a generic approach to show convergence with high probability for both stochastic convex and non-convex optimization with sub-Gaussian noise. In previous works for convex optimization, either the convergence is only in expectation or the bound depends on the diameter of the domain. Instead, we show high probability convergence with bounds depending on the initial distance…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: This paper subsumes arXiv paper arxiv:2210.00679

  11. arXiv:2210.17028  [pdf, other]

    cs.LG

    Improved Learning-augmented Algorithms for k-means and k-medians Clustering

    Authors: Thy Nguyen, Anamay Chaturvedi, Huy Lê Nguyen

    Abstract: We consider the problem of clustering in the learning-augmented setting, where we are given a data set in $d$-dimensional Euclidean space, and a label for each data point given by an oracle indicating what subsets of points should be clustered together. This setting captures situations where we have access to some auxiliary information about the data set relevant for our clustering objective, for…

    Submitted 1 March, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

  12. arXiv:2210.14315  [pdf, ps, other]

    cs.LG cs.CR cs.DS stat.ML

    Streaming Submodular Maximization with Differential Privacy

    Authors: Anamay Chaturvedi, Huy Lê Nguyen, Thy Nguyen

    Abstract: In this work, we study the problem of privately maximizing a submodular function in the streaming setting. Extensive work has been done on privately maximizing submodular functions in the general case when the function depends upon the private data of individuals. However, when the size of the data stream drawn from the domain of the objective function is large or arrives very fast, one must priva…

    Submitted 25 October, 2022; originally announced October 2022.

  13. arXiv:2210.00679  [pdf, ps, other]

    math.OC cs.DS cs.LG

    High Probability Convergence for Accelerated Stochastic Mirror Descent

    Authors: Alina Ene, Huy L. Nguyen

    Abstract: In this work, we describe a generic approach to show convergence with high probability for stochastic convex optimization. In previous works, either the convergence is only in expectation or the bound depends on the diameter of the domain. Instead, we show high probability convergence with bounds depending on the initial distance to the optimal solution as opposed to the domain diameter. The algor…

    Submitted 2 October, 2022; originally announced October 2022.

  14. arXiv:2209.14853  [pdf, other]

    cs.LG cs.DS

    META-STORM: Generalized Fully-Adaptive Variance Reduced SGD for Unbounded Functions

    Authors: Zijian Liu, Ta Duy Nguyen, Thien Hang Nguyen, Alina Ene, Huy L. Nguyen

    Abstract: We study the application of variance reduction (VR) techniques to general non-convex stochastic optimization problems. In this setting, the recent work STORM [Cutkosky-Orabona '19] overcomes the drawback of having to compute gradients of "mega-batches" that earlier VR methods rely on. There, STORM utilizes recursive momentum to achieve the VR effect and is then later made fully adaptive in STORM+…

    Submitted 29 September, 2022; originally announced September 2022.

  15. arXiv:2209.14827  [pdf, other]

    cs.LG cs.DS

    On the Convergence of AdaGrad(Norm) on $\R^{d}$: Beyond Convexity, Non-Asymptotic Rate and Acceleration

    Authors: Zijian Liu, Ta Duy Nguyen, Alina Ene, Huy L. Nguyen

    Abstract: Existing analysis of AdaGrad and other adaptive methods for smooth convex optimization is typically for functions with bounded domain diameter. In unconstrained problems, previous works guarantee an asymptotic convergence rate without an explicit constant factor that holds true for the entire function class. Furthermore, in the stochastic setting, only a modified version of AdaGrad, different from…

    Submitted 4 October, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Updated manuscript from ICLR 2023 with fixed typos

  16. arXiv:2209.11817  [pdf, other]

    cs.LG cs.DS

    An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret

    Authors: Matthew Jones, Huy Lê Nguyen, Thy Nguyen

    Abstract: Recently a multi-agent variant of the classical multi-armed bandit was proposed to tackle fairness issues in online learning. Inspired by a long line of work in social choice and economics, the goal is to optimize the Nash social welfare instead of the total utility. Unfortunately previous algorithms either are not efficient or achieve sub-optimal regret in terms of the number of rounds $T$. We pr…

    Submitted 23 September, 2022; originally announced September 2022.

  17. arXiv:2207.11337  [pdf, other]

    cs.DS

    Fair Range k-center

    Authors: Huy Lê Nguyen, Thy Nguyen, Matthew Jones

    Abstract: We study the problem of fairness in k-centers clustering on data with disjoint demographic groups. Specifically, this work proposes a variant of fairness which restricts each group's number of centers with both a lower bound (minority-protection) and an upper bound (restricted-domination), and provides both an offline and one-pass streaming algorithm for the problem. In the special case where the…

    Submitted 22 July, 2022; originally announced July 2022.

  18. arXiv:2204.09583  [pdf, other]

    cs.LG

    Improved Group Robustness via Classifier Retraining on Independent Splits

    Authors: Thien Hang Nguyen, Hongyang R. Zhang, Huy Le Nguyen

    Abstract: Deep neural networks trained by minimizing the average risk can achieve strong average performance. Still, their performance for a subgroup may degrade if the subgroup is underrepresented in the overall data population. Group distributionally robust optimization (Sagawa et al., 2020a), or group DRO in short, is a widely used baseline for learning models with strong worst-group performance. We note…

    Submitted 28 July, 2023; v1 submitted 20 April, 2022; originally announced April 2022.

  19. arXiv:2203.00194  [pdf, other]

    cs.CR cs.DS cs.LG

    Private Frequency Estimation via Projective Geometry

    Authors: Vitaly Feldman, Jelani Nelson, Huy Lê Nguyen, Kunal Talwar

    Abstract: In this work, we propose a new algorithm ProjectiveGeometryResponse (PGR) for locally differentially private (LDP) frequency estimation. For a universe size of $k$ and with $n$ users, our $\varepsilon$-LDP algorithm has communication cost $\lceil\log_2k\rceil$ bits in the private coin setting and $\varepsilon\log_2 e + O(1)$ in the public coin setting, and has computation cost…

    Submitted 28 February, 2022; originally announced March 2022.

  20. BeDivFuzz: Integrating Behavioral Diversity into Generator-based Fuzzing

    Authors: Hoang Lam Nguyen, Lars Grunske

    Abstract: A popular metric to evaluate the performance of fuzzers is branch coverage. However, we argue that focusing solely on covering many different branches (i.e., the richness) is not sufficient since the majority of the covered branches may have been exercised only once, which does not inspire a high confidence in the reliability of the covered code. Instead, the distribution of the executed branches…

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: To appear in the proceedings of the 44th International Conference on Software Engineering (ICSE 2022)

  21. arXiv:2201.12302  [pdf, other]

    math.OC cs.DS cs.LG

    Adaptive Accelerated (Extra-)Gradient Methods with Variance Reduction

    Authors: Zijian Liu, Ta Duy Nguyen, Alina Ene, Huy L. Nguyen

    Abstract: In this paper, we study the finite-sum convex optimization problem focusing on the general convex case. Recently, the study of variance reduced (VR) methods and their accelerated variants has made exciting progress. However, the step size used in the existing VR algorithms typically depends on the smoothness parameter, which is often unknown and requires tuning in practice. To address this problem…

    Submitted 28 January, 2022; originally announced January 2022.

  22. arXiv:2108.01208  [pdf, other]

    cs.CL cs.SD eess.AS

    User-Initiated Repetition-Based Recovery in Multi-Utterance Dialogue Systems

    Authors: Hoang Long Nguyen, Vincent Renkens, Joris Pelemans, Srividya Pranavi Potharaju, Anil Kumar Nalamalapu, Murat Akbacak

    Abstract: Recognition errors are common in human communication. Similar errors often lead to unwanted behaviour in dialogue systems or virtual assistants. In human communication, we can recover from them by repeating misrecognized words or phrases; however in human-machine communication this recovery mechanism is not available. In this paper, we attempt to bridge this gap and present a system that allows a…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Will be published in Interspeech 2021

  23. arXiv:2106.09170  [pdf, other]

    cs.LG

    A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams

    Authors: Heitor Murilo Gomes, Maciej Grzenda, Rodrigo Mello, Jesse Read, Minh Huong Le Nguyen, Albert Bifet

    Abstract: Unlabelled data appear in many domains and are particularly relevant to streaming applications, where even though data is abundant, labelled data is rare. To address the learning problems associated with such data, one can ignore the unlabelled data and focus only on the labelled data (supervised learning); use the labelled data and attempt to leverage the unlabelled data (semi-supervised learning…

    Submitted 16 June, 2021; originally announced June 2021.

  24. arXiv:2105.15007  [pdf, ps, other]

    cs.DS cs.CR cs.LG

    Locally Private $k$-Means Clustering with Constant Multiplicative Approximation and Near-Optimal Additive Error

    Authors: Anamay Chaturvedi, Matthew Jones, Huy L. Nguyen

    Abstract: Given a data set of size $n$ in $d'$-dimensional Euclidean space, the $k$-means problem asks for a set of $k$ points (called centers) so that the sum of the $\ell_2^2$-distances between points of a given data set of size $n$ and the set of $k$ centers is minimized. Recent work on this problem in the locally private setting achieves constant multiplicative approximation with additive error…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: 61 pages

  25. arXiv:2103.12564  [pdf, other]

    cs.NE cs.AI cs.LG

    Linear Constraints Learning for Spiking Neurons

    Authors: Huy Le Nguyen, Dominique Chu

    Abstract: We introduce a new supervised learning algorithm to train spiking neural networks for classification. The algorithm overcomes a limitation of existing multi-spike learning methods: it solves the problem of interference between interacting output spikes during a learning trial. This problem of learning interference causes learning performance in existing approaches to decrease as the number o…

    Submitted 11 August, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: 35 pages, 11 figures

  26. arXiv:2103.02420  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Multi-view Audio and Music Classification

    Authors: Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Lam Pham, Philipp Koch, Ian McLoughlin, Alfred Mertins

    Abstract: We propose in this work a multi-view learning approach for audio and music classification. Considering four typical low-level representations (i.e. different views) commonly used for audio and music recognition tasks, the proposed multi-view network consists of four subnetworks, each handling one input type. The learned embeddings in the subnetworks are then concatenated to form the multi-view emb…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted to ICASSP 2021

  27. arXiv:2102.07684   

    cs.DS cs.LG

    Fair and Optimal Cohort Selection for Linear Utilities

    Authors: Konstantina Bairaktari, Huy Le Nguyen, Jonathan Ullman

    Abstract: The rise of algorithmic decision-making has created an explosion of research around the fairness of those algorithms. While there are many compelling notions of individual fairness, beginning with the work of Dwork et al., these notions typically do not satisfy desirable composition properties. To this end, Dwork and Ilvento introduced the fair cohort selection problem, which captures a specific a…

    Submitted 5 October, 2022; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: This paper has been subsumed by the arXiv paper arXiv:2009.02207

  28. arXiv:2012.12138  [pdf, ps, other]

    cs.LG cs.CR cs.DS math.OC

    Projection-Free Bandit Optimization with Privacy Guarantees

    Authors: Alina Ene, Huy L. Nguyen, Adrian Vladu

    Abstract: We design differentially private algorithms for the bandit convex optimization problem in the projection-free setting. This setting is important whenever the decision set has a complex geometry, and access to it is done efficiently only through a linear optimization oracle, hence Euclidean projections are unavailable (e.g. matroid polytope, submodular base polytope). This is the first differential…

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: Appears in AAAI-21

  29. arXiv:2012.07664  [pdf, ps, other]

    cs.LG cs.AI cs.NE

    Constraints on Hebbian and STDP learned weights of a spiking neuron

    Authors: Dominique Chu, Huy Le Nguyen

    Abstract: We analyse mathematically the constraints on weights resulting from Hebbian and STDP learning rules applied to a spiking neuron with weight normalisation. In the case of pure Hebbian learning, we find that the normalised weights equal the promotion probabilities of weights up to correction terms that depend on the learning rate and are usually small. A similar relation can be derived for STDP algo…

    Submitted 14 December, 2020; originally announced December 2020.

  30. arXiv:2011.03847  [pdf, other]

    cs.LG cs.CY cs.SI

    Google Trends Analysis of COVID-19

    Authors: Hoang Long Nguyen, Zhenhe Pan, Hashim Abu-gellban, Fang Jin, Yuanlin Zhang

    Abstract: The World Health Organization (WHO) announced that COVID-19 was a pandemic disease on the 11th of March as there were 118K cases in several countries and territories. Numerous researchers worked on forecasting the number of confirmed cases since anticipating the growth of the cases helps governments adopt knotty decisions to ease the lockdown orders for their countries. These orders help sever…

    Submitted 7 November, 2020; originally announced November 2020.

  31. arXiv:2010.16153  [pdf, other]

    cs.HC

    Time-position characterization of conflicts: a case study of collaborative editing

    Authors: Hoai Le Nguyen, Claudia-Lavinia Ignat

    Abstract: Collaborative editing (CE) has become increasingly common, and often compulsory, in academia and industry where people work in teams and are distributed across space and time. We aim to study collaborative editing behavior in terms of collaboration patterns users adopt and in terms of a characterisation of conflicts, i.e. edits from different users that occur close in time and position in the document. Th…

    Submitted 30 October, 2020; originally announced October 2020.

    Journal ref: The 26th International Conference on Collaboration Technologies and Social Computing (CollabTech 2020), Sep 2020, Tartu, Estonia

  32. arXiv:2010.09132  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Self-Attention Generative Adversarial Network for Speech Enhancement

    Authors: Huy Phan, Huy Le Nguyen, Oliver Y. Chén, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins

    Abstract: Existing generative adversarial networks (GANs) for speech enhancement solely rely on the convolution operation, which may obscure temporal dependencies across the sequence input. To remedy this issue, we propose a self-attention layer adapted from non-local attention, coupled with the convolutional and deconvolutional layers of a speech enhancement GAN (SEGAN) using raw signal input. Further, we…

    Submitted 6 February, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: 46th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2021). Source code is available at http://github.com/pquochuy/sasegan

  33. arXiv:2010.07799  [pdf, other]

    cs.LG cs.DS

    Adaptive and Universal Algorithms for Variational Inequalities with Optimal Convergence

    Authors: Alina Ene, Huy L. Nguyen

    Abstract: We develop new adaptive algorithms for variational inequalities with monotone operators, which capture many problems of interest, notably convex optimization and convex-concave saddle point problems. Our algorithms automatically adapt to unknown problem parameters such as the smoothness and the norm of the operator, and the variance of the stochastic evaluation oracle. We show that our algorithms…

    Submitted 26 August, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

  34. arXiv:2009.13317  [pdf, ps, other]

    cs.DS cs.LG

    A note on differentially private clustering with large additive error

    Authors: Huy L. Nguyen

    Abstract: In this note, we describe a simple approach to obtain a differentially private algorithm for k-clustering with nearly the same multiplicative factor as any non-private counterpart at the cost of a large polynomial additive error. The approach is the combination of a simple geometric observation independent of privacy consideration and any existing private algorithm with a constant approximation.

    Submitted 28 September, 2020; originally announced September 2020.

  35. arXiv:2009.02207  [pdf, ps, other]

    cs.DS cs.LG

    Fair and Useful Cohort Selection

    Authors: Konstantina Bairaktari, Paul Langton, Huy L. Nguyen, Niklas Smedemark-Margulies, Jonathan Ullman

    Abstract: A challenge in fair algorithm design is that, while there are compelling notions of individual fairness, these notions typically do not satisfy desirable composition properties, and downstream applications based on fair classifiers might not preserve fairness. To study fairness under composition, Dwork and Ilvento introduced an archetypal problem called fair-cohort-selection problem, where a singl…

    Submitted 6 April, 2022; v1 submitted 4 September, 2020; originally announced September 2020.

    Comments: This is a merger of the previous version and arXiv:2102.07684

  36. arXiv:2008.12388  [pdf, ps, other]

    cs.DS cs.LG

    Differentially Private Clustering via Maximum Coverage

    Authors: Matthew Jones, Huy Lê Nguyen, Thy Nguyen

    Abstract: This paper studies the problem of clustering in metric spaces while preserving the privacy of individual data. Specifically, we examine differentially private variants of the k-medians and Euclidean k-means problems. We present polynomial algorithms with constant multiplicative error and lower additive error than the previous state-of-the-art for each problem. Additionally, our algorithms use a cl…

    Submitted 27 August, 2020; originally announced August 2020.

  37. arXiv:2007.08840  [pdf, other]

    cs.LG cs.DS math.OC stat.ML

    Adaptive Gradient Methods for Constrained Convex Optimization and Variational Inequalities

    Authors: Alina Ene, Huy L. Nguyen, Adrian Vladu

    Abstract: We provide new adaptive first-order methods for constrained convex optimization. Our main algorithms AdaACSA and AdaAGD+ are accelerated methods, which are universal in the sense that they achieve nearly-optimal convergence rates for both smooth and non-smooth functions, even when they only have access to stochastic gradients. In addition, they do not require any prior knowledge on how the objecti…

    Submitted 15 February, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: Full version of AAAI-21 paper. The current version adds an experimental evaluation and revises the exposition

  38. arXiv:1911.12959  [pdf, ps, other]

    cs.DS

    Optimal Streaming Algorithms for Submodular Maximization with Cardinality Constraints

    Authors: Naor Alaluf, Alina Ene, Moran Feldman, Huy L. Nguyen, Andrew Suh

    Abstract: We study the problem of maximizing a non-monotone submodular function subject to a cardinality constraint in the streaming model. Our main contribution is a single-pass (semi-)streaming algorithm that uses roughly $O(k / \varepsilon^2)$ memory, where $k$ is the size constraint. At the end of the stream, our algorithm post-processes its data structure using any offline algorithm for submodular maxi…

    Submitted 10 August, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: This paper is a merger of arXiv:1906.11237 and arXiv:1911.12959

  39. arXiv:1905.13272  [pdf, other]

    cs.DS cs.LG

    Parallel Algorithm for Non-Monotone DR-Submodular Maximization

    Authors: Alina Ene, Huy L. Nguyen

    Abstract: In this work, we give a new parallel algorithm for the problem of maximizing a non-monotone diminishing returns submodular function subject to a cardinality constraint. For any desired accuracy $\epsilon$, our algorithm achieves a $1/e - \epsilon$ approximation using $O(\log{n} \log(1/\epsilon) / \epsilon^3)$ parallel rounds of function evaluations. The approximation guarantee nearly matches the best approximation guarantee…

    Submitted 30 May, 2019; originally announced May 2019.

  40. arXiv:1904.04129  [pdf, ps, other]

    cs.DS

    A note on Cunningham's algorithm for matroid intersection

    Authors: Huy L. Nguyen

    Abstract: In the matroid intersection problem, we are given two matroids of rank $r$ on a common ground set $E$ of $n$ elements and the goal is to find the maximum set that is independent in both matroids. In this note, we show that Cunningham's algorithm for matroid intersection can be implemented to use $O(nr\log^2(r))$ independence oracle calls.

    Submitted 8 April, 2019; originally announced April 2019.

  41. arXiv:1902.09009  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Efficient Private Algorithms for Learning Large-Margin Halfspaces

    Authors: Huy L. Nguyen, Jonathan Ullman, Lydia Zakynthinou

    Abstract: We present new differentially private algorithms for learning a large-margin halfspace. In contrast to previous algorithms, which are based on either differentially private simulations of the statistical query model or on private convex optimization, the sample complexity of our algorithms depends only on the margin of the data, and not on the dimension. We complement our results with a lower boun…

    Submitted 23 February, 2020; v1 submitted 24 February, 2019; originally announced February 2019.

    Comments: changed title, added references and remarks

  42. arXiv:1812.01591  [pdf, ps, other]

    cs.DS cs.LG

    A Parallel Double Greedy Algorithm for Submodular Maximization

    Authors: Alina Ene, Huy L. Nguyen, Adrian Vladu

    Abstract: We study parallel algorithms for the problem of maximizing a non-negative submodular function. Our main result is an algorithm that achieves a nearly-optimal $1/2 -\epsilon$ approximation using $O(\log(1/\epsilon) / \epsilon)$ parallel rounds of function evaluations. Our algorithm is based on a continuous variant of the double greedy algorithm of Buchbinder et al. that achieves the optimal $1/2$ approximation in the s…

    Submitted 4 December, 2018; originally announced December 2018.

  43. arXiv:1811.07464  [pdf, other]

    cs.DS

    Towards Nearly-linear Time Algorithms for Submodular Maximization with a Matroid Constraint

    Authors: Alina Ene, Huy L. Nguyen

    Abstract: We consider fast algorithms for monotone submodular maximization subject to a matroid constraint. We assume that the matroid is given as input in an explicit form, and the goal is to obtain the best possible running times for important matroids. We develop a new algorithm for a \emph{general matroid constraint} with a $1 - 1/e - \epsilon$ approximation that achieves a fast running time provided we have a…

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: There is text overlap with an earlier version arXiv:1709.09767v2. That version has been replaced by a paper with only the result for a knapsack constraint, and this paper has the results for matroid constraints

  44. arXiv:1808.09987  [pdf, ps, other]

    cs.DS

    Submodular Maximization with Matroid and Packing Constraints in Parallel

    Authors: Alina Ene, Huy L. Nguyen, Adrian Vladu

    Abstract: We consider the problem of maximizing the multilinear extension of a submodular function subject to a single matroid constraint or multiple packing constraints with a small number of adaptive rounds of evaluation queries. We obtain the first algorithms with low adaptivity for submodular maximization with a matroid constraint. Our algorithms achieve a $1-1/e-\epsilon$ approximation for monotone functions a…

    Submitted 8 November, 2018; v1 submitted 29 August, 2018; originally announced August 2018.

  45. arXiv:1805.08356  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Improved Algorithms for Collaborative PAC Learning

    Authors: Huy L. Nguyen, Lydia Zakynthinou

    Abstract: We study a recent model of collaborative PAC learning where $k$ players with $k$ different tasks collaborate to learn a single classifier that works for all tasks. Previous work showed that when there is a classifier that has very small error on all tasks, there is a collaborative algorithm that finds a single classifier for all tasks and has $O((\ln (k))^2)$ times the worst-case sample complexity…

    Submitted 30 October, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

  46. arXiv:1804.06705  [pdf, other]

    cs.CL

    Alquist: The Alexa Prize Socialbot

    Authors: Jan Pichl, Petr Marek, Jakub Konrád, Martin Matulík, Hoang Long Nguyen, Jan Šedivý

    Abstract: This paper describes Alquist, a new open-domain dialogue system developed as part of the Alexa Prize competition for the Amazon Echo line of products. The Alquist dialogue system is designed to conduct a coherent and engaging conversation on popular topics. We present a hybrid system combining several machine learning and rule-based approaches. We discuss and describe the Alquist pipeline, d…

    Submitted 18 April, 2018; originally announced April 2018.

  47. arXiv:1804.05379  [pdf, ps, other]

    cs.DS

    Submodular Maximization with Nearly-optimal Approximation and Adaptivity in Nearly-linear Time

    Authors: Alina Ene, Huy L. Nguyen

    Abstract: In this paper, we study the tradeoff between the approximation guarantee and adaptivity for the problem of maximizing a monotone submodular function subject to a cardinality constraint. The adaptivity of an algorithm is the number of sequential rounds of queries it makes to the evaluation oracle of the function, where in every round the algorithm is allowed to make polynomially-many parallel queri…

    Submitted 30 October, 2018; v1 submitted 15 April, 2018; originally announced April 2018.

  48. Shadow Symbolic Execution with Java PathFinder

    Authors: Yannic Noller, Hoang Lam Nguyen, Minxing Tang, Timo Kehrer

    Abstract: Regression testing ensures that an evolving software system still performs correctly and that changes introduce no unintended side effects. However, creating regression test cases that expose divergent behavior takes considerable effort. One solution is shadow symbolic execution, originally implemented on top of KLEE for programs written in C, which takes a unified version of th…

    Submitted 5 February, 2018; originally announced February 2018.

    Comments: 5 pages, Java PathFinder Workshop 2017, ACM SIGSOFT Software Engineering Notes

    Journal ref: SIGSOFT Softw. Eng. Notes 42, 4 (January 2018), 1-5

  49. arXiv:1709.09767  [pdf, ps, other]

    cs.DS

    A Nearly-linear Time Algorithm for Submodular Maximization with a Knapsack Constraint

    Authors: Alina Ene, Huy L. Nguyen

    Abstract: We consider the problem of maximizing a monotone submodular function subject to a knapsack constraint. Our main contribution is an algorithm that achieves a nearly-optimal, $1 - 1/e - ε$ approximation, using $(1/ε)^{O(1/ε^4)} n \log^2{n}$ function evaluations and arithmetic operations. Our algorithm is impractical but theoretically interesting, since it overcomes a fundamental running time bottlen…

    Submitted 18 November, 2018; v1 submitted 27 September, 2017; originally announced September 2017.

    Comments: The matroid results included in v2 are now part of a separate arXiv paper

  50. arXiv:1703.01830  [pdf, ps, other]

    cs.LG cs.DS

    Decomposable Submodular Function Minimization: Discrete and Continuous

    Authors: Alina Ene, Huy L. Nguyen, László A. Végh

    Abstract: This paper investigates connections between discrete and continuous approaches for decomposable submodular function minimization. We provide improved running time estimates for the state-of-the-art continuous algorithms for the problem using combinatorial arguments. We also provide a systematic experimental comparison of the two types of methods, based on a clear distinction between level-0 and le…

    Submitted 6 March, 2017; originally announced March 2017.