Skip to main content

Showing 1–50 of 114 results for author: Nelson, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.20612  [pdf, ps, other

    cs.CV cs.CL cs.LG

    Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models

    Authors: Peter Robicheaux, Matvei Popov, Anish Madan, Isaac Robinson, Joseph Nelson, Deva Ramanan, Neehar Peri

    Abstract: Vision-language models (VLMs) trained on internet-scale data achieve remarkable zero-shot detection performance on common objects like car, truck, and pedestrian. However, state-of-the-art models still struggle to generalize to out-of-distribution classes, tasks and imaging modalities not typically found in their pre-training. Rather than simply re-training VLMs on more visual data, we argue that… ▽ More

    Submitted 16 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: The first two authors contributed equally. Project Page: https://rf100-vl.org/

  2. My CXL Pool Obviates Your PCIe Switch

    Authors: Yuhong Zhong, Daniel S. Berger, Pantea Zardoshti, Enrique Saurez, Jacob Nelson, Antonis Psistakis, Joshua Fried, Asaf Cidon

    Abstract: Pooling PCIe devices across multiple hosts offers a promising solution to mitigate stranded I/O resources, enhance device utilization, address device failures, and reduce total cost of ownership. The only viable option today are PCIe switches, which decouple PCIe devices from hosts by connecting them through a hardware switch. However, the high cost and limited flexibility of PCIe switches hinder… ▽ More

    Submitted 21 April, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

  3. arXiv:2503.07783  [pdf

    cs.AI

    Sensemaking in Novel Environments: How Human Cognition Can Inform Artificial Agents

    Authors: Robert E. Patterson, Regina Buccello-Stout, Mary E. Frame, Anna M. Maresca, Justin Nelson, Barbara Acker-Mills, Erica Curtis, Jared Culbertson, Kevin Schmidt, Scott Clouse, Steve Rogers

    Abstract: One of the most vital cognitive skills to possess is the ability to make sense of objects, events, and situations in the world. In the current paper, we offer an approach for creating artificially intelligent agents with the capacity for sensemaking in novel environments. Objectives: to present several key ideas: (1) a novel unified conceptual framework for sensemaking (which includes the existenc… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 14 pages, 5 figures

    MSC Class: I.2.0

  4. arXiv:2503.05113  [pdf

    cs.CE q-bio.QM

    FOSS solution for Molecular Dynamics Simulation Automation and Collaboration with MDSGAT

    Authors: Jai Geddes Nelson, Xiaochen Liu, Ken Tye Yong

    Abstract: The process of setting up and successfully running Molecular Dynamics Simulations (MDS) is outlined to be incredibly labour and computationally expensive with a very high barrier to entry for newcomers wishing to utilise the benefits and insights of MDS. Here, presented, is a unique Free and Open-Source Software (FOSS) solution that aims to not only reduce the barrier of entry for new Molecular Dy… ▽ More

    Submitted 14 March, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

  5. arXiv:2501.17754  [pdf

    math.NA cs.RO eess.SY physics.bio-ph

    Analysis of the navigation of magnetic microrobots through cerebral bifurcations

    Authors: Pedro G. Alves, Maria Pinto, Rosa Moreira, Derick Sivakumaran, Fabian C. Landers, Maria Guix, Bradley J. Nelson, Andreas D. Flouris, Salvador Pané, Josep Puigmartí-Luis, Tiago Sotto Mayor

    Abstract: Local administration of thrombolytics in ischemic stroke could accelerate clot lysis and the ensuing reperfusion while minimizing the side effects of systemic administration. Medical microrobots could be injected into the bloodstream and magnetically navigated to the clot for administering the drugs directly to the target. The magnetic manipulation required to navigate medical microrobots will dep… ▽ More

    Submitted 29 January, 2025; originally announced January 2025.

    Journal ref: Adv. Intell. Syst. 2400993 (2025)

  6. arXiv:2501.11553  [pdf

    cs.RO cond-mat.mtrl-sci eess.SY physics.app-ph physics.bio-ph physics.med-ph

    Clinically Ready Magnetic Microrobots for Targeted Therapies

    Authors: Fabian C. Landers, Lukas Hertle, Vitaly Pustovalov, Derick Sivakumaran, Oliver Brinkmann, Kirstin Meiners, Pascal Theiler, Valentin Gantenbein, Andrea Veciana, Michael Mattmann, Silas Riss, Simone Gervasoni, Christophe Chautems, Hao Ye, Semih Sevim, Andreas D. Flouris, Josep Puigmartí-Luis, Tiago Sotto Mayor, Pedro Alves, Tessa Lühmann, Xiangzhong Chen, Nicole Ochsenbein, Ueli Moehrlen, Philipp Gruber, Miriam Weisskopf , et al. (3 additional authors not shown)

    Abstract: Systemic drug administration often causes off-target effects limiting the efficacy of advanced therapies. Targeted drug delivery approaches increase local drug concentrations at the diseased site while minimizing systemic drug exposure. We present a magnetically guided microrobotic drug delivery system capable of precise navigation under physiological conditions. This platform integrates a clinica… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

  7. arXiv:2412.01143  [pdf, other

    cs.DS

    Space Complexity of Minimum Cut Problems in Single-Pass Streams

    Authors: Matthew Ding, Alexandro Garces, Jason Li, Honghao Lin, Jelani Nelson, Vihan Shah, David P. Woodruff

    Abstract: We consider the problem of finding a minimum cut of a weighted graph presented as a single-pass stream. While graph sparsification in streams has been intensively studied, the specific application of finding minimum cuts in streams is less well-studied. To this end, we show upper and lower bounds on minimum cut problems in insertion-only streams for a variety of settings, including for both random… ▽ More

    Submitted 6 December, 2024; v1 submitted 2 December, 2024; originally announced December 2024.

    Comments: 25+3 pages, 2 figures. Accepted to ITCS 2025. v2: minor updates to author information

  8. arXiv:2411.06370  [pdf, ps, other

    cs.DS

    One Attack to Rule Them All: Tight Quadratic Bounds for Adaptive Queries on Cardinality Sketches

    Authors: Edith Cohen, Jelani Nelson, Tamás Sarlós, Mihir Singhal, Uri Stemmer

    Abstract: Cardinality sketches are compact data structures for representing sets or vectors. These sketches are space-efficient, typically requiring only logarithmic storage in the input size, and enable approximation of cardinality (or the number of nonzero entries). A crucial property in applications is \emph{composability}, meaning that the sketch of a union of sets can be computed from individual sketch… ▽ More

    Submitted 13 March, 2025; v1 submitted 10 November, 2024; originally announced November 2024.

  9. arXiv:2411.02535  [pdf, other

    quant-ph cs.CC

    Polynomial-Time Classical Simulation of Noisy Circuits with Naturally Fault-Tolerant Gates

    Authors: Jon Nelson, Joel Rajakumar, Dominik Hangleiter, Michael J. Gullans

    Abstract: We construct a polynomial-time classical algorithm that samples from the output distribution of low-depth noisy Clifford circuits with any product-state inputs and final single-qubit measurements in any basis. This class of circuits includes Clifford-magic circuits and Conjugated-Clifford circuits, which are important candidates for demonstrating quantum advantage using non-universal gates. Additi… ▽ More

    Submitted 10 December, 2024; v1 submitted 4 November, 2024; originally announced November 2024.

  10. arXiv:2405.01437  [pdf, other

    cs.GT eess.SY q-bio.PE

    Two competing populations with a common environmental resource

    Authors: Keith Paarporn, James Nelson

    Abstract: Feedback-evolving games is a framework that models the co-evolution between payoff functions and an environmental state. It serves as a useful tool to analyze many social dilemmas such as natural resource consumption, behaviors in epidemics, and the evolution of biological populations. However, it has primarily focused on the dynamics of a single population of agents. In this paper, we consider th… ▽ More

    Submitted 21 August, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  11. arXiv:2404.10201  [pdf, other

    cs.DS cs.CR cs.IT cs.LG

    Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages

    Authors: Hilal Asi, Vitaly Feldman, Jelani Nelson, Huy L. Nguyen, Kunal Talwar, Samson Zhou

    Abstract: We study the problem of private vector mean estimation in the shuffle model of privacy where $n$ users each have a unit vector $v^{(i)} \in\mathbb{R}^d$. We propose a new multi-message protocol that achieves the optimal error using $\tilde{\mathcal{O}}\left(\min(n\varepsilon^2,d)\right)$ messages per user. Moreover, we show that any (unbiased) protocol that achieves optimal error requires each use… ▽ More

    Submitted 25 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Fixed author ordering

  12. arXiv:2403.14770  [pdf, other

    cs.AR

    Beehive: A Flexible Network Stack for Direct-Attached Accelerators

    Authors: Katie Lim, Matthew Giordano, Theano Stavrinos, Irene Zhang, Jacob Nelson, Baris Kasikci, Tom Anderson

    Abstract: Direct-attached accelerators, where application accelerators are directly connected to the datacenter network via a hardware network stack, offer substantial benefits in terms of reduced latency, CPU overhead, and energy use. However, a key challenge is that modern datacenter network stacks are complex, with interleaved protocol layers, network management functions, and virtualization support. To… ▽ More

    Submitted 11 September, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: To appear at MICRO 2024

  13. arXiv:2403.00028  [pdf, ps, other

    cs.CR cs.LG

    Lower Bounds for Differential Privacy Under Continual Observation and Online Threshold Queries

    Authors: Edith Cohen, Xin Lyu, Jelani Nelson, Tamás Sarlós, Uri Stemmer

    Abstract: One of the most basic problems for studying the "price of privacy over time" is the so called private counter problem, introduced by Dwork et al. (2010) and Chan et al. (2010). In this problem, we aim to track the number of events that occur over time, while hiding the existence of every single event. More specifically, in every time step $t\in[T]$ we learn (in an online fashion) that $Δ_t\geq 0$… ▽ More

    Submitted 17 April, 2024; v1 submitted 28 February, 2024; originally announced March 2024.

  14. arXiv:2312.02132  [pdf, other

    cs.LG cs.AI cs.CR cs.DS

    Hot PATE: Private Aggregation of Distributions for Diverse Task

    Authors: Edith Cohen, Benjamin Cohen-Wang, Xin Lyu, Jelani Nelson, Tamas Sarlos, Uri Stemmer

    Abstract: The Private Aggregation of Teacher Ensembles (PATE) framework enables privacy-preserving machine learning by aggregating responses from disjoint subsets of sensitive data. Adaptations of PATE to tasks with inherent output diversity such as text generation face a core tension: preserving output diversity reduces teacher agreement, which in turn increases the noise required for differential privacy,… ▽ More

    Submitted 17 May, 2025; v1 submitted 4 December, 2023; originally announced December 2023.

  15. arXiv:2311.01242  [pdf, other

    quant-ph cs.CE

    Pushing the Limits of Quantum Computing for Simulating PFAS Chemistry

    Authors: Emil Dimitrov, Goar Sanchez-Sanz, James Nelson, Lee O'Riordan, Myles Doyle, Sean Courtney, Venkatesh Kannan, Hassan Naseri, Alberto Garcia Garcia, James Tricker, Marisa Faraggi, Joshua Goings, Luning Zhao

    Abstract: Accurate and scalable methods for computational quantum chemistry can accelerate research and development in many fields, ranging from drug discovery to advanced material design. Solving the electronic Schrodinger equation is the core problem of computational chemistry. However, the combinatorial complexity of this problem makes it intractable to find exact solutions, except for very small systems… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  16. arXiv:2310.12871  [pdf, other

    stat.AP cs.CY

    The origins of unpredictability in life trajectory prediction tasks

    Authors: Ian Lundberg, Rachel Brown-Weinstock, Susan Clampet-Lundquist, Sarah Pachman, Timothy J. Nelson, Vicki Yang, Kathryn Edin, Matthew J. Salganik

    Abstract: Why are life trajectories difficult to predict? We investigated this question through in-depth qualitative interviews with 40 families sampled from a multi-decade longitudinal study. Our sampling and interviewing process were informed by the earlier efforts of hundreds of researchers to predict life outcomes for participants in this study. The qualitative evidence we uncovered in these interviews… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 54 pages, 8 figures

    ACM Class: J.4

  17. arXiv:2310.01347  [pdf, ps, other

    quant-ph cs.CC

    Hamiltonians whose low-energy states require $Ω(n)$ T gates

    Authors: Nolan J. Coble, Matthew Coudron, Jon Nelson, Seyed Sajjad Nezhadi

    Abstract: The recent resolution of the NLTS Conjecture [ABN22] establishes a prerequisite to the Quantum PCP (QPCP) Conjecture through a novel use of newly-constructed QLDPC codes [LZ22]. Even with NLTS now solved, there remain many independent and unresolved prerequisites to the QPCP Conjecture, such as the NLSS Conjecture of [GL22]. In this work we focus on a specific and natural prerequisite to both NLSS… ▽ More

    Submitted 10 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: fixed typos, updated abstract, additional references added

  18. arXiv:2310.00145  [pdf, other

    cs.RO cs.AI

    3D Reconstruction in Noisy Agricultural Environments: A Bayesian Optimization Perspective for View Planning

    Authors: Athanasios Bacharis, Konstantinos D. Polyzos, Henry J. Nelson, Georgios B. Giannakis, Nikolaos Papanikolopoulos

    Abstract: 3D reconstruction is a fundamental task in robotics that gained attention due to its major impact in a wide variety of practical settings, including agriculture, underwater, and urban environments. This task can be carried out via view planning (VP), which aims to optimally place a certain number of cameras in positions that maximize the visual information, improving the resulting 3D reconstructio… ▽ More

    Submitted 18 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  19. arXiv:2308.14733  [pdf, other

    cs.CR cs.DS

    Differentially Private Aggregation via Imperfect Shuffling

    Authors: Badih Ghazi, Ravi Kumar, Pasin Manurangsi, Jelani Nelson, Samson Zhou

    Abstract: In this paper, we introduce the imperfect shuffle differential privacy model, where messages sent from users are shuffled in an almost uniform manner before being observed by a curator for private aggregation. We then consider the private summation problem. We show that the standard split-and-mix protocol by Ishai et. al. [FOCS 2006] can be adapted to achieve near-optimal utility bounds in the imp… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  20. arXiv:2308.02025  [pdf

    cs.CY cs.IR

    Applications and Societal Implications of Artificial Intelligence in Manufacturing: A Systematic Review

    Authors: John P. Nelson, Justin B. Biddle, Philip Shapira

    Abstract: This paper undertakes a systematic review of relevant extant literature to consider the potential societal implications of the growth of AI in manufacturing. We analyze the extensive range of AI applications in this domain, such as interfirm logistics coordination, firm procurement management, predictive maintenance, and shop-floor monitoring and control of processes, machinery, and workers. Addit… ▽ More

    Submitted 25 July, 2023; originally announced August 2023.

  21. arXiv:2306.04444  [pdf, other

    cs.LG cs.CR stat.ML

    Fast Optimal Locally Private Mean Estimation via Random Projections

    Authors: Hilal Asi, Vitaly Feldman, Jelani Nelson, Huy L. Nguyen, Kunal Talwar

    Abstract: We study the problem of locally private mean estimation of high-dimensional vectors in the Euclidean ball. Existing algorithms for this problem either incur sub-optimal error or have high communication and/or run-time complexity. We propose a new algorithmic framework, ProjUnit, for private mean estimation that yields algorithms that are computationally efficient, have low communication complexity… ▽ More

    Submitted 26 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Added the correct github link

  22. arXiv:2305.10834  [pdf

    cs.AI cs.CL cs.HC cs.MM

    AIwriting: Relations Between Image Generation and Digital Writing

    Authors: Scott Rettberg, Talan Memmott, Jill Walker Rettberg, Jason Nelson, Patrick Lichty

    Abstract: During 2022, both transformer-based AI text generation sys-tems such as GPT-3 and AI text-to-image generation systems such as DALL-E 2 and Stable Diffusion made exponential leaps forward and are unquestionably altering the fields of digital art and electronic literature. In this panel a group of electronic literature authors and theorists consider new oppor-tunities for human creativity presented… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Extended abstract for panel presented at ISEA 2023, Paris 16-22 May 2023

    ACM Class: J.5

  23. arXiv:2304.04488  [pdf, other

    cs.DC

    Hybrid Computing for Interactive Datacenter Applications

    Authors: Pratyush Patel, Katie Lim, Kushal Jhunjhunwalla, Ashlie Martinez, Max Demoulin, Jacob Nelson, Irene Zhang, Thomas Anderson

    Abstract: Field-Programmable Gate Arrays (FPGAs) are more energy efficient and cost effective than CPUs for a wide variety of datacenter applications. Yet, for latency-sensitive and bursty workloads, this advantage can be difficult to harness due to high FPGA spin-up costs. We propose that a hybrid FPGA and CPU computing framework can harness the energy efficiency benefits of FPGAs for such workloads at rea… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 13 pages

  24. arXiv:2303.01229  [pdf, other

    cs.CL cs.AI

    Almanac: Retrieval-Augmented Language Models for Clinical Medicine

    Authors: Cyril Zakka, Akash Chaurasia, Rohan Shad, Alex R. Dalal, Jennifer L. Kim, Michael Moor, Kevin Alexander, Euan Ashley, Jack Boyd, Kathleen Boyd, Karen Hirsch, Curt Langlotz, Joanna Nelson, William Hiesinger

    Abstract: Large-language models have recently demonstrated impressive zero-shot capabilities in a variety of natural language tasks such as summarization, dialogue generation, and question-answering. Despite many promising applications in clinical medicine, adoption of these models in real-world settings has been largely limited by their tendency to generate incorrect and sometimes even toxic statements. In… ▽ More

    Submitted 31 May, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

  25. Local Hamiltonians with no low-energy stabilizer states

    Authors: Nolan J. Coble, Matthew Coudron, Jon Nelson, Seyed Sajjad Nezhadi

    Abstract: The recently-defined No Low-energy Sampleable States (NLSS) conjecture of Gharibian and Le Gall [GL22] posits the existence of a family of local Hamiltonians where all states of low-enough constant energy do not have succinct representations allowing perfect sampling access. States that can be prepared using only Clifford gates (i.e. stabilizer states) are an example of sampleable states, so the N… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  26. arXiv:2302.12170  [pdf, other

    cs.NE

    Language Model Crossover: Variation through Few-Shot Prompting

    Authors: Elliot Meyerson, Mark J. Nelson, Herbie Bradley, Adam Gaier, Arash Moradi, Amy K. Hoover, Joel Lehman

    Abstract: This paper pursues the insight that language models naturally enable an intelligent variation operator similar in spirit to evolutionary crossover. In particular, language models of sufficient scale demonstrate in-context learning, i.e. they can learn from associations between a small number of input patterns to generate outputs incorporating such associations (also called few-shot prompting). Thi… ▽ More

    Submitted 13 May, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

  27. arXiv:2302.06165  [pdf, ps, other

    cs.DS cs.LG

    Sparse Dimensionality Reduction Revisited

    Authors: Mikael Møller Høgsgaard, Lion Kamma, Kasper Green Larsen, Jelani Nelson, Chris Schwiegelshohn

    Abstract: The sparse Johnson-Lindenstrauss transform is one of the central techniques in dimensionality reduction. It supports embedding a set of $n$ points in $\mathbb{R}^d$ into $m=O(\varepsilon^{-2} \lg n)$ dimensions while preserving all pairwise distances to within $1 \pm \varepsilon$. Each input point $x$ is embedded to $Ax$, where $A$ is an $m \times d$ matrix having $s$ non-zeros per column, allowin… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  28. arXiv:2211.12063  [pdf, ps, other

    cs.CR cs.DS

    Generalized Private Selection and Testing with High Confidence

    Authors: Edith Cohen, Xin Lyu, Jelani Nelson, Tamás Sarlós, Uri Stemmer

    Abstract: Composition theorems are general and powerful tools that facilitate privacy accounting across multiple data accesses from per-access privacy bounds. However they often result in weaker bounds compared with end-to-end analysis. Two popular tools that mitigate that are the exponential mechanism (or report noisy max) and the sparse vector technique. They were generalized in a couple of recent private… ▽ More

    Submitted 9 February, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Appeared in ITCS 2023; This version: revised introduction and related works sections;

  29. arXiv:2211.11718  [pdf, ps, other

    cs.DS

    Private Counting of Distinct and k-Occurring Items in Time Windows

    Authors: Badih Ghazi, Ravi Kumar, Pasin Manurangsi, Jelani Nelson

    Abstract: In this work, we study the task of estimating the numbers of distinct and $k$-occurring items in a time window under the constraint of differential privacy (DP). We consider several variants depending on whether the queries are on general time windows (between times $t_1$ and $t_2$), or are restricted to being cumulative (between times $1$ and $t_2$), and depending on whether the DP neighboring re… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: To appear in ITCS 2023

  30. arXiv:2211.06387  [pdf, ps, other

    cs.LG cs.CR cs.DS

    Õptimal Differentially Private Learning of Thresholds and Quasi-Concave Optimization

    Authors: Edith Cohen, Xin Lyu, Jelani Nelson, Tamás Sarlós, Uri Stemmer

    Abstract: The problem of learning threshold functions is a fundamental one in machine learning. Classical learning theory implies sample complexity of $O(ξ^{-1} \log(1/β))$ (for generalization error $ξ$ with confidence $1-β$). The private version of the problem, however, is more challenging and in particular, the sample complexity must depend on the size $|X|$ of the domain. Progress on quantifying this dep… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  31. arXiv:2211.03917  [pdf, ps, other

    cs.DS cs.CC

    On the amortized complexity of approximate counting

    Authors: Ishaq Aden-Ali, Yanjun Han, Jelani Nelson, Huacheng Yu

    Abstract: Naively storing a counter up to value $n$ would require $Ω(\log n)$ bits of memory. Nelson and Yu [NY22], following work of [Morris78], showed that if the query answers need only be $(1+ε)$-approximate with probability at least $1 - δ$, then $O(\log\log n + \log\log(1/δ) + \log(1/ε))$ bits suffice, and in fact this bound is tight. Morris' original motivation for studying this problem though, as we… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  32. arXiv:2210.03305  [pdf, other

    cs.HC

    How Do Data Science Workers Communicate Intermediate Results?

    Authors: Rock Yuren Pang, Ruotong Wang, Joely Nelson, Leilani Battle

    Abstract: Data science workers increasingly collaborate on large-scale projects before communicating insights to a broader audience in the form of visualization. While prior work has modeled how data science teams, oftentimes with distinct roles and work processes, communicate knowledge to outside stakeholders, we have little knowledge of how data science workers communicate intermediately before delivering… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: This paper was accepted for presentation as part of the eighth Symposium on Visualization in Data Science (VDS) at ACM KDD 2022 as well as IEEE VIS 2022. http://www.visualdatascience.org/2022/index.html

  33. arXiv:2207.00956  [pdf, ps, other

    cs.DS cs.CR cs.LG

    Tricking the Hashing Trick: A Tight Lower Bound on the Robustness of CountSketch to Adaptive Inputs

    Authors: Edith Cohen, Jelani Nelson, Tamás Sarlós, Uri Stemmer

    Abstract: CountSketch and Feature Hashing (the "hashing trick") are popular randomized dimensionality reduction methods that support recovery of $\ell_2$-heavy hitters (keys $i$ where $v_i^2 > ε\|\boldsymbol{v}\|_2^2$) and approximate inner products. When the inputs are {\em not adaptive} (do not depend on prior outputs), classic estimators applied to a sketch of size $O(\ell/ε)$ are accurate for a number o… ▽ More

    Submitted 28 August, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

  34. arXiv:2205.09804  [pdf, ps, other

    cs.DS cs.IT cs.LG

    Estimation of Entropy in Constant Space with Improved Sample Complexity

    Authors: Maryam Aliakbarpour, Andrew McGregor, Jelani Nelson, Erik Waingarten

    Abstract: Recent work of Acharya et al. (NeurIPS 2019) showed how to estimate the entropy of a distribution $\mathcal D$ over an alphabet of size $k$ up to $\pmε$ additive error by streaming over $(k/ε^3) \cdot \text{polylog}(1/ε)$ i.i.d. samples and using only $O(1)$ words of memory. In this work, we give a new constant memory scheme that reduces the sample complexity to $(k/ε^2)\cdot \text{polylog}(1/ε)$.… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  35. arXiv:2205.07362  [pdf, ps, other

    cs.LG math.RT stat.ML

    What is an equivariant neural network?

    Authors: Lek-Heng Lim, Bradley J. Nelson

    Abstract: We explain equivariant neural networks, a notion underlying breakthroughs in machine learning from deep convolutional neural networks for computer vision to AlphaFold 2 for protein structure prediction, without assuming knowledge of equivariance or neural networks. The basic mathematical ideas are simple but are often obscured by engineering complications that come with practical realizations. We… ▽ More

    Submitted 16 November, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: 8 pages, 3 figure

    ACM Class: I.2.6

  36. arXiv:2205.01539  [pdf, other

    math.AT cs.CG

    Parameterized Vietoris-Rips Filtrations via Covers

    Authors: Bradley J. Nelson

    Abstract: A challenge in computational topology is to deal with large filtered geometric complexes built from point cloud data such as Vietoris-Rips filtrations. This has led to the development of schemes for parallel computation and compression which restrict simplices to lie in open sets in a cover of the data. We extend the method of acyclic carriers to the setting of persistent homology to give detailed… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 18 pages, 6 figures

    MSC Class: 55N31 (Primary); 68T09 (Secondary)

  37. arXiv:2203.16476  [pdf, ps, other

    cs.DS

    Differentially Private All-Pairs Shortest Path Distances: Improved Algorithms and Lower Bounds

    Authors: Badih Ghazi, Ravi Kumar, Pasin Manurangsi, Jelani Nelson

    Abstract: We study the problem of releasing the weights of all-pair shortest paths in a weighted undirected graph with differential privacy (DP). In this setting, the underlying graph is fixed and two graphs are neighbors if their edge weights differ by at most $1$ in the $\ell_1$-distance. We give an $ε$-DP algorithm with additive error $\tilde{O}(n^{2/3} / ε)$ and an $(ε, δ)$-DP algorithm with additive er… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  38. arXiv:2203.08906  [pdf, other

    cs.AR cs.DC cs.NI

    ORCA: A Network and Architecture Co-design for Offloading us-scale Datacenter Applications

    Authors: Yifan Yuan, Jinghan Huang, Yan Sun, Tianchen Wang, Jacob Nelson, Dan R. K. Ports, Yipeng Wang, Ren Wang, Charlie Tai, Nam Sung Kim

    Abstract: Responding to the "datacenter tax" and "killer microseconds" problems for datacenter applications, diverse solutions including Smart NIC-based ones have been proposed. Nonetheless, they often suffer from high overhead of communications over network and/or PCIe links. To tackle the limitations of the current solutions, this paper proposes ORCA, a holistic network and architecture co-design solution… ▽ More

    Submitted 17 October, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: This paper has been accepted by HPCA'23. This arxiv paper is not the final camera-ready version

  39. arXiv:2203.01599  [pdf, ps, other

    cs.LG cs.CG cs.DS stat.ML

    Uniform Approximations for Randomized Hadamard Transforms with Applications

    Authors: Yeshwanth Cherapanamjeri, Jelani Nelson

    Abstract: Randomized Hadamard Transforms (RHTs) have emerged as a computationally efficient alternative to the use of dense unstructured random matrices across a range of domains in computer science and machine learning. For several applications such as dimensionality reduction and compressed sensing, the theoretical guarantees for methods based on RHTs are comparable to approaches using dense random matric… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: STOC 2022

  40. arXiv:2203.00194  [pdf, other

    cs.CR cs.DS cs.LG

    Private Frequency Estimation via Projective Geometry

    Authors: Vitaly Feldman, Jelani Nelson, Huy Lê Nguyen, Kunal Talwar

    Abstract: In this work, we propose a new algorithm ProjectiveGeometryResponse (PGR) for locally differentially private (LDP) frequency estimation. For a universe size of $k$ and with $n$ users, our $\varepsilon$-LDP algorithm has communication cost $\lceil\log_2k\rceil$ bits in the private coin setting and $\varepsilon\log_2 e + O(1)$ in the public coin setting, and has computation cost… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

  41. arXiv:2202.13736  [pdf, other

    cs.DS cs.LG

    On the Robustness of CountSketch to Adaptive Inputs

    Authors: Edith Cohen, Xin Lyu, Jelani Nelson, Tamás Sarlós, Moshe Shechner, Uri Stemmer

    Abstract: CountSketch is a popular dimensionality reduction technique that maps vectors to a lower dimension using randomized linear measurements. The sketch supports recovering $\ell_2$-heavy hitters of a vector (entries with $v[i]^2 \geq \frac{1}{k}\|\boldsymbol{v}\|^2_2$). We study the robustness of the sketch in adaptive settings where input vectors may depend on the output from prior inputs. Adaptive s… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

  42. arXiv:2201.13012  [pdf, other

    cs.LG cs.CG math.OC

    Topology-Preserving Dimensionality Reduction via Interleaving Optimization

    Authors: Bradley J. Nelson, Yuan Luo

    Abstract: Dimensionality reduction techniques are powerful tools for data preprocessing and visualization which typically come with few guarantees concerning the topological correctness of an embedding. The interleaving distance between the persistent homology of Vietoris-Rips filtrations can be used to identify a scale at which topological features such as clusters or holes in an embedding and original dat… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

  43. arXiv:2112.06095  [pdf, other

    cs.NI cs.DC

    Unlocking the Power of Inline Floating-Point Operations on Programmable Switches

    Authors: Yifan Yuan, Omar Alama, Amedeo Sapio, Jiawei Fei, Jacob Nelson, Dan R. K. Ports, Marco Canini, Nam Sung Kim

    Abstract: The advent of switches with programmable dataplanes has enabled the rapid development of new network functionality, as well as providing a platform for acceleration of a broad range of application-level functionality. However, existing switch hardware was not designed with application acceleration in mind, and thus applications requiring operations or datatypes not used in traditional network prot… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: This paper has been accepted by NSDI'22. This arxiv paper is not the final camera-ready version

  44. arXiv:2111.10984  [pdf, other

    cs.CV cs.CG math.AT

    Topological Regularization for Dense Prediction

    Authors: Deqing Fu, Bradley J. Nelson

    Abstract: Dense prediction tasks such as depth perception and semantic segmentation are important applications in computer vision that have a concrete topological description in terms of partitioning an image into connected components or estimating a function with a small number of local extrema corresponding to objects in the image. We develop a form of topological regularization based on persistent homolo… ▽ More

    Submitted 24 October, 2022; v1 submitted 21 November, 2021; originally announced November 2021.

  45. arXiv:2111.04867  [pdf, other

    cs.DC cs.LG

    TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches

    Authors: Aashaka Shah, Vijay Chidambaram, Meghan Cowan, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Jacob Nelson, Olli Saarikivi, Rachee Singh

    Abstract: Machine learning models are increasingly being trained across multiple GPUs and servers. In this setting, data is transferred between GPUs using communication collectives such as AlltoAll and AllReduce, which can become a significant bottleneck in training large models. Thus, it is important to use efficient algorithms for collective communication. We develop TACCL, a tool that enables algorithm d… ▽ More

    Submitted 5 October, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Accepted at NSDI'23. Contains 20 pages, 11 figures, including Appendix

  46. arXiv:2110.08691  [pdf, other

    cs.DS cs.CG cs.LG stat.ML

    Terminal Embeddings in Sublinear Time

    Authors: Yeshwanth Cherapanamjeri, Jelani Nelson

    Abstract: Recently (Elkin, Filtser, Neiman 2017) introduced the concept of a {\it terminal embedding} from one metric space $(X,d_X)$ to another $(Y,d_Y)$ with a set of designated terminals $T\subset X$. Such an embedding $f$ is said to have distortion $ρ\ge 1$ if $ρ$ is the smallest value such that there exists a constant $C>0$ satisfying \begin{equation*} \forall x\in T\ \forall q\in X,\ C d_X(x, q) \… ▽ More

    Submitted 13 March, 2024; v1 submitted 16 October, 2021; originally announced October 2021.

    Journal ref: TheoretiCS, Volume 3 (March 14, 2024) theoretics:9167

  47. arXiv:2109.13120  [pdf

    cs.CV cs.LG

    An End-to-end Entangled Segmentation and Classification Convolutional Neural Network for Periodontitis Stage Grading from Periapical Radiographic Images

    Authors: Tanjida Kabir, Chun-Teh Lee, Jiman Nelson, Sally Sheng, Hsiu-Wan Meng, Luyao Chen, Muhammad F Walji, Xioaqian Jiang, Shayan Shams

    Abstract: Periodontitis is a biofilm-related chronic inflammatory disease characterized by gingivitis and bone loss in the teeth area. Approximately 61 million adults over 30 suffer from periodontitis (42.2%), with 7.8% having severe periodontitis in the United States. The measurement of radiographic bone loss (RBL) is necessary to make a correct periodontal diagnosis, especially if the comprehensive and lo… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 8 pages, 8 figures, 5 tables

  48. arXiv:2109.12115  [pdf

    eess.IV cs.CV cs.LG

    Use of the Deep Learning Approach to Measure Alveolar Bone Level

    Authors: Chun-Teh Lee, Tanjida Kabir, Jiman Nelson, Sally Sheng, Hsiu-Wan Meng, Thomas E. Van Dyke, Muhammad F. Walji, Xiaoqian Jiang, Shayan Shams

    Abstract: Abstract: Aim: The goal was to use a Deep Convolutional Neural Network to measure the radiographic alveolar bone level to aid periodontal diagnosis. Material and methods: A Deep Learning (DL) model was developed by integrating three segmentation networks (bone area, tooth, cementoenamel junction) and image analysis to measure the radiographic bone level and assign radiographic bone loss (RBL)… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Word count: 3485; Number of figures: 4; tables: 2; references: 34

  49. High-quality Thermal Gibbs Sampling with Quantum Annealing Hardware

    Authors: Jon Nelson, Marc Vuffray, Andrey Y. Lokhov, Tameem Albash, Carleton Coffrin

    Abstract: Quantum Annealing (QA) was originally intended for accelerating the solution of combinatorial optimization tasks that have natural encodings as Ising models. However, recent experiments on QA hardware platforms have demonstrated that, in the operating regime corresponding to weak interactions, the QA hardware behaves like a noisy Gibbs sampler at a hardware-specific effective temperature. This wor… ▽ More

    Submitted 23 February, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Report number: LA-UR-21-28692

    Journal ref: Phys. Rev. Applied 17, 044046 (2022)

  50. arXiv:2108.05022  [pdf, other

    math.AT cs.CG

    Accelerating Iterated Persistent Homology Computations with Warm Starts

    Authors: Yuan Luo, Bradley J. Nelson

    Abstract: Persistent homology is a topological feature used in a variety of applications such as generating features for data analysis and penalizing optimization problems. We develop an approach to accelerate persistent homology computations performed on many similar filtered topological spaces which is based on updating associated matrix factorizations. Our approach improves the update scheme of Cohen-Ste… ▽ More

    Submitted 17 January, 2023; v1 submitted 11 August, 2021; originally announced August 2021.