Skip to main content

Showing 1–50 of 122 results for author: Singh, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16816  [pdf, other

    cs.CL

    IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

    Authors: Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar

    Abstract: As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  2. arXiv:2404.15549  [pdf, other

    cs.CL cs.AI

    PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models

    Authors: Shashi Kant Gupta, Aditya Basu, Mauro Nievas, Jerrin Thomas, Nathan Wolfrath, Adhitya Ramamurthi, Bradley Taylor, Anai N. Kothari, Regina Schwind, Therica M. Miller, Sorena Nadaf-Rahrov, Yanshan Wang, Hrituraj Singh

    Abstract: Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients miss… ▽ More

    Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 Pages, 8 Figures, Supplementary Work Attached

  3. arXiv:2404.08085  [pdf, ps, other

    cs.DS

    Matrix Multiplication Reductions

    Authors: Ashish Gola, Igor Shinkar, Harsimran Singh

    Abstract: In this paper we study a worst case to average case reduction for the problem of matrix multiplication over finite fields. Suppose we have an efficient average case algorithm, that given two random matrices $A,B$ outputs a matrix that has a non-trivial correlation with their product $A \cdot B$. Can we transform it into a worst case algorithm, that outputs the correct answer for all inputs without… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  4. arXiv:2404.07774  [pdf, other

    cs.LG cs.RO

    Sketch-Plan-Generalize: Continual Few-Shot Learning of Inductively Generalizable Spatial Concepts for Language-Guided Robot Manipulation

    Authors: Namasivayam Kalithasan, Sachit Sachdeva, Himanshu Gaurav Singh, Divyanshu Aggarwal, Gurarmaan Singh Panjeta, Vishal Bindal, Arnav Tuli, Rohan Paul, Parag Singla

    Abstract: Our goal is to build embodied agents that can learn inductively generalizable spatial concepts in a continual manner, e.g, constructing a tower of a given height. Existing work suffers from certain limitations (a) (Liang et al., 2023) and their multi-modal extensions, rely heavily on prior knowledge and are not grounded in the demonstrations (b) (Liu et al., 2023) lack the ability to generalize du… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2404.06680  [pdf, other

    cs.CL

    Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology

    Authors: Shashi Kant Gupta, Aditya Basu, Bradley Taylor, Anai Kothari, Hrituraj Singh

    Abstract: Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed R… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 18 pages

  6. arXiv:2404.04714  [pdf, other

    cs.LG cs.AI cs.CR

    Data Poisoning Attacks on Off-Policy Policy Evaluation Methods

    Authors: Elita Lobo, Harvineet Singh, Marek Petrik, Cynthia Rudin, Himabindu Lakkaraju

    Abstract: Off-policy Evaluation (OPE) methods are a crucial tool for evaluating policies in high-stakes domains such as healthcare, where exploration is often infeasible, unethical, or expensive. However, the extent to which such methods can be trusted under adversarial threats to data quality is largely unexplored. In this work, we make the first attempt at investigating the sensitivity of OPE methods to m… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at UAI 2022

  7. arXiv:2403.19885  [pdf, other

    cs.CV cs.RO

    Towards Long Term SLAM on Thermal Imagery

    Authors: Colin Keil, Aniket Gupta, Pushyami Kaveti, Hanumant Singh

    Abstract: Visual SLAM with thermal imagery, and other low contrast visually degraded environments such as underwater, or in areas dominated by snow and ice, remain a difficult problem for many state of the art (SOTA) algorithms. In addition to challenging front-end data association, thermal imagery presents an additional difficulty for long term relocalization and map reuse. The relative temperatures of obj… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 8 pages, 7 figures, Submitted to IROS 2024

  8. arXiv:2403.13170  [pdf, other

    cs.RO

    On Designing Consistent Covariance Recovery from a Deep Learning Visual Odometry Engine

    Authors: Jagatpreet Singh Nir, Dennis Giaya, Hanumant Singh

    Abstract: Deep learning techniques have significantly advanced in providing accurate visual odometry solutions by leveraging large datasets. However, generating uncertainty estimates for these methods remains a challenge. Traditional sensor fusion approaches in a Bayesian framework are well-established, but deep learning techniques with millions of parameters lack efficient methods for uncertainty estimatio… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Submitted to IROS 2024

  9. arXiv:2403.10425  [pdf, other

    cs.CV cs.AI cs.RO

    NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices

    Authors: Zhiyong Zhang, Huaizu Jiang, Hanumant Singh

    Abstract: Real-time high-accuracy optical flow estimation is a crucial component in various applications, including localization and mapping in robotics, object tracking, and activity recognition in computer vision. While recent learning-based optical flow methods have achieved high accuracy, they often come with heavy computation costs. In this paper, we propose a highly efficient optical flow architecture… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  10. arXiv:2403.01628  [pdf, ps, other

    cs.LG

    Recent Advances, Applications, and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables at ML4H 2023 Symposium

    Authors: Hyewon Jeong, Sarah Jabbour, Yuzhe Yang, Rahul Thapta, Hussein Mozannar, William Jongwon Han, Nikita Mehandru, Michael Wornow, Vladislav Lialin, Xin Liu, Alejandro Lozano, Jiacheng Zhu, Rafal Dariusz Kocielnik, Keith Harrigian, Haoran Zhang, Edward Lee, Milos Vukadinovic, Aparna Balagopalan, Vincent Jeanselme, Katherine Matton, Ilker Demirel, Jason Fries, Parisa Rashidi, Brett Beaulieu-Jones, Xuhai Orson Xu , et al. (18 additional authors not shown)

    Abstract: The third ML4H symposium was held in person on December 10, 2023, in New Orleans, Louisiana, USA. The symposium included research roundtable sessions to foster discussions between participants and senior researchers on timely and relevant topics for the \ac{ML4H} community. Encouraged by the successful virtual roundtables in the previous year, we organized eleven in-person roundtables and four vir… ▽ More

    Submitted 5 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: ML4H 2023, Research Roundtables

  11. arXiv:2402.17412  [pdf, other

    cs.CV

    DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models

    Authors: Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, Pin-Yu Chen

    Abstract: In the realm of subject-driven text-to-image (T2I) generative models, recent developments like DreamBooth and BLIP-Diffusion have led to impressive results yet encounter limitations due to their intensive fine-tuning demands and substantial parameter requirements. While the low-rank adaptation (LoRA) module within DreamBooth offers a reduction in trainable parameters, it introduces a pronounced se… ▽ More

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Project Page: https://diffusekrona.github.io/

  12. arXiv:2402.14254  [pdf, other

    cs.LG stat.ML

    A hierarchical decomposition for explaining ML performance discrepancies

    Authors: Jean Feng, Harvineet Singh, Fan Xia, Adarsh Subbaswamy, Alexej Gossmann

    Abstract: Machine learning (ML) algorithms can often differ in performance across domains. Understanding $\textit{why}$ their performance differs is crucial for determining what types of interventions (e.g., algorithmic or operational) are most effective at closing the performance gaps. Existing methods focus on $\textit{aggregate decompositions}$ of the total performance gap into the impact of a shift in t… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 11 pages, 5 figures in main body; 14 pages and 2 figures in appendices

  13. arXiv:2402.05892  [pdf, other

    cs.CV

    Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

    Authors: Shufan Li, Harkanwar Singh, Aditya Grover

    Abstract: In recent years, Transformers have become the de-facto architecture for sequence modeling on text and a variety of multi-dimensional data, such as images and video. However, the use of self-attention layers in a Transformer incurs prohibitive compute and memory complexity that scales quadratically w.r.t. the sequence length. A recent architecture, Mamba, based on state space models has been shown… ▽ More

    Submitted 19 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 22 pages, 7 figures

  14. arXiv:2402.04888  [pdf, other

    cs.IT cs.AI cs.HC cs.LG eess.SP

    RSCNet: Dynamic CSI Compression for Cloud-based WiFi Sensing

    Authors: Borna Barahimi, Hakam Singh, Hina Tabassum, Omer Waqar, Mohammad Omer

    Abstract: WiFi-enabled Internet-of-Things (IoT) devices are evolving from mere communication devices to sensing instruments, leveraging Channel State Information (CSI) extraction capabilities. Nevertheless, resource-constrained IoT devices and the intricacies of deep neural networks necessitate transmitting CSI to cloud servers for sensing. Although feasible, this leads to considerable communication overhea… ▽ More

    Submitted 19 January, 2024; originally announced February 2024.

    Comments: The paper has been accepted by IEEE International Conference on Communications (ICC) 2024

  15. arXiv:2402.02656  [pdf, other

    cs.CL q-bio.QM

    RACER: An LLM-powered Methodology for Scalable Analysis of Semi-structured Mental Health Interviews

    Authors: Satpreet Harcharan Singh, Kevin Jiang, Kanchan Bhasin, Ashutosh Sabharwal, Nidal Moukaddam, Ankit B Patel

    Abstract: Semi-structured interviews (SSIs) are a commonly employed data-collection method in healthcare research, offering in-depth qualitative insights into subject experiences. Despite their value, the manual analysis of SSIs is notoriously time-consuming and labor-intensive, in part due to the difficulty of extracting and categorizing emotional responses, and challenges in scaling human evaluation for l… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  16. arXiv:2312.12630  [pdf, other

    math.DS cs.LG math.CV math.FA math.SP

    Data-driven discovery with Limited Data Acquisition for fluid flow across cylinder

    Authors: Dr. Himanshu Singh

    Abstract: One of the central challenge for extracting governing principles of dynamical system via Dynamic Mode Decomposition (DMD) is about the limit data availability or formally called as Limited Data Acquisition in the present paper. In the interest of discovering the governing principles for a dynamical system with limited data acquisition, we provide a variant of Kernelized Extended DMD (KeDMD) based… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 52 Pages, 16 Figures, JULIA Coding Result for Dynamic Mode Decomposition, Part of this work selected for 42nd Annual Dynamic Days 2024 Conference (January 8 to 10) at University of California, Davis

    MSC Class: 37N10 76D05 76D25 47N50 47A25 68T01 28A10 28A35

  17. arXiv:2312.10693  [pdf, other

    cs.LG math.FA

    An appointment with Reproducing Kernel Hilbert Space generated by Generalized Gaussian RBF as $L^2-$measure

    Authors: Himanshu Singh

    Abstract: Gaussian Radial Basis Function (RBF) Kernels are the most-often-employed kernels in artificial intelligence and machine learning routines for providing optimally-best results in contrast to their respective counter-parts. However, a little is known about the application of the Generalized Gaussian Radial Basis Function on various machine learning algorithms namely, kernel regression, support vecto… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 20 pages, MATLAB CODE, 11 figures, Results presented in AMS Spring Eastern Sectional Meeting on April 2023

    MSC Class: NUMBER 68-Computer Science; 68T-Artificial Intelligence and 68T07-Artificial Neural Networks and Deep Learning

  18. A Survey of Classical And Quantum Sequence Models

    Authors: I-Chi Chen, Harshdeep Singh, V L Anukruti, Brian Quanz, Kavitha Yogaraj

    Abstract: Our primary objective is to conduct a brief survey of various classical and quantum neural net sequence models, which includes self-attention and recurrent neural networks, with a focus on recent quantum approaches proposed to work with near-term quantum devices, while exploring some basic enhancements for these quantum models. We re-implement a key representative set of these existing methods, ad… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 6 pages, 10 figures, accepted as a COMSNETS paper

    Journal ref: Conference: 2024 16th International Conference on COMmunication Systems & NETworkS (COMSNETS)

  19. arXiv:2312.09958  [pdf, other

    cs.AI cs.IR

    Distilling Large Language Models for Matching Patients to Clinical Trials

    Authors: Mauro Nievas, Aditya Basu, Yanshan Wang, Hrituraj Singh

    Abstract: The recent success of large language models (LLMs) has paved the way for their adoption in the high-stakes domain of healthcare. Specifically, the application of LLMs in patient-trial matching, which involves assessing patient eligibility against clinical trial's nuanced inclusion and exclusion criteria, has shown promise. Recent research has shown that GPT-3.5, a widely recognized LLM developed b… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  20. arXiv:2312.06738  [pdf, other

    cs.CV

    InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following

    Authors: Shufan Li, Harkanwar Singh, Aditya Grover

    Abstract: The ability to provide fine-grained control for generating and editing visual imagery has profound implications for computer vision and its applications. Previous works have explored extending controllability in two directions: instruction tuning with text-based prompts and multi-modal conditioning. However, these works make one or more unnatural assumptions on the number and/or type of modality i… ▽ More

    Submitted 26 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 29 pages, 14 figures

  21. arXiv:2312.04745  [pdf, other

    stat.AP cs.LG

    A Brief Tutorial on Sample Size Calculations for Fairness Audits

    Authors: Harvineet Singh, Fan Xia, Mi-Ok Kim, Romain Pirracchio, Rumi Chunara, Jean Feng

    Abstract: In fairness audits, a standard objective is to detect whether a given algorithm performs substantially differently between subgroups. Properly powering the statistical analysis of such audits is crucial for obtaining informative fairness assessments, as it ensures a high probability of detecting unfairness when it exists. However, limited guidance is available on the amount of data necessary for a… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 4 pages, 1 figure, 1 table, Workshop on Regulatable Machine Learning at the 37th Conference on Neural Information Processing Systems

  22. arXiv:2312.00655   

    cs.LG

    Machine Learning for Health symposium 2023 -- Findings track

    Authors: Stefan Hegselmann, Antonio Parziale, Divya Shanmugam, Shengpu Tang, Mercy Nyamewaa Asiedu, Serina Chang, Thomas Hartvigsen, Harvineet Singh

    Abstract: A collection of the accepted Findings papers that were presented at the 3rd Machine Learning for Health symposium (ML4H 2023), which was held on December 10, 2023, in New Orleans, Louisiana, USA. ML4H 2023 invited high-quality submissions on relevant problems in a variety of health-related disciplines including healthcare, biomedicine, and public health. Two submission tracks were offered: the arc… ▽ More

    Submitted 15 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    MSC Class: 68Txx ACM Class: I.2; J.3; I.6; I.4

  23. arXiv:2311.11463  [pdf, other

    cs.LG stat.ML

    Designing monitoring strategies for deployed machine learning algorithms: navigating performativity through a causal lens

    Authors: Jean Feng, Adarsh Subbaswamy, Alexej Gossmann, Harvineet Singh, Berkman Sahiner, Mi-Ok Kim, Gene Pennello, Nicholas Petrick, Romain Pirracchio, Fan Xia

    Abstract: After a machine learning (ML)-based system is deployed, monitoring its performance is important to ensure the safety and effectiveness of the algorithm over time. When an ML algorithm interacts with its environment, the algorithm can affect the data-generating mechanism and be a major source of bias when evaluating its standalone performance, an issue known as performativity. Although prior work h… ▽ More

    Submitted 26 February, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  24. Adaptive Search Optimization: Dynamic Algorithm Selection and Caching for Enhanced Database Performance

    Authors: Hakikat Singh

    Abstract: Efficient search operations in databases are paramount for timely retrieval of information various applications. This research introduces a novel approach, combining dynamicalgorithm1 selection and caching2 strategies, to optimize search performance. The proposed dynamic search algorithm intelligently switches between Binary3 and Interpolation 4 Search based on dataset characteristics, significant… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  25. arXiv:2309.10698  [pdf, other

    cs.RO

    OASIS: Optimal Arrangements for Sensing in SLAM

    Authors: Pushyami Kaveti, Matthew Giamou, Hanumant Singh, David M. Rosen

    Abstract: The number and arrangement of sensors on mobile robot dramatically influence its perception capabilities. Ensuring that sensors are mounted in a manner that enables accurate detection, localization, and mapping is essential for the success of downstream control tasks. However, when designing a new robotic platform, researchers and practitioners alike usually mimic standard configurations or maximi… ▽ More

    Submitted 21 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

  26. arXiv:2309.10348  [pdf, other

    cs.LG cs.CR cs.CV

    Language Guided Adversarial Purification

    Authors: Himanshu Singh, A V Subramanyam

    Abstract: Adversarial purification using generative models demonstrates strong adversarial defense performance. These methods are classifier and attack-agnostic, making them versatile but often computationally intensive. Recent strides in diffusion and score networks have improved image generation and, by extension, adversarial purification. Another highly efficient class of adversarial defense methods know… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    MSC Class: 68T45 (Primary); 68T10 (Secondary) ACM Class: I.5.4

  27. arXiv:2307.07863  [pdf, other

    cs.LG cs.AI

    Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans

    Authors: Anant Mehta, Prajit Sengupta, Divisha Garg, Harpreet Singh, Yosi Shacham Diamand

    Abstract: Plant breeders and agricultural researchers can increase crop productivity by identifying desirable features, disease resistance, and nutritional content by analysing the Dry Bean dataset. This study analyses and compares different Support Vector Machine (SVM) classification algorithms, namely linear, polynomial, and radial basis function (RBF), along with other popular classification algorithms.… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures

  28. arXiv:2307.01362  [pdf, other

    cs.CV

    A Strong Baseline for Point Cloud Registration via Direct Superpoints Matching

    Authors: Aniket Gupta, Yiming Xie, Hanumant Singh, Huaizu Jiang

    Abstract: Deep neural networks endow the downsampled superpoints with highly discriminative feature representations. Previous dominant point cloud registration approaches match these feature representations as the first step, e.g., using the Sinkhorn algorithm. A RANSAC-like method is then usually adopted as a post-processing refinement to filter the outliers. Other dominant method is to directly predict th… ▽ More

    Submitted 29 March, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  29. arXiv:2306.14657  [pdf, other

    cs.RO eess.SY

    A Diversity Analysis of Safety Metrics Comparing Vehicle Performance in the Lead-Vehicle Interaction Regime

    Authors: Harnarayan Singh, Bowen Weng, Sughosh J. Rao, Devin Elsasser

    Abstract: Vehicle performance metrics analyze data sets consisting of subject vehicle's interactions with other road users in a nominal driving environment and provide certain performance measures as outputs. To the best of the authors' knowledge, the vehicle safety performance metrics research dates back to at least 1967. To date, there still does not exist a community-wide accepted metric or a set of metr… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: A modified manuscript of this preprint has been accepted to be published as a regular paper at IEEE Transactions on Intelligent Transportation Systems

  30. arXiv:2306.08522  [pdf, other

    cs.RO

    Challenges of Indoor SLAM: A multi-modal multi-floor dataset for SLAM evaluation

    Authors: Pushyami Kaveti, Aniket Gupta, Dennis Giaya, Madeline Karp, Colin Keil, Jagatpreet Nir, Zhiyong Zhang, Hanumant Singh

    Abstract: Robustness in Simultaneous Localization and Mapping (SLAM) remains one of the key challenges for the real-world deployment of autonomous systems. SLAM research has seen significant progress in the last two and a half decades, yet many state-of-the-art (SOTA) algorithms still struggle to perform reliably in real-world environments. There is a general consensus in the research community that we need… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  31. arXiv:2306.02631  [pdf, other

    cs.RO

    Bridging the Domain Gap between Synthetic and Real-World Data for Autonomous Driving

    Authors: Xiangyu Bai, Yedi Luo, Le Jiang, Aniket Gupta, Pushyami Kaveti, Hanumant Singh, Sarah Ostadabbas

    Abstract: Modern autonomous systems require extensive testing to ensure reliability and build trust in ground vehicles. However, testing these systems in the real-world is challenging due to the lack of large and diverse datasets, especially in edge cases. Therefore, simulations are necessary for their development and evaluation. However, existing open-source simulators often exhibit a significant gap betwe… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  32. arXiv:2306.01704  [pdf, other

    cs.RO

    Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis

    Authors: Yedi Luo, Xiangyu Bai, Le Jiang, Aniket Gupta, Eric Mortin, Hanumant Singh, Sarah Ostadabbas

    Abstract: This paper presents a novel approach, TeFS (Temporal-controlled Frame Swap), to generate synthetic stereo driving data for visual simultaneous localization and mapping (vSLAM) tasks. TeFS is designed to overcome the lack of native stereo vision support in commercial driving simulators, and we demonstrate its effectiveness using Grand Theft Auto V (GTA V), a high-budget open-world video game engine… ▽ More

    Submitted 25 December, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  33. arXiv:2305.17611  [pdf, other

    cs.CV

    Bayesian Decision Making to Localize Visual Queries in 2D

    Authors: Syed Asjad, Aniket Gupta, Hanumant Singh

    Abstract: This report describes our approach for the EGO4D 2023 Visual Query 2D Localization Challenge. Our method aims to reduce the number of False Positives (FP) that occur because of high similarity between the visual crop and the proposed bounding boxes from the baseline's Region Proposal Network (RPN). Our method uses a transformer to determine similarity in higher dimensions which is used as our prio… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: Report for the EGO4D 2023 Visual Query 2D Localization Challenge

  34. arXiv:2305.15074  [pdf, other

    cs.CL cs.AI

    Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models

    Authors: Daman Arora, Himanshu Gaurav Singh, Mausam

    Abstract: The performance of large language models (LLMs) on existing reasoning benchmarks has significantly improved over the past years. In response, we present JEEBench, a considerably more challenging benchmark dataset for evaluating the problem solving abilities of LLMs. We curate 515 challenging pre-engineering mathematics, physics and chemistry problems from the highly competitive IIT JEE-Advanced ex… ▽ More

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  35. arXiv:2305.14562  [pdf, other

    cs.LG eess.SY

    GiPH: Generalizable Placement Learning for Adaptive Heterogeneous Computing

    Authors: Yi Hu, Chaoran Zhang, Edward Andert, Harshul Singh, Aviral Shrivastava, James Laudon, Yanqi Zhou, Bob Iannucci, Carlee Joe-Wong

    Abstract: Careful placement of a computational application within a target device cluster is critical for achieving low application completion time. The problem is challenging due to its NP-hardness and combinatorial nature. In recent years, learning-based approaches have been proposed to learn a placement policy that can be applied to unseen applications, motivated by the problem of placing a neural networ… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: to be published in Proceedings of Machine Learning and Systems 5 (MLSys 2023)

  36. arXiv:2305.14410  [pdf, other

    cs.CV cs.AI cs.CL

    Image Manipulation via Multi-Hop Instructions -- A New Dataset and Weakly-Supervised Neuro-Symbolic Approach

    Authors: Harman Singh, Poorva Garg, Mohit Gupta, Kevin Shah, Ashish Goswami, Satyam Modi, Arnab Kumar Mondal, Dinesh Khandelwal, Dinesh Garg, Parag Singla

    Abstract: We are interested in image manipulation via natural language text -- a task that is useful for multiple AI applications but requires complex reasoning over multi-modal spaces. We extend recently proposed Neuro Symbolic Concept Learning (NSCL), which has been quite effective for the task of Visual Question Answering (VQA), for the task of image manipulation. Our system referred to as NeuroSIM can p… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  37. arXiv:2305.13812  [pdf, other

    cs.CL cs.CV

    Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality

    Authors: Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang, Wenhan Xiong, Jingfei Du, Yu Chen

    Abstract: Contrastively trained vision-language models have achieved remarkable progress in vision and language representation learning, leading to state-of-the-art models for various downstream multimodal tasks. However, recent research has highlighted severe limitations of these models in their ability to perform compositional reasoning over objects, attributes, and relations. Scene graphs have emerged as… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (long paper, main conference)

  38. arXiv:2305.07859  [pdf, other

    cs.LG

    HAiVA: Hybrid AI-assisted Visual Analysis Framework to Study the Effects of Cloud Properties on Climate Patterns

    Authors: Subhashis Hazarika, Haruki Hirasawa, Sookyung Kim, Kalai Ramea, Salva R. Cachay, Peetak Mitra, Dipti Hingmire, Hansi Singh, Phil J. Rasch

    Abstract: Clouds have a significant impact on the Earth's climate system. They play a vital role in modulating Earth's radiation budget and driving regional changes in temperature and precipitation. This makes clouds ideal for climate intervention techniques like Marine Cloud Brightening (MCB) which refers to modification in cloud reflectivity, thereby cooling the surrounding region. However, to avoid unint… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

  39. arXiv:2302.13330  [pdf, ps, other

    math.CO cs.DM

    Power of $k$ Choices in the Semi-Random Graph Process

    Authors: Paweł Prałat, Harjas Singh

    Abstract: The semi-random graph process is a single player game in which the player is initially presented an empty graph on $n$ vertices. In each round, a vertex $u$ is presented to the player independently and uniformly at random. The player then adaptively selects a vertex $v$, and adds the edge $uv$ to the graph. For a fixed monotone graph property, the objective of the player is to force the graph to s… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

    Comments: 18 pages

  40. arXiv:2302.03258  [pdf, other

    cs.LG

    Climate Intervention Analysis using AI Model Guided by Statistical Physics Principles

    Authors: Soo Kyung Kim, Kalai Ramea, Salva Rühling Cachay, Haruki Hirasawa, Subhashis Hazarika, Dipti Hingmire, Peetak Mitra, Philip J. Rasch, Hansi A. Singh

    Abstract: The availability of training data remains a significant obstacle for the implementation of machine learning in scientific applications. In particular, estimating how a system might respond to external forcings or perturbations requires specialized labeled data or targeted simulations, which may be computationally intensive to generate at scale. In this study, we propose a novel solution to this ch… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  41. arXiv:2302.01957  [pdf, other

    physics.ao-ph cs.AI

    Accelerating exploration of Marine Cloud Brightening impacts on tipping points Using an AI Implementation of Fluctuation-Dissipation Theorem

    Authors: Haruki Hirasawa, Sookyung Kim, Peetak Mitra, Subhashis Hazarika, Salva Ruhling-Cachay, Dipti Hingmire, Kalai Ramea, Hansi Singh, Philip J. Rasch

    Abstract: Marine cloud brightening (MCB) is a proposed climate intervention technology to partially offset greenhouse gas warming and possibly avoid crossing climate tipping points. The impacts of MCB on regional climate are typically estimated using computationally expensive Earth System Model (ESM) simulations, preventing a thorough assessment of the large possibility space of potential MCB interventions.… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: AAAI Spring Symposium conference full paper

    ACM Class: J.2; I.2.1

  42. arXiv:2301.10363  [pdf, other

    cs.RO cs.NE

    Planning-Assisted Context-Sensitive Autonomous Shepherding of Dispersed Robotic Swarms in Obstacle-Cluttered Environments

    Authors: Jing Liu, Hemant Singh, Saber Elsayed, Robert Hunjet, Hussein Abbass

    Abstract: Robotic shepherding is a bio-inspired approach to autonomously guiding a swarm of agents towards a desired location. The research area has earned increasing research interest recently due to the efficacy of controlling a large number of agents in a swarm (sheep) using a smaller number of actuators (sheepdogs). However, shepherding a highly dispersed swarm in an obstacle-cluttered environment remai… ▽ More

    Submitted 27 February, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: 17 pages, 6 figures

  43. arXiv:2301.04447  [pdf, other

    cs.CV cs.LG

    VS-Net: Multiscale Spatiotemporal Features for Lightweight Video Salient Document Detection

    Authors: Hemraj Singh, Mridula Verma, Ramalingaswamy Cheruku

    Abstract: Video Salient Document Detection (VSDD) is an essential task of practical computer vision, which aims to highlight visually salient document regions in video frames. Previous techniques for VSDD focus on learning features without considering the cooperation among and across the appearance and motion cues and thus fail to perform in practical scenarios. Moreover, most of the previous techniques dem… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Journal ref: https://ictai.computer.org/2022/

  44. arXiv:2212.06207  [pdf, other

    quant-ph cs.LG

    Quantum Phase Recognition using Quantum Tensor Networks

    Authors: Shweta Sahoo, Utkarsh Azad, Harjinder Singh

    Abstract: Machine learning (ML) has recently facilitated many advances in solving problems related to many-body physical systems. Given the intrinsic quantum nature of these problems, it is natural to speculate that quantum-enhanced machine learning will enable us to unveil even greater details than we currently have. With this motivation, this paper examines a quantum machine learning approach based on sha… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Accepted in European Physical Journal Plus (EPJP). 10 pages, 6 figures, 4 tables

  45. Learning Neuro-symbolic Programs for Language Guided Robot Manipulation

    Authors: Namasivayam Kalithasan, Himanshu Singh, Vishal Bindal, Arnav Tuli, Vishwajeet Agrawal, Rahul Jain, Parag Singla, Rohan Paul

    Abstract: Given a natural language instruction and an input scene, our goal is to train a model to output a manipulation program that can be executed by the robot. Prior approaches for this task possess one of the following limitations: (i) rely on hand-coded symbols for concepts limiting generalization beyond those seen during training [1] (ii) infer action sequences from instructions but require dense sub… ▽ More

    Submitted 10 March, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: International Conference on Robotics and Automation (ICRA), 2023

  46. arXiv:2210.10769  [pdf, other

    cs.LG stat.ML

    "Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

    Authors: Haoran Zhang, Harvineet Singh, Marzyeh Ghassemi, Shalmali Joshi

    Abstract: Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate distributions, or changes in the relationship between label and features. When a model does fail during deployment, attributing performance change to these factors is cr… ▽ More

    Submitted 6 June, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Published in ICML 2023

  47. arXiv:2210.07466  [pdf, other

    cs.CV cs.GR

    Synthetic-to-real Composite Semantic Segmentation in Additive Manufacturing

    Authors: Aliaksei Petsiuk, Harnoor Singh, Himanshu Dadhwal, Joshua M. Pearce

    Abstract: The application of computer vision and machine learning methods in the field of additive manufacturing (AM) for semantic segmentation of the structural elements of 3-D printed products will improve real-time failure analysis systems and can potentially reduce the number of defects by enabling in situ corrections. This work demonstrates the possibilities of using physics-based rendering for labeled… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

  48. Design and Evaluation of a Generic Visual SLAM Framework for Multi-Camera Systems

    Authors: Pushyami Kaveti, Shankara Narayanan Vaidyanathan, Arvind Thamilchelvan, Hanumant Singh

    Abstract: Multi-camera systems have been shown to improve the accuracy and robustness of SLAM estimates, yet state-of-the-art SLAM systems predominantly support monocular or stereo setups. This paper presents a generic sparse visual SLAM framework capable of running on any number of cameras and in any arrangement. Our SLAM system uses the generalized camera model, which allows us to represent an arbitrary m… ▽ More

    Submitted 9 May, 2024; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: Nov 2023 IEEE Robotics and Automation Letters PP(99):1-8

  49. arXiv:2209.15301  [pdf, other

    cs.CL

    Medical Question Understanding and Answering with Knowledge Grounding and Semantic Self-Supervision

    Authors: Khalil Mrini, Harpreet Singh, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter Chang, Emilia Farcas, Ndapa Nakashole

    Abstract: Current medical question answering systems have difficulty processing long, detailed and informally worded questions submitted by patients, called Consumer Health Questions (CHQs). To address this issue, we introduce a medical question understanding and answering system with knowledge grounding and semantic self-supervision. Our system is a pipeline that first summarizes a long, medical, user-writ… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted as Main Conference Long paper at COLING 2022

  50. Towards Robust Off-Policy Evaluation via Human Inputs

    Authors: Harvineet Singh, Shalmali Joshi, Finale Doshi-Velez, Himabindu Lakkaraju

    Abstract: Off-policy Evaluation (OPE) methods are crucial tools for evaluating policies in high-stakes domains such as healthcare, where direct deployment is often infeasible, unethical, or expensive. When deployment environments are expected to undergo changes (that is, dataset shifts), it is important for OPE methods to perform robust evaluation of the policies amidst such changes. Existing approaches con… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 10 pages, 5 figures, 1 table. Appeared at AIES '22: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society. Expanded version of arXiv:2103.15933