Skip to main content

Showing 1–50 of 51 results for author: Boddeti, V

Searching in archive cs. Search in all archives.
.
  1. The Dark Side of Dataset Scaling: Evaluating Racial Classification in Multimodal Models

    Authors: Abeba Birhane, Sepehr Dehdashtian, Vinay Uday Prabhu, Vishnu Boddeti

    Abstract: Scale the model, scale the data, scale the GPU farms is the reigning sentiment in the world of generative AI today. While model scaling has been extensively studied, data scaling and its downstream impacts on model performance remain under-explored. This is particularly important in the context of multimodal datasets whose main source is the World Wide Web, condensed and packaged as the Common Cra… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: To appear in the proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT 24), June 3 to 6, 2024, Rio de Janeiro, Brazil. arXiv admin note: text overlap with arXiv:2306.13141

  2. arXiv:2404.16255  [pdf, other

    cs.CR cs.CV

    Enhancing Privacy in Face Analytics Using Fully Homomorphic Encryption

    Authors: Bharat Yalavarthi, Arjun Ramesh Kaushik, Arun Ross, Vishnu Boddeti, Nalini Ratha

    Abstract: Modern face recognition systems utilize deep neural networks to extract salient features from a face. These features denote embeddings in latent space and are often stored as templates in a face recognition system. These embeddings are susceptible to data leakage and, in some cases, can even be used to reconstruct the original face image. To prevent compromising identities, template protection sch… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  3. arXiv:2404.09454  [pdf, other

    cs.CV cs.CY cs.LG

    Utility-Fairness Trade-Offs and How to Find Them

    Authors: Sepehr Dehdashtian, Bashir Sadeghi, Vishnu Naresh Boddeti

    Abstract: When building classification systems with demographic fairness considerations, there are two objectives to satisfy: 1) maximizing utility for the specific task and 2) ensuring fairness w.r.t. a known demographic attribute. These objectives often compete, so optimizing both can lead to a trade-off between utility and fairness. While existing works acknowledge the trade-offs and study their limits,… ▽ More

    Submitted 23 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

  4. arXiv:2403.15593  [pdf, other

    cs.CV cs.LG

    FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

    Authors: Sepehr Dehdashtian, Lan Wang, Vishnu Naresh Boddeti

    Abstract: Large pre-trained vision-language models such as CLIP provide compact and general-purpose representations of text and images that are demonstrably effective across multiple downstream zero-shot prediction tasks. However, owing to the nature of their training process, these models have the potential to 1) propagate or amplify societal biases in the training data and 2) learn to rely on spurious fea… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: The Twelfth International Conference on Learning Representations (ICLR) 2024

  5. arXiv:2403.07198  [pdf, other

    cs.CV

    Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions

    Authors: Lan Wang, Vishnu Boddeti, Sernam Lim

    Abstract: We introduce a novel text-to-pose video editing method, ReimaginedAct. While existing video editing tasks are limited to changes in attributes, backgrounds, and styles, our method aims to predict open-ended human action changes in video. Moreover, our method can accept not only direct instructional text prompts but also `what if' questions to predict possible action changes. ReimaginedAct comprise… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  6. arXiv:2402.15492  [pdf, other

    cs.LG eess.SP

    Mechanics-Informed Autoencoder Enables Automated Detection and Localization of Unforeseen Structural Damage

    Authors: Xuyang Li, Hamed Bolandi, Mahdi Masmoudi, Talal Salem, Nizar Lajnef, Vishnu Naresh Boddeti

    Abstract: Structural health monitoring (SHM) is vital for ensuring the safety and longevity of structures like buildings and bridges. As the volume and scale of structures and the impact of their failure continue to grow, there is a dire need for SHM techniques that are scalable, inexpensive, operate passively without human intervention, and customized for each mechanical structure without the need for comp… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  7. arXiv:2311.03449  [pdf, other

    cs.CY

    Into the LAIONs Den: Investigating Hate in Multimodal Datasets

    Authors: Abeba Birhane, Vinay Prabhu, Sang Han, Vishnu Naresh Boddeti, Alexandra Sasha Luccioni

    Abstract: 'Scale the model, scale the data, scale the compute' is the reigning sentiment in the world of generative AI today. While the impact of model scaling has been extensively studied, we are only beginning to scratch the surface of data scaling and its consequences. This is especially of critical importance in the context of vision-language datasets such as LAION. These datasets are continually growin… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: To appear at 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Datasets and Benchmarks Track. arXiv admin note: substantial text overlap with arXiv:2306.13141

  8. arXiv:2310.08012  [pdf, other

    cs.LG cs.CR

    AutoFHE: Automated Adaption of CNNs for Efficient Evaluation over FHE

    Authors: Wei Ao, Vishnu Naresh Boddeti

    Abstract: Secure inference of deep convolutional neural networks (CNNs) under RNS-CKKS involves polynomial approximation of unsupported non-linear activation functions. However, existing approaches have three main limitations: 1) Inflexibility: The polynomial approximation and associated homomorphic evaluation architecture are customized manually for each CNN architecture and do not generalize to other netw… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: USENIX Security Symposium 2024

  9. arXiv:2308.11043  [pdf, other

    cs.LG stat.ML

    Spurious Correlations and Where to Find Them

    Authors: Gautam Sreekumar, Vishnu Naresh Boddeti

    Abstract: Spurious correlations occur when a model learns unreliable features from the data and are a well-known drawback of data-driven learning. Although there are several algorithms proposed to mitigate it, we are yet to jointly derive the indicators of spurious correlations. As a result, the solutions built upon standalone hypotheses fail to beat simple ERM baselines. We collect some of the commonly stu… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 2nd Workshop on SCIS, ICML 2023

  10. arXiv:2308.06515  [pdf, other

    cs.CV cs.DC

    Seed Feature Maps-based CNN Models for LEO Satellite Remote Sensing Services

    Authors: Zhichao Lu, Chuntao Ding, Shangguang Wang, Ran Cheng, Felix Juefei-Xu, Vishnu Naresh Boddeti

    Abstract: Deploying high-performance convolutional neural network (CNN) models on low-earth orbit (LEO) satellites for rapid remote sensing image processing has attracted significant interest from industry and academia. However, the limited resources available on LEO satellites contrast with the demands of resource-intensive CNN models, necessitating the adoption of ground-station server assistance for trai… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 11 pages

  11. arXiv:2308.02066  [pdf, other

    cs.CV cs.AI cs.LG

    Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing with Non-Learnable Primitives

    Authors: Chuntao Ding, Zhichao Lu, Shangguang Wang, Ran Cheng, Vishnu Naresh Boddeti

    Abstract: Multi-task learning (MTL) seeks to learn a single model to accomplish multiple tasks by leveraging shared information among the tasks. Existing MTL models, however, have been known to suffer from negative interference among tasks. Efforts to mitigate task interference have focused on either loss/gradient balancing or implicit parameter partitioning with partial overlaps among the tasks. In this pa… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: CVPR 2023

  12. arXiv:2308.02065  [pdf, other

    cs.CV cs.AI cs.LG

    On the Biometric Capacity of Generative Face Models

    Authors: Vishnu Naresh Boddeti, Gautam Sreekumar, Arun Ross

    Abstract: There has been tremendous progress in generating realistic faces with high fidelity over the past few years. Despite this progress, a crucial question remains unanswered: "Given a generative face model, how many unique identities can it generate?" In other words, what is the biometric capacity of the generative face model? A scientific basis for answering this question will benefit evaluating and… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: IJCB 2023

  13. arXiv:2307.16890  [pdf, other

    cs.RO cs.AI cs.LG cs.NE

    Discovering Adaptable Symbolic Algorithms from Scratch

    Authors: Stephen Kelly, Daniel S. Park, Xingyou Song, Mitchell McIntire, Pranav Nashikkar, Ritam Guha, Wolfgang Banzhaf, Kalyanmoy Deb, Vishnu Naresh Boddeti, Jie Tan, Esteban Real

    Abstract: Autonomous robots deployed in the real world will need control policies that rapidly adapt to environmental changes. To this end, we propose AutoRobotics-Zero (ARZ), a method based on AutoML-Zero that discovers zero-shot adaptable policies from scratch. In contrast to neural network adaptation policies, where only model parameters are optimized, ARZ can build control algorithms with the full expre… ▽ More

    Submitted 13 October, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: Published and Best Overall Paper Finalist at International Conference on Intelligent Robots and Systems (IROS) 2023. See https://youtu.be/sEFP1Hay4nE for associated video file

  14. arXiv:2306.13141  [pdf, other

    cs.CY

    On Hate Scaling Laws For Data-Swamps

    Authors: Abeba Birhane, Vinay Prabhu, Sang Han, Vishnu Naresh Boddeti

    Abstract: `Scale the model, scale the data, scale the GPU-farms' is the reigning sentiment in the world of generative AI today. While model scaling has been extensively studied, data scaling and its downstream impacts remain under explored. This is especially of critical importance in the context of visio-linguistic datasets whose main source is the World Wide Web, condensed and packaged as the CommonCrawl… ▽ More

    Submitted 28 June, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  15. arXiv:2302.07734  [pdf, other

    cs.CV cs.DC cs.LG

    TFormer: A Transmission-Friendly ViT Model for IoT Devices

    Authors: Zhichao Lu, Chuntao Ding, Felix Juefei-Xu, Vishnu Naresh Boddeti, Shangguang Wang, Yun Yang

    Abstract: Deploying high-performance vision transformer (ViT) models on ubiquitous Internet of Things (IoT) devices to provide high-quality vision services will revolutionize the way we live, work, and interact with the world. Due to the contradiction between the limited resources of IoT devices and resource-intensive ViT models, the use of cloud servers to assist ViT model training has become mainstream. H… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: IEEE Transactions on Parallel and Distributed Systems

  16. arXiv:2301.02580  [pdf, other

    physics.geo-ph cs.CE cs.LG

    Neuro-DynaStress: Predicting Dynamic Stress Distributions in Structural Components

    Authors: Hamed Bolandi, Gautam Sreekumar, Xuyang Li, Nizar Lajnef, Vishnu Naresh Boddeti

    Abstract: Structural components are typically exposed to dynamic loading, such as earthquakes, wind, and explosions. Structural engineers should be able to conduct real-time analysis in the aftermath or during extreme disaster events requiring immediate corrections to avoid fatal failures. As a result, it is crucial to predict dynamic stress distributions during highly disruptive events in real-time. Curren… ▽ More

    Submitted 18 December, 2022; originally announced January 2023.

    Comments: 16 pages, 12 figures. arXiv admin note: text overlap with arXiv:2211.16190

  17. arXiv:2212.11005  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Revisiting Residual Networks for Adversarial Robustness: An Architectural Perspective

    Authors: Shihua Huang, Zhichao Lu, Kalyanmoy Deb, Vishnu Naresh Boddeti

    Abstract: Efforts to improve the adversarial robustness of convolutional neural networks have primarily focused on developing more effective adversarial training methods. In contrast, little attention was devoted to analyzing the role of architectural elements (such as topology, depth, and width) on adversarial robustness. This paper seeks to bridge this gap and present a holistic study on the impact of arc… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

  18. arXiv:2211.16190  [pdf, other

    cs.LG

    Physics Informed Neural Network for Dynamic Stress Prediction

    Authors: Hamed Bolandi, Gautam Sreekumar, Xuyang Li, Nizar Lajnef, Vishnu Naresh Boddeti

    Abstract: Structural failures are often caused by catastrophic events such as earthquakes and winds. As a result, it is crucial to predict dynamic stress distributions during highly disruptive events in real time. Currently available high-fidelity methods, such as Finite Element Models (FEMs), suffer from their inherent high complexity. Therefore, to reduce computational cost while maintaining accuracy, a P… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 14 pages, 13 figures

  19. arXiv:2208.12771  [pdf, other

    cs.LG cs.CV

    NeuralSI: Structural Parameter Identification in Nonlinear Dynamical Systems

    Authors: Xuyang Li, Hamed Bolandi, Talal Salem, Nizar Lajnef, Vishnu Naresh Boddeti

    Abstract: Structural monitoring for complex built environments often suffers from mismatch between design, laboratory testing, and actual built parameters. Additionally, real-world structural identification problems encounter many challenges. For example, the lack of accurate baseline models, high dimensionality, and complex multivariate partial differential equations (PDEs) pose significant difficulties in… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: ECCV 2022 Workshop on Computer Vision for Civil and Infrastructure Engineering

  20. arXiv:2208.07241  [pdf, other

    cs.CV cs.CR

    HEFT: Homomorphically Encrypted Fusion of Biometric Templates

    Authors: Luke Sperling, Nalini Ratha, Arun Ross, Vishnu Naresh Boddeti

    Abstract: This paper proposes a non-interactive end-to-end solution for secure fusion and matching of biometric templates using fully homomorphic encryption (FHE). Given a pair of encrypted feature vectors, we perform the following ciphertext operations, i) feature concatenation, ii) fusion and dimensionality reduction through a learned linear projection, iii) scale normalization to unit $\ell_2$-norm, and… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: IJCB 2022

  21. Towards Transmission-Friendly and Robust CNN Models over Cloud and Device

    Authors: Chuntao Ding, Zhichao Lu, Felix Juefei-Xu, Vishnu Naresh Boddeti, Yidong Li, Jiannong Cao

    Abstract: Deploying deep convolutional neural network (CNN) models on ubiquitous Internet of Things (IoT) devices has attracted much attention from industry and academia since it greatly facilitates our lives by providing various rapid-response services. Due to the limited resources of IoT devices, cloud-assisted training of CNN models has become the mainstream. However, most existing related works suffer f… ▽ More

    Submitted 13 December, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: IEEE Transactions on Mobile Computing

  22. arXiv:2204.00762  [pdf, other

    cs.CV

    Do learned representations respect causal relationships?

    Authors: Lan Wang, Vishnu Naresh Boddeti

    Abstract: Data often has many semantic attributes that are causally associated with each other. But do attribute-specific learned representations of data also respect the same causal relations? We answer this question in three steps. First, we introduce NCINet, an approach for observational causal discovery from high-dimensional data. It is trained purely on synthetically generated representations and can b… ▽ More

    Submitted 7 April, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

  23. arXiv:2112.00879  [pdf, other

    cs.CV

    Generating Diverse 3D Reconstructions from a Single Occluded Face Image

    Authors: Rahul Dey, Vishnu Naresh Boddeti

    Abstract: Occlusions are a common occurrence in unconstrained face images. Single image 3D reconstruction from such face images often suffers from corruption due to the presence of occlusions. Furthermore, while a plurality of 3D reconstructions is plausible in the occluded regions, existing approaches are limited to generating only a single solution. To address both of these challenges, we present Diverse3… ▽ More

    Submitted 31 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: CVPR 2022

  24. arXiv:2110.10395  [pdf, other

    cs.CV

    3DFaceFill: An Analysis-By-Synthesis Approach to Face Completion

    Authors: Rahul Dey, Vishnu Boddeti

    Abstract: Existing face completion solutions are primarily driven by end-to-end models that directly generate 2D completions of 2D masked faces. By having to implicitly account for geometric and photometric variations in facial shape and appearance, such approaches result in unrealistic completions, especially under large variations in pose, shape, illumination and mask sizes. To alleviate these limitations… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: Winter Conference on Applications of Computer Vision, WACV 2022

  25. arXiv:2109.05535  [pdf, other

    cs.LG

    Adversarial Representation Learning With Closed-Form Solvers

    Authors: Bashir Sadeghi, Lan Wang, Vishnu Naresh Boddeti

    Abstract: Adversarial representation learning aims to learn data representations for a target task while removing unwanted sensitive information at the same time. Existing methods learn model parameters iteratively through stochastic gradient descent-ascent, which is often unstable and unreliable in practice. To overcome this challenge, we adopt closed-form solvers for the adversary and target task. We mode… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

  26. arXiv:2109.03386  [pdf, other

    cs.LG

    On Characterizing the Trade-off in Invariant Representation Learning

    Authors: Bashir Sadeghi, Sepehr Dehdashtian, Vishnu Boddeti

    Abstract: Many applications of representation learning, such as privacy preservation, algorithmic fairness, and domain adaptation, desire explicit control over semantic information being discarded. This goal is formulated as satisfying two objectives: maximizing utility for predicting a target attribute while simultaneously being invariant (independent) to a known semantic attribute. Solutions to invariant… ▽ More

    Submitted 22 December, 2022; v1 submitted 7 September, 2021; originally announced September 2021.

  27. arXiv:2108.08617  [pdf, other

    cs.CV cs.LG

    Spatially-Adaptive Image Restoration using Distortion-Guided Networks

    Authors: Kuldeep Purohit, Maitreya Suin, A. N. Rajagopalan, Vishnu Naresh Boddeti

    Abstract: We present a general learning-based solution for restoring images suffering from spatially-varying degradations. Prior approaches are typically degradation-specific and employ the same processing across different images and different pixels within. However, we hypothesize that such spatially rigid processing is suboptimal for simultaneously restoring the degraded pixels as well as reconstructing t… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted at ICCV 2021

  28. arXiv:2007.10396  [pdf, other

    cs.CV cs.LG cs.NE

    NSGANetV2: Evolutionary Multi-Objective Surrogate-Assisted Neural Architecture Search

    Authors: Zhichao Lu, Kalyanmoy Deb, Erik Goodman, Wolfgang Banzhaf, Vishnu Naresh Boddeti

    Abstract: In this paper, we propose an efficient NAS algorithm for generating task-specific models that are competitive under multiple competing objectives. It comprises of two surrogates, one at the architecture level to improve sample efficiency and one at the weights level, through a supernet, to improve gradient descent training efficiency. On standard benchmark datasets (C10, C100, ImageNet), the resul… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: Accepted for oral presentation at ECCV 2020

  29. Neural Architecture Transfer

    Authors: Zhichao Lu, Gautam Sreekumar, Erik Goodman, Wolfgang Banzhaf, Kalyanmoy Deb, Vishnu Naresh Boddeti

    Abstract: Neural architecture search (NAS) has emerged as a promising avenue for automatically designing task-specific neural networks. Existing NAS approaches require one complete search for each deployment specification of hardware or objective. This is a computationally impractical endeavor given the potentially large number of application scenarios. In this paper, we propose Neural Architecture Transfer… ▽ More

    Submitted 21 March, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Code is available at https://github.com/human-analysis/neural-architecture-transfer

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021

  30. arXiv:2003.13880  [pdf, other

    cs.CV cs.LG cs.NE eess.IV

    MUXConv: Information Multiplexing in Convolutional Neural Networks

    Authors: Zhichao Lu, Kalyanmoy Deb, Vishnu Naresh Boddeti

    Abstract: Convolutional neural networks have witnessed remarkable improvements in computational efficiency in recent years. A key driving force has been the idea of trading-off model expressivity and efficiency through a combination of $1\times 1$ and depth-wise separable convolutions in lieu of a standard convolutional layer. The price of the efficiency, however, is the sub-optimal flow of information acro… ▽ More

    Submitted 7 April, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  31. arXiv:2003.12197  [pdf, other

    cs.CV cs.CR cs.LG

    HERS: Homomorphically Encrypted Representation Search

    Authors: Joshua J. Engelsma, Anil K. Jain, Vishnu Naresh Boddeti

    Abstract: We present a method to search for a probe (or query) image representation against a large gallery in the encrypted domain. We require that the probe and gallery images be represented in terms of a fixed-length representation, which is typical for representations obtained from learned networks. Our encryption scheme is agnostic to how the fixed-length representation is obtained and can therefore be… ▽ More

    Submitted 18 June, 2022; v1 submitted 26 March, 2020; originally announced March 2020.

    Comments: Published in the Trustworthy Biometrics Special Issue of IEEE Transactions on Biometrics, Behavior, and Identity Science 2021

  32. arXiv:1912.01369  [pdf, other

    cs.CV cs.LG cs.NE

    Multi-Objective Evolutionary Design of Deep Convolutional Neural Networks for Image Classification

    Authors: Zhichao Lu, Ian Whalen, Yashesh Dhebar, Kalyanmoy Deb, Erik Goodman, Wolfgang Banzhaf, Vishnu Naresh Boddeti

    Abstract: Early advancements in convolutional neural networks (CNNs) architectures are primarily driven by human expertise and by elaborate design processes. Recently, neural architecture search was proposed with the aim of automating the network design process and generating task-dependent architectures. While existing approaches have achieved competitive performance in image classification, they are not w… ▽ More

    Submitted 15 September, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Published in IEEE Transactions on Evolutionary Computation, 23 pages

  33. arXiv:1910.07423  [pdf, other

    cs.LG cs.CV stat.ML

    On the Global Optima of Kernelized Adversarial Representation Learning

    Authors: Bashir Sadeghi, Runyi Yu, Vishnu Naresh Boddeti

    Abstract: Adversarial representation learning is a promising paradigm for obtaining data representations that are invariant to certain sensitive attributes while retaining the information necessary for predicting target attributes. Existing approaches solve this problem through iterative adversarial minimax optimization and lack theoretical guarantees. In this paper, we first study the "linear" form of this… ▽ More

    Submitted 25 December, 2019; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: Accepted for publication at ICCV 2019. This version includes additional theoretical and experimental analysis. Minor update to the GMM experiment

  34. arXiv:1904.05514  [pdf, other

    cs.LG cs.CV stat.ML

    Mitigating Information Leakage in Image Representations: A Maximum Entropy Approach

    Authors: Proteek Chandan Roy, Vishnu Naresh Boddeti

    Abstract: Image recognition systems have demonstrated tremendous progress over the past few decades thanks, in part, to our ability of learning compact and robust representations of images. As we witness the wide spread adoption of these systems, it is imperative to consider the problem of unintended leakage of information from an image representation, which might compromise the privacy of the data owner. T… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted for oral presentation at CVPR 2019

  35. arXiv:1812.08196  [pdf, other

    cs.CV

    RankGAN: A Maximum Margin Ranking GAN for Generating Faces

    Authors: Rahul Dey, Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

    Abstract: We present a new stage-wise learning paradigm for training generative adversarial networks (GANs). The goal of our work is to progressively strengthen the discriminator and thus, the generators, with each subsequent stage without changing the network architecture. We call this proposed method the RankGAN. We first propose a margin-based loss for the GAN discriminator. We then extend it to a margin… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Best Student Paper Award at Asian Conference on Computer Vision (ACCV), 2018 at Perth, Australia. Includes main paper and supplementary material. Total 32 pages including references

  36. arXiv:1810.03522  [pdf, other

    cs.CV cs.LG cs.NE

    NSGA-Net: Neural Architecture Search using Multi-Objective Genetic Algorithm

    Authors: Zhichao Lu, Ian Whalen, Vishnu Boddeti, Yashesh Dhebar, Kalyanmoy Deb, Erik Goodman, Wolfgang Banzhaf

    Abstract: This paper introduces NSGA-Net -- an evolutionary approach for neural architecture search (NAS). NSGA-Net is designed with three goals in mind: (1) a procedure considering multiple and conflicting objectives, (2) an efficient procedure balancing exploration and exploitation of the space of potential neural network architectures, and (3) a procedure finding a diverse set of trade-off network archit… ▽ More

    Submitted 18 April, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: GECCO 2019

  37. arXiv:1806.01817  [pdf, other

    cs.CV cs.LG

    Perturbative Neural Networks

    Authors: Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

    Abstract: Convolutional neural networks are witnessing wide adoption in computer vision systems with numerous applications across a range of visual recognition tasks. Much of this progress is fueled through advances in convolutional neural network architectures and learning algorithms even as the basic premise of a convolutional layer has remained unchanged. In this paper, we seek to revisit the convolution… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

    Comments: To appear in CVPR 2018. http://xujuefei.com/pnn.html

  38. arXiv:1805.00577  [pdf, other

    cs.CV

    Secure Face Matching Using Fully Homomorphic Encryption

    Authors: Vishnu Naresh Boddeti

    Abstract: Face recognition technology has demonstrated tremendous progress over the past few years, primarily due to advances in representation learning. As we witness the widespread adoption of these systems, it is imperative to consider the security of face representations. In this paper, we explore the practicality of using a fully homomorphic encryption based framework to secure a database of face templ… ▽ More

    Submitted 13 July, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: BTAS 2018

  39. arXiv:1803.09672  [pdf, other

    cs.CV stat.ML

    On the Intrinsic Dimensionality of Image Representations

    Authors: Sixue Gong, Vishnu Naresh Boddeti, Anil K. Jain

    Abstract: This paper addresses the following questions pertaining to the intrinsic dimensionality of any given image representation: (i) estimate its intrinsic dimensionality, (ii) develop a deep neural network based non-linear mapping, dubbed DeepMDS, that transforms the ambient representation to the minimal intrinsic space, and (iii) validate the veracity of the mapping through image matching in the intri… ▽ More

    Submitted 10 April, 2019; v1 submitted 26 March, 2018; originally announced March 2018.

    Comments: Accepted for publication at CVPR 2019

  40. arXiv:1710.02277  [pdf, other

    cs.CV cs.LG stat.ML

    Efficient K-Shot Learning with Regularized Deep Networks

    Authors: Donghyun Yoo, Haoqi Fan, Vishnu Naresh Boddeti, Kris M. Kitani

    Abstract: Feature representations from pre-trained deep neural networks have been known to exhibit excellent generalization and utility across a variety of related tasks. Fine-tuning is by far the simplest and most widely used approach that seeks to exploit and adapt these feature representations to novel tasks with limited data. Despite the effectiveness of fine-tuning, itis often sub-optimal and requires… ▽ More

    Submitted 6 October, 2017; originally announced October 2017.

  41. arXiv:1709.10433  [pdf, other

    cs.CV stat.ML

    On the Capacity of Face Representation

    Authors: Sixue Gong, Vishnu Naresh Boddeti, Anil K. Jain

    Abstract: In this paper we address the following question, given a face representation, how many identities can it resolve? In other words, what is the capacity of the face representation? A scientific basis for estimating the capacity of a given face representation will not only benefit the evaluation and comparison of different representation methods, but will also establish an upper bound on the scalabil… ▽ More

    Submitted 11 April, 2019; v1 submitted 29 September, 2017; originally announced September 2017.

  42. arXiv:1707.05938  [pdf, other

    cs.CV

    Face Alignment Robust to Pose, Expressions and Occlusions

    Authors: Vishnu Naresh Boddeti, Myung-Cheol Roh, Jongju Shin, Takaharu Oguri, Takeo Kanade

    Abstract: We propose an Ensemble of Robust Constrained Local Models for alignment of faces in the presence of significant occlusions and of any unknown pose and expression. To account for partial occlusions we introduce, Robust Constrained Local Models, that comprises of a deformable shape and local landmark appearance model and reasons over binary occlusion labels. Our occlusion reasoning proceeds by a hyp… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

  43. arXiv:1704.04865  [pdf, other

    cs.CV cs.LG

    Gang of GANs: Generative Adversarial Networks with Maximum Margin Ranking

    Authors: Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

    Abstract: Traditional generative adversarial networks (GAN) and many of its variants are trained by minimizing the KL or JS-divergence loss that measures how close the generated data distribution is from the true data distribution. A recent advance called the WGAN based on Wasserstein distance can improve on the KL and JS-divergence based GANs, and alleviate the gradient vanishing, instability, and mode col… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

    Comments: 16 pages. 11 figures

  44. arXiv:1704.02203  [pdf, other

    cs.CV cs.CR

    Privacy-Preserving Visual Learning Using Doubly Permuted Homomorphic Encryption

    Authors: Ryo Yonetani, Vishnu Naresh Boddeti, Kris M. Kitani, Yoichi Sato

    Abstract: We propose a privacy-preserving framework for learning visual classifiers by leveraging distributed private image data. This framework is designed to aggregate multiple classifiers updated locally using private data and to ensure that no private information about the data is exposed during and after its learning procedure. We utilize a homomorphic cryptosystem that can aggregate the local classifi… ▽ More

    Submitted 28 July, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

    Comments: To appear in ICCV 2017

  45. arXiv:1701.08837  [pdf, other

    cs.LG cs.CV

    Emergence of Selective Invariance in Hierarchical Feed Forward Networks

    Authors: Dipan K. Pal, Vishnu Boddeti, Marios Savvides

    Abstract: Many theories have emerged which investigate how in- variance is generated in hierarchical networks through sim- ple schemes such as max and mean pooling. The restriction to max/mean pooling in theoretical and empirical studies has diverted attention away from a more general way of generating invariance to nuisance transformations. We con- jecture that hierarchically building selective invariance… ▽ More

    Submitted 30 January, 2017; originally announced January 2017.

  46. arXiv:1612.05234  [pdf, other

    cs.CV

    Visual Compiler: Synthesizing a Scene-Specific Pedestrian Detector and Pose Estimator

    Authors: Namhoon Lee, Xinshuo Weng, Vishnu Naresh Boddeti, Yu Zhang, Fares Beainy, Kris Kitani, Takeo Kanade

    Abstract: We introduce the concept of a Visual Compiler that generates a scene specific pedestrian detector and pose estimator without any pedestrian observations. Given a single image and auxiliary scene information in the form of camera parameters and geometric layout of the scene, the Visual Compiler first infers geometrically and photometrically accurate images of humans in that scene through the use of… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: submitted to CVPR 2017

  47. arXiv:1612.02889  [pdf, other

    cs.CV

    Gesture-based Bootstrapping for Egocentric Hand Segmentation

    Authors: Yubo Zhang, Vishnu Naresh Boddeti, Kris M. Kitani

    Abstract: Accurately identifying hands in images is a key sub-task for human activity understanding with wearable first-person point-of-view cameras. Traditional hand segmentation approaches rely on a large corpus of manually labeled data to generate robust hand detectors. However, these approaches still face challenges as the appearance of the hand varies greatly across users, tasks, environments or illumi… ▽ More

    Submitted 11 June, 2018; v1 submitted 8 December, 2016; originally announced December 2016.

  48. arXiv:1612.00478  [pdf, other

    cs.CV

    In Teacher We Trust: Learning Compressed Models for Pedestrian Detection

    Authors: Jonathan Shen, Noranart Vesdapunt, Vishnu N. Boddeti, Kris M. Kitani

    Abstract: Deep convolutional neural networks continue to advance the state-of-the-art in many domains as they grow bigger and more complex. It has been observed that many of the parameters of a large network are redundant, allowing for the possibility of learning a smaller network that mimics the outputs of the large network through a process called Knowledge Distillation. We show, however, that standard Kn… ▽ More

    Submitted 1 December, 2016; originally announced December 2016.

  49. arXiv:1608.06049  [pdf, other

    cs.LG cs.CV

    Local Binary Convolutional Neural Networks

    Authors: Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

    Abstract: We propose local binary convolution (LBC), an efficient alternative to convolutional layers in standard convolutional neural networks (CNN). The design principles of LBC are motivated by local binary patterns (LBP). The LBC layer comprises of a set of fixed sparse pre-defined binary convolutional filters that are not updated during the training process, a non-linear activation function and a set o… ▽ More

    Submitted 1 July, 2017; v1 submitted 22 August, 2016; originally announced August 2016.

    Comments: To appear in CVPR 2017 as Spotlight

  50. Zero-Aliasing Correlation Filters for Object Recognition

    Authors: Joseph A. Fernandez, Vishnu Naresh Boddeti, Andres Rodriguez, B. V. K. Vijaya Kumar

    Abstract: Correlation filters (CFs) are a class of classifiers that are attractive for object localization and tracking applications. Traditionally, CFs have been designed in the frequency domain using the discrete Fourier transform (DFT), where correlation is efficiently implemented. However, existing CF designs do not account for the fact that the multiplication of two DFTs in the frequency domain corresp… ▽ More

    Submitted 19 November, 2014; v1 submitted 9 November, 2014; originally announced November 2014.

    Comments: 14 pages, to appear in IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)