Skip to main content

Showing 1–50 of 90 results for author: Deng, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20188  [pdf, other

    cs.CV cs.GR

    SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid Registration

    Authors: Yuxin Yao, Bailin Deng, Junhui Hou, Juyong Zhang

    Abstract: Existing optimization-based methods for non-rigid registration typically minimize an alignment error metric based on the point-to-point or point-to-plane distance between corresponding point pairs on the source surface and target surface. However, these metrics can result in slow convergence or a loss of detail. In this paper, we propose SPARE, a novel formulation that utilizes a symmetrized point… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.17069  [pdf, other

    cs.CV cs.LG

    Training-free Editioning of Text-to-Image Models

    Authors: Jinqi Wang, Yunfei Fu, Zhangcan Ding, Bailin Deng, Yu-Kun Lai, Yipeng Qin

    Abstract: Inspired by the software industry's practice of offering different editions or versions of a product tailored to specific user groups or use cases, we propose a novel task, namely, training-free editioning, for text-to-image models. Specifically, we aim to create variations of a base text-to-image model without retraining, enabling the model to cater to the diverse needs of different user groups o… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2405.09883  [pdf, other

    cs.CV

    RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

    Authors: Xiaosu Zhu, Hualian Sheng, Sijia Cai, Bing Deng, Shaopeng Yang, Qiao Liang, Ken Chen, Lianli Gao, Jingkuan Song, Jieping Ye

    Abstract: We introduce RoScenes, the largest multi-view roadside perception dataset, which aims to shed light on the development of vision-centric Bird's Eye View (BEV) approaches for more challenging traffic scenes. The highlights of RoScenes include significantly large perception area, full scene coverage and crowded traffic. More specifically, our dataset achieves surprising 21.13M 3D annotations within… ▽ More

    Submitted 19 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Technical report. 32 pages, 21 figures, 13 tables. https://github.com/xiaosu-zhu/RoScenes

  4. arXiv:2405.07105  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG

    Overcoming systematic softening in universal machine learning interatomic potentials by fine-tuning

    Authors: Bowen Deng, Yunyeong Choi, Peichen Zhong, Janosh Riebesell, Shashwat Anand, Zhuohan Li, KyuJung Jun, Kristin A. Persson, Gerbrand Ceder

    Abstract: Machine learning interatomic potentials (MLIPs) have introduced a new paradigm for atomic simulations. Recent advancements have seen the emergence of universal MLIPs (uMLIPs) that are pre-trained on diverse materials datasets, providing opportunities for both ready-to-use universal force fields and robust foundations for downstream machine learning refinements. However, their performance in extrap… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  5. arXiv:2404.16647  [pdf

    physics.optics cs.LG

    Application of RESNET50 Convolution Neural Network for the Extraction of Optical Parameters in Scattering Media

    Authors: Bowen Deng, Yihan Zhang, Andrew Parkes, Alex Bentley, Amanda Wright, Michael Pound, Michael Somekh

    Abstract: Estimation of the optical properties of scattering media such as tissue is important in diagnostics as well as in the development of techniques to image deeper. As light penetrates the sample scattering events occur that alter the propagation direction of the photons in a random manner leading degradation of image quality. The distribution of the scattered light does, however, give a measure of th… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  6. arXiv:2404.09531  [pdf, other

    cs.CV cs.GR

    Oblique-MERF: Revisiting and Improving MERF for Oblique Photography

    Authors: Xiaoyi Zeng, Kaiwen Song, Leyuan Yang, Bailin Deng, Juyong Zhang

    Abstract: Neural implicit fields have established a new paradigm for scene representation, with subsequent work achieving high-quality real-time rendering. However, reconstructing 3D scenes from oblique aerial photography presents unique challenges, such as varying spatial scale distributions and a constrained range of tilt angles, often resulting in high memory consumption and reduced rendering quality at… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2403.17638  [pdf, other

    cs.CV

    Learning with Unreliability: Fast Few-shot Voxel Radiance Fields with Relative Geometric Consistency

    Authors: Yingjie Xu, Bangzhen Liu, Hao Tang, Bailin Deng, Shengfeng He

    Abstract: We propose a voxel-based optimization framework, ReVoRF, for few-shot radiance fields that strategically address the unreliability in pseudo novel view synthesis. Our method pivots on the insight that relative depth relationships within neighboring regions are more reliable than the absolute color values in disoccluded areas. Consequently, we devise a bilateral geometric consistency loss that care… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 final version

  8. arXiv:2402.16310  [pdf, other

    cs.LG cs.AI

    REPLAY: Modeling Time-Varying Temporal Regularities of Human Mobility for Location Prediction over Sparse Trajectories

    Authors: Bangchao Deng, Bingqing Qu, Pengyang Wang, Dingqi Yang, Benjamin Fankhauser, Philippe Cudre-Mauroux

    Abstract: Location prediction forecasts a user's location based on historical user mobility traces. To tackle the intrinsic sparsity issue of real-world user mobility traces, spatiotemporal contexts have been shown as significantly useful. Existing solutions mostly incorporate spatiotemporal distances between locations in mobility traces, either by feeding them as additional inputs to Recurrent Neural Netwo… ▽ More

    Submitted 6 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  9. arXiv:2401.13639  [pdf, other

    cs.GR

    Winding Clearness for Differentiable Point Cloud Optimization

    Authors: Dong Xiao, Yueji Ma, Zuoqiang Shi, Shiqing Xin, Wenping Wang, Bailin Deng, Bin Wang

    Abstract: We propose to explore the properties of raw point clouds through the \emph{winding clearness}, a concept we first introduce for assessing the clarity of the interior/exterior relationships represented by the winding number field of the point cloud. In geometric modeling, the winding number is a powerful tool for distinguishing the interior and exterior of a given surface $\partial Ω$, and it has b… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  10. arXiv:2401.12235  [pdf

    cs.LG eess.SY

    Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning

    Authors: Bairong Deng, Tao Yu, Zhenning Pan, Xuehan Zhang, Yufeng Wu, Qiaoyi Ding

    Abstract: Reinforcement learning is an emerging approaches to facilitate multi-stage sequential decision-making problems. This paper studies a real-time multi-stage stochastic power dispatch considering multivariate uncertainties. Current researches suffer from low generalization and practicality, that is, the learned dispatch policy can only handle a specific dispatch scenario, its performance degrades sig… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  11. arXiv:2312.16060  [pdf, ps, other

    cs.LG cs.NE math.DS

    Error-free Training for Artificial Neural Network

    Authors: Bo Deng

    Abstract: Conventional training methods for artificial neural network (ANN) models never achieve zero error rate systematically for large data. A new training method consists of three steps: first create an auxiliary data from conventionally trained parameters which correspond exactly to a global minimum for the loss function of the cloned data; second create a one-parameter homotopy (hybrid) of the auxilia… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 10 pages, 3 figures, Matlab mfiles available for online download

  12. arXiv:2312.03996  [pdf

    cs.CV

    Stable Diffusion for Data Augmentation in COCO and Weed Datasets

    Authors: Boyang Deng

    Abstract: Generative models have increasingly impacted relative tasks, from computer vision to interior design and other fields. Stable diffusion is an outstanding diffusion model that paves the way for producing high-resolution images with thorough details from text prompts or reference images. It will be an interesting topic about gaining improvements for small datasets with image-sparse categories. This… ▽ More

    Submitted 16 January, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  13. arXiv:2311.11120  [pdf

    cs.AI

    An Improved Neural Network Model Based On CNN Using For Fruit Sugar Degree Detection

    Authors: Boyang Deng, Xin Wen, Zhan Gao

    Abstract: Artificial Intelligence(AI) widely applies in Image Classification and Recognition, Text Understanding and Natural Language Processing, which makes great progress. In this paper, we introduced AI into the fruit quality detection field. We designed a fruit sugar degree regression model using an Artificial Neural Network based on spectra of fruits within the visible/near-infrared(V/NIR)range. After… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  14. arXiv:2311.10990  [pdf, other

    cs.CY cs.CR econ.GN q-fin.TR

    "Centralized or Decentralized?": Concerns and Value Judgments of Stakeholders in the Non-Fungible Tokens (NFTs) Market

    Authors: Yunpeng Xiao, Bufan Deng, Siqi Chen, Kyrie Zhixuan Zhou, Ray LC, Luyao Zhang, Xin Tong

    Abstract: Non-fungible tokens (NFTs) are decentralized digital tokens to represent the unique ownership of items. Recently, NFTs have been gaining popularity and at the same time bringing up issues, such as scams, racism, and sexism. Decentralization, a key attribute of NFT, contributes to some of the issues that are easier to regulate under centralized schemes, which are intentionally left out of the NFT m… ▽ More

    Submitted 21 November, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted by CSCW 2024

    ACM Class: J.4; K.4.1

  15. arXiv:2310.12505  [pdf, other

    cs.CL cs.CR cs.LG

    Attack Prompt Generation for Red Teaming and Defending Large Language Models

    Authors: Boyi Deng, Wenjie Wang, Fuli Feng, Yang Deng, Qifan Wang, Xiangnan He

    Abstract: Large language models (LLMs) are susceptible to red teaming attacks, which can induce LLMs to generate harmful content. Previous research constructs attack prompts via manual or automatic methods, which have their own limitations on construction cost and quality. To address these issues, we propose an integrated approach that combines manual and automatic methods to economically generate high-qual… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Findings)

  16. arXiv:2309.13631  [pdf, other

    cs.RO

    6-DOF All-Terrain Cyclocopter

    Authors: Jingwei Li, Boyuan Deng, Xinyu Zhang, Kangyao Huang

    Abstract: This paper presents the design of a 6-DOF all-terrain micro aerial vehicle and two control strategies for multimodal flight, which are experimentally validated. The micro aerial vehicle is propelled by four motors and controlled by a single servo for the control of the cycloidal rotors(cyclorotors) speed and lift direction. Despite the addition of the servo, the system remains underactuated. To ad… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  17. arXiv:2308.14920  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Matbench Discovery -- A framework to evaluate machine learning crystal stability predictions

    Authors: Janosh Riebesell, Rhys E. A. Goodall, Philipp Benner, Yuan Chiang, Bowen Deng, Alpha A. Lee, Anubhav Jain, Kristin A. Persson

    Abstract: Matbench Discovery simulates the deployment of machine learning (ML) energy models in a high-throughput search for stable inorganic crystals. We address the disconnect between (i) thermodynamic stability and formation energy and (ii) in-domain vs out-of-distribution performance. Alongside this paper, we publish a Python package to aid with future model submissions and a growing online leaderboard… ▽ More

    Submitted 4 February, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 31 pages, 18 figures, 4 tables

  18. arXiv:2308.10003  [pdf, other

    cs.CV cs.GR

    Efficient Multi-View Inverse Rendering Using a Hybrid Differentiable Rendering Method

    Authors: Xiangyang Zhu, Yiling Pan, Bailin Deng, Bin Wang

    Abstract: Recovering the shape and appearance of real-world objects from natural 2D images is a long-standing and challenging inverse rendering problem. In this paper, we introduce a novel hybrid differentiable rendering method to efficiently reconstruct the 3D geometry and reflectance of a scene from multi-view images captured by conventional hand-held cameras. Our method follows an analysis-by-synthesis a… ▽ More

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: IJCAI2023

  19. arXiv:2308.00640  [pdf, other

    cs.RO

    VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes

    Authors: Yuhao Lu, Yixuan Fan, Beixing Deng, Fangfu Liu, Yali Li, Shengjin Wang

    Abstract: Robotic grasping faces new challenges in human-robot-interaction scenarios. We consider the task that the robot grasps a target object designated by human's language directives. The robot not only needs to locate a target based on vision-and-language information, but also needs to predict the reasonable grasp pose candidate at various views and postures. In this work, we propose a novel interactiv… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures, IROS 2023

  20. arXiv:2307.14377  [pdf, other

    cs.CL cs.AI

    How Can Large Language Models Help Humans in Design and Manufacturing?

    Authors: Liane Makatura, Michael Foshey, Bohan Wang, Felix HähnLein, Pingchuan Ma, Bolei Deng, Megan Tjandrasuwita, Andrew Spielberg, Crystal Elaine Owens, Peter Yichen Chen, Allan Zhao, Amy Zhu, Wil J Norton, Edward Gu, Joshua Jacob, Yifei Li, Adriana Schulz, Wojciech Matusik

    Abstract: The advancement of Large Language Models (LLMs), including GPT-4, provides exciting new opportunities for generative design. We investigate the application of this tool across the entire design and manufacturing workflow. Specifically, we scrutinize the utility of LLMs in tasks such as: converting a text-based prompt into a design specification, transforming a design into manufacturing instruction… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  21. arXiv:2306.10548  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    MARBLE: Music Audio Representation Benchmark for Universal Evaluation

    Authors: Ruibin Yuan, Yinghao Ma, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Le Zhuo, Yiqi Liu, Jiawen Huang, Zeyue Tian, Binyue Deng, Ningzhi Wang, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Roger Dannenberg, Wenhu Chen, Gus Xia, Wei Xue, Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu

    Abstract: In the era of extensive intersection between art and Artificial Intelligence (AI), such as image generation and fiction co-creation, AI for music remains relatively nascent, particularly in music understanding. This is evident in the limited work on deep music representations, the scarcity of large-scale datasets, and the absence of a universal and community-driven benchmark. To address this issue… ▽ More

    Submitted 23 November, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: camera-ready version for NeurIPS 2023

  22. arXiv:2306.04001  [pdf, other

    cs.LG cs.AI eess.SP

    One-Dimensional Deep Image Prior for Curve Fitting of S-Parameters from Electromagnetic Solvers

    Authors: Sriram Ravula, Varun Gorti, Bo Deng, Swagato Chakraborty, James Pingenot, Bhyrav Mutnury, Doug Wallace, Doug Winterberg, Adam Klivans, Alexandros G. Dimakis

    Abstract: A key problem when modeling signal integrity for passive filters and interconnects in IC packages is the need for multiple S-parameter measurements within a desired frequency band to obtain adequate resolution. These samples are often computationally expensive to obtain using electromagnetic (EM) field solvers. Therefore, a common approach is to select a small subset of the necessary samples and u… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  23. arXiv:2305.19545  [pdf

    physics.chem-ph cs.CE cs.LG

    Catalysis distillation neural network for the few shot open catalyst challenge

    Authors: Bowen Deng

    Abstract: The integration of artificial intelligence and science has resulted in substantial progress in computational chemistry methods for the design and discovery of novel catalysts. Nonetheless, the challenges of electrocatalytic reactions and developing a large-scale language model in catalysis persist, and the recent success of ChatGPT's (Chat Generative Pre-trained Transformer) few-shot methods surpa… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 24 pages, 4 figures

  24. arXiv:2305.11092  [pdf, other

    cs.LG cs.CV

    Universal Domain Adaptation from Foundation Models: A Baseline Study

    Authors: Bin Deng, Kui Jia

    Abstract: Foundation models (e.g., CLIP or DINOv2) have shown their impressive learning and transfer capabilities in a wide range of visual tasks, by training on a large corpus of data and adapting to specific downstream tasks. It is, however, interesting that foundation models have not been fully explored for universal domain adaptation (UniDA), which is to learn models using labeled data in a source domai… ▽ More

    Submitted 2 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 27 pages

  25. arXiv:2304.14369  [pdf, other

    cs.LG cs.GR

    Learning Neural Constitutive Laws From Motion Observations for Generalizable PDE Dynamics

    Authors: Pingchuan Ma, Peter Yichen Chen, Bolei Deng, Joshua B. Tenenbaum, Tao Du, Chuang Gan, Wojciech Matusik

    Abstract: We propose a hybrid neural network (NN) and PDE approach for learning generalizable PDE dynamics from motion observations. Many NN approaches learn an end-to-end model that implicitly models both the governing PDE and constitutive models (or material models). Without explicit PDE knowledge, these approaches cannot guarantee physical correctness and have limited generalizability. We argue that the… ▽ More

    Submitted 15 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: Homepage: https://sites.google.com/view/nclaw

  26. arXiv:2304.13153  [pdf, other

    cs.CV cs.GR cs.LG

    LumiGAN: Unconditional Generation of Relightable 3D Human Faces

    Authors: Boyang Deng, Yifan Wang, Gordon Wetzstein

    Abstract: Unsupervised learning of 3D human faces from unstructured 2D image data is an active research area. While recent works have achieved an impressive level of photorealism, they commonly lack control of lighting, which prevents the generated assets from being deployed in novel environments. To this end, we introduce LumiGAN, an unconditional Generative Adversarial Network (GAN) for 3D human faces wit… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Project page: https://boyangdeng.com/projects/lumigan

  27. arXiv:2304.05176  [pdf

    cs.LG cs.AI

    Decoupling anomaly discrimination and representation learning: self-supervised learning for anomaly detection on attributed graph

    Authors: YanMing Hu, Chuan Chen, BoWen Deng, YuJing Lai, Hao Lin, ZiBin Zheng, Jing Bian

    Abstract: Anomaly detection on attributed graphs is a crucial topic for its practical application. Existing methods suffer from semantic mixture and imbalance issue because they mainly focus on anomaly discrimination, ignoring representation learning. It conflicts with the assortativity assumption that anomalous nodes commonly connect with normal nodes directly. Additionally, there are far fewer anomalous n… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  28. arXiv:2304.02163  [pdf, other

    cs.CV cs.AI cs.GR cs.RO

    GINA-3D: Learning to Generate Implicit Neural Assets in the Wild

    Authors: Bokui Shen, Xinchen Yan, Charles R. Qi, Mahyar Najibi, Boyang Deng, Leonidas Guibas, Yin Zhou, Dragomir Anguelov

    Abstract: Modeling the 3D world from sensor data for simulation is a scalable way of developing testing and validation environments for robotic learning problems such as autonomous driving. However, manually creating or re-creating real-world-like environments is difficult, expensive, and not scalable. Recent generative model techniques have shown promising progress to address such challenges by learning 3D… ▽ More

    Submitted 28 August, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023; Our WOD-ObjectAsset can be accessed through waymo.com/open

  29. arXiv:2302.14231  [pdf, other

    cond-mat.mtrl-sci cs.LG

    CHGNet: Pretrained universal neural network potential for charge-informed atomistic modeling

    Authors: Bowen Deng, Peichen Zhong, KyuJung Jun, Janosh Riebesell, Kevin Han, Christopher J. Bartel, Gerbrand Ceder

    Abstract: The simulation of large-scale systems with complex electron interactions remains one of the greatest challenges for the atomistic modeling of materials. Although classical force fields often fail to describe the coupling between electronic states and ionic rearrangements, the more accurate \textit{ab-initio} molecular dynamics suffers from computational complexity that prevents long-time and large… ▽ More

    Submitted 20 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  30. arXiv:2302.06070  [pdf, other

    cs.RO eess.SY

    Time-attenuating Twin Delayed DDPG Reinforcement Learning for Trajectory Tracking Control of Quadrotors

    Authors: Boyuan Deng, Jian Sun, Zhuo Li, Gang Wang

    Abstract: Continuous trajectory tracking control of quadrotors is complicated when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a Time-attenuating Twin Delayed DDPG, a model-free algorithm t… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  31. arXiv:2302.01078  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Computational Discovery of Microstructured Composites with Optimal Stiffness-Toughness Trade-Offs

    Authors: Beichen Li, Bolei Deng, Wan Shou, Tae-Hyun Oh, Yuanming Hu, Yiyue Luo, Liang Shi, Wojciech Matusik

    Abstract: The conflict between stiffness and toughness is a fundamental problem in engineering materials design. However, the systematic discovery of microstructured composites with optimal stiffness-toughness trade-offs has never been demonstrated, hindered by the discrepancies between simulation and reality and the lack of data-efficient exploration of the entire Pareto front. We introduce a generalizable… ▽ More

    Submitted 3 January, 2024; v1 submitted 31 January, 2023; originally announced February 2023.

  32. arXiv:2301.12643  [pdf, other

    cs.CV

    Adversarial Style Augmentation for Domain Generalization

    Authors: Yabin Zhang, Bin Deng, Ruihuang Li, Kui Jia, Lei Zhang

    Abstract: It is well-known that the performance of well-trained deep neural networks may degrade significantly when they are applied to data with even slightly shifted distributions. Recent studies have shown that introducing certain perturbation on feature statistics (\eg, mean and standard deviation) during training can enhance the cross-domain generalization ability. Existing methods typically conduct su… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

    Comments: Initially finished in March 2022; Code will be available at \url{https://github.com/YBZh/AdvStyle}

  33. arXiv:2212.05253  [pdf, other

    cs.CR

    Graph Analysis in Decentralized Online Social Networks with Fine-Grained Privacy Protection

    Authors: Lele Zheng, Bowen Deng, Tao Zhang, Yulong Shen, Yang Cao

    Abstract: Graph analysts cannot directly obtain the global structure in decentralized social networks, and analyzing such a network requires collecting local views of the social graph from individual users. Since the edges between users may reveal sensitive social interactions in the local view, applying differential privacy in the data collection process is often desirable, which provides strong and rigoro… ▽ More

    Submitted 10 December, 2022; originally announced December 2022.

  34. Point normal orientation and surface reconstruction by incorporating isovalue constraints to Poisson equation

    Authors: Dong Xiao, Zuoqiang Shi, Siyu Li, Bailin Deng, Bin Wang

    Abstract: Oriented normals are common pre-requisites for many geometric algorithms based on point clouds, such as Poisson surface reconstruction. However, it is not trivial to obtain a consistent orientation. In this work, we bridge orientation and reconstruction in the implicit space and propose a novel approach to orient point cloud normals by incorporating isovalue constraints to the Poisson equation. In… ▽ More

    Submitted 30 April, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted by Computer Aided Geometric Design from GMP 2023

  35. Counterfactual Supervision-based Information Bottleneck for Out-of-Distribution Generalization

    Authors: Bin Deng, Kui Jia

    Abstract: Learning invariant (causal) features for out-of-distribution (OOD) generalization has attracted extensive attention recently, and among the proposals invariant risk minimization (IRM) is a notable solution. In spite of its theoretical promise for linear regression, the challenges of using IRM in linear classification problems remain. By introducing the information bottleneck (IB) principle into th… ▽ More

    Submitted 16 January, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: Theoretical Understanding of OOD Generalization

  36. arXiv:2207.09332  [pdf, other

    cs.CV

    Rethinking IoU-based Optimization for Single-stage 3D Object Detection

    Authors: Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao, Gim Hee Lee

    Abstract: Since Intersection-over-Union (IoU) based optimization maintains the consistency of the final IoU prediction metric and losses, it has been widely used in both regression and classification branches of single-stage 2D object detectors. Recently, several 3D object detection methods adopt IoU-based optimization and directly replace the 2D IoU with 3D IoU. However, such a direct computation in 3D is… ▽ More

    Submitted 20 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV2022. The code is available at https://github.com/hlsheng1/RDIoU

  37. arXiv:2206.11141  [pdf, other

    cs.RO cs.AI cs.CV

    Hybrid Physical Metric For 6-DoF Grasp Pose Detection

    Authors: Yuhao Lu, Beixing Deng, Zhenyu Wang, Peiyuan Zhi, Yali Li, Shengjin Wang

    Abstract: 6-DoF grasp pose detection of multi-grasp and multi-object is a challenge task in the field of intelligent robot. To imitate human reasoning ability for grasping objects, data driven methods are widely studied. With the introduction of large-scale datasets, we discover that a single physical metric usually generates several discrete levels of grasp confidence scores, which cannot finely distinguis… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 7 pages, 7 figures, accepted by ICRA 2022

  38. arXiv:2206.05730  [pdf, other

    cs.CV

    Object Occlusion of Adding New Categories in Objection Detection

    Authors: Boyang Deng, Meiyan Lin, Shoulun Long

    Abstract: Building instance detection models that are data efficient and can handle rare object categories is an important challenge in computer vision. But data collection methods and metrics are lack of research towards real scenarios application using neural network. Here, we perform a systematic study of the Object Occlusion data collection and augmentation methods where we imitate object occlusion rela… ▽ More

    Submitted 14 June, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

  39. arXiv:2206.03410  [pdf, other

    cs.CV cs.GR

    Fast and Robust Non-Rigid Registration Using Accelerated Majorization-Minimization

    Authors: Yuxin Yao, Bailin Deng, Weiwei Xu, Juyong Zhang

    Abstract: Non-rigid 3D registration, which deforms a source 3D shape in a non-rigid way to align with a target 3D shape, is a classical problem in computer vision. Such problems can be challenging because of imperfect data (noise, outliers and partial overlap) and high degrees of freedom. Existing methods typically adopt the $\ell_p$ type robust norm to measure the alignment error and regularize the smoothn… ▽ More

    Submitted 19 February, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  40. arXiv:2206.02997  [pdf, ps, other

    cs.CV

    TadML: A fast temporal action detection with Mechanics-MLP

    Authors: Bowen Deng, Dongchang Liu

    Abstract: Temporal Action Detection(TAD) is a crucial but challenging task in video understanding.It is aimed at detecting both the type and start-end frame for each action instance in a long, untrimmed video.Most current models adopt both RGB and Optical-Flow streams for the TAD task. Thus, original RGB frames must be converted manually into Optical-Flow frames with additional computation and time cost, wh… ▽ More

    Submitted 2 February, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 8 pages,3 figures

  41. arXiv:2203.15958  [pdf, other

    cs.CV cs.AI

    High-resolution Face Swapping via Latent Semantics Disentanglement

    Authors: Yangyang Xu, Bailin Deng, Junle Wang, Yanqing Jing, Jia Pan, Shengfeng He

    Abstract: We present a novel high-resolution face swapping method using the inherent prior knowledge of a pre-trained GAN model. Although previous research can leverage generative priors to produce high-resolution results, their quality can suffer from the entangled semantics of the latent space. We explicitly disentangle the latent semantics by utilizing the progressive nature of the generator, deriving st… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Paper is Acctpted by CVPR2022

  42. arXiv:2203.07858  [pdf, other

    cs.CV cs.GR cs.LG

    A Survey of Non-Rigid 3D Registration

    Authors: Bailin Deng, Yuxin Yao, Roberto M. Dyke, Juyong Zhang

    Abstract: Non-rigid registration computes an alignment between a source surface with a target surface in a non-rigid manner. In the past decade, with the advances in 3D sensing technologies that can measure time-varying surfaces, non-rigid registration has been applied for the acquisition of deformable shapes and has a wide range of applications. This survey presents a comprehensive review of non-rigid regi… ▽ More

    Submitted 16 March, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: Accepted to Eurographics 2022 State-of-the-Art Reports

  43. arXiv:2203.07116  [pdf, other

    cs.CV

    Deep Transformers Thirst for Comprehensive-Frequency Data

    Authors: Rui Xia, Chao Xue, Boyu Deng, Fang Wang, Jingchao Wang

    Abstract: Current researches indicate that inductive bias (IB) can improve Vision Transformer (ViT) performance. However, they introduce a pyramid structure concurrently to counteract the incremental FLOPs and parameters caused by introducing IB. This structure destroys the unification of computer vision and natural language processing (NLP) and complicates the model. We study an NLP model called LSRA, whic… ▽ More

    Submitted 17 November, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: 7 pages, 10 figures

  44. arXiv:2201.09367  [pdf, other

    cs.GR cs.CV cs.LG

    Sketch2PQ: Freeform Planar Quadrilateral Mesh Design via a Single Sketch

    Authors: Zhi Deng, Yang Liu, Hao Pan, Wassim Jabi, Juyong Zhang, Bailin Deng

    Abstract: The freeform architectural modeling process often involves two important stages: concept design and digital modeling. In the first stage, architects usually sketch the overall 3D shape and the panel layout on a physical or digital paper briefly. In the second stage, a digital 3D model is created using the sketch as a reference. The digital model needs to incorporate geometric requirements for its… ▽ More

    Submitted 25 April, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: To appear in IEEE Transactions on Visualization and Computer Graphics

  45. arXiv:2201.09329  [pdf, other

    cs.LG cond-mat.mtrl-sci

    ULSA: Unified Language of Synthesis Actions for Representation of Synthesis Protocols

    Authors: Zheren Wang, Kevin Cruse, Yuxing Fei, Ann Chia, Yan Zeng, Haoyan Huo, Tanjin He, Bowen Deng, Olga Kononova, Gerbrand Ceder

    Abstract: Applying AI power to predict syntheses of novel materials requires high-quality, large-scale datasets. Extraction of synthesis information from scientific publications is still challenging, especially for extracting synthesis actions, because of the lack of a comprehensive labeled dataset using a solid, robust, and well-established ontology for describing synthesis procedures. In this work, we pro… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

  46. arXiv:2112.07787  [pdf, other

    cs.CV cs.RO

    Revisiting 3D Object Detection From an Egocentric Perspective

    Authors: Boyang Deng, Charles R. Qi, Mahyar Najibi, Thomas Funkhouser, Yin Zhou, Dragomir Anguelov

    Abstract: 3D object detection is a key module for safety-critical robotics applications such as autonomous driving. For these applications, we care most about how the detections affect the ego-agent's behavior and safety (the egocentric perspective). Intuitively, we seek more accurate descriptions of object geometry when it's more likely to interfere with the ego-agent's motion trajectory. However, current… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: Published in NeurIPS 2021

  47. arXiv:2111.03195  [pdf, other

    cs.CV

    Addressing Multiple Salient Object Detection via Dual-Space Long-Range Dependencies

    Authors: Bowen Deng, Andrew P. French, Michael P. Pound

    Abstract: Salient object detection plays an important role in many downstream tasks. However, complex real-world scenes with varying scales and numbers of salient objects still pose a challenge. In this paper, we directly address the problem of detecting multiple salient objects across complex scenes. We propose a network architecture incorporating non-local feature information in both the spatial and chann… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 10 pages, 9 figures

  48. arXiv:2108.13821  [pdf, other

    cs.GR

    GeodesicEmbedding (GE): A High-Dimensional Embedding Approach for Fast Geodesic Distance Queries

    Authors: Qianwei Xia, Juyong Zhang, Zheng Fang, Jin Li, Mingyue Zhang, Bailin Deng, Ying He

    Abstract: In this paper, we develop a novel method for fast geodesic distance queries. The key idea is to embed the mesh into a high-dimensional space, such that the Euclidean distance in the high-dimensional space can induce the geodesic distance in the original manifold surface. However, directly solving the high-dimensional embedding problem is not feasible due to the large number of variables and the fa… ▽ More

    Submitted 1 September, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Presented at Computational Visual Media 2021; to be published in IEEE Transactions on Visualization and Computer Graphics

  49. arXiv:2108.11682  [pdf, other

    cs.CV

    A Robust Loss for Point Cloud Registration

    Authors: Zhi Deng, Yuxin Yao, Bailin Deng, Juyong Zhang

    Abstract: The performance of surface registration relies heavily on the metric used for the alignment error between the source and target shapes. Traditionally, such a metric is based on the point-to-point or point-to-plane distance from the points on the source surface to their closest points on the target surface, which is susceptible to failure due to instability of the closest-point correspondence. In t… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  50. arXiv:2108.10723  [pdf, other

    cs.CV

    Improving 3D Object Detection with Channel-wise Transformer

    Authors: Hualian Sheng, Sijia Cai, Yuan Liu, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao

    Abstract: Though 3D object detection from point clouds has achieved rapid progress in recent years, the lack of flexible and high-performance proposal refinement remains a great hurdle for existing state-of-the-art two-stage detectors. Previous works on refining 3D proposals have relied on human-designed components such as keypoints sampling, set abstraction and multi-scale feature fusion to produce powerfu… ▽ More

    Submitted 14 September, 2021; v1 submitted 22 August, 2021; originally announced August 2021.

    Comments: Accepted by ICCV2021