Showing 1–24 of 24 results for author: Oh, D

Searching in archive cs.
  1. arXiv:2403.02944  [pdf, other]

    cs.CV cs.LG

    Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity

    Authors: Hagyeong Lee, Minkyu Kim, Jun-Hyuk Kim, Seungeon Kim, Dokwan Oh, Jaeho Lee

    Abstract: Recent advances in text-guided image compression have shown great potential to enhance the perceptual quality of reconstructed images. These methods, however, tend to have significantly degraded pixel-wise fidelity, limiting their practicality. To fill this gap, we develop a new text-guided image compression algorithm that achieves both high perceptual and pixel-wise fidelity. In particular, we pr…

    Submitted 21 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: The first two authors contributed equally

  2. arXiv:2402.02350  [pdf, other]

    cs.NI cs.LG

    Interference-Aware Emergent Random Access Protocol for Downlink LEO Satellite Networks

    Authors: Chang-Yong Lim, Jihong Park, Jinho Choi, Ju-Hyung Lee, Daesub Oh, Heewook Kim

    Abstract: In this article, we propose a multi-agent deep reinforcement learning (MADRL) framework to train a multiple access protocol for downlink low earth orbit (LEO) satellite networks. By improving the existing learned protocol, emergent random access channel (eRACH), our proposed method, coined centralized and compressed emergent signaling for eRACH (Ce2RACH), can mitigate inter-satellite interference…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 2 pages, 4 figures, 1 table; submitted to IEEE for possible publication

  3. arXiv:2312.12392  [pdf, other]

    cs.GR

    Recursive Camera Painting: A Method for Real-Time Painterly Renderings of 3D Scenes

    Authors: Ergun Akleman, Cassie Mullins, Christopher Morrison, David Oh

    Abstract: In this work, we present the recursive camera-painting approach to obtain painterly smudging in real-time rendering applications. We have implemented recursive camera painting as both a GPU-based ray-tracing and in a Virtual Reality game environment. Using this approach, we can obtain dynamic 3D Paintings in real-time. In a camera painting, each pixel has a separate associated camera whose paramet…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 10 pages

  4. arXiv:2308.15791  [pdf, other]

    cs.CV eess.IV

    Neural Video Compression with Temporal Layer-Adaptive Hierarchical B-frame Coding

    Authors: Yeongwoong Kim, Suyong Bahk, Seungeon Kim, Won Hee Lee, Dokwan Oh, Hui Yong Kim

    Abstract: Neural video compression (NVC) is a rapidly evolving video coding research area, with some models achieving superior coding efficiency compared to the latest video coding standard Versatile Video Coding (VVC). In conventional video coding standards, the hierarchical B-frame coding, which utilizes a bidirectional prediction structure for higher compression, had been well-studied and exploited. In N…

    Submitted 5 September, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  5. arXiv:2302.14273  [pdf, other]

    cs.RO eess.SY

    QP Chaser: Polynomial Trajectory Generation for Autonomous Aerial Tracking

    Authors: Yunwoo Lee, Jungwon Park, Seungwoo Jung, Boseong Jeon, Dahyun Oh, H. Jin Kim

    Abstract: Maintaining the visibility of the targets is one of the major objectives of aerial tracking applications. This paper proposes QP Chaser, a trajectory planning pipeline that can enhance the visibility of single- and dual-target in both static and dynamic environments. As the name suggests, the proposed planner generates a target-visible trajectory via quadratic programming problems. First, the pred…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 15 pages, 13 figures

  6. arXiv:2301.08078  [pdf, other]

    cs.RO eess.SY

    Stable Contact Guaranteeing Motion/Force Control for an Aerial Manipulator on an Arbitrarily Tilted Surface

    Authors: Jeonghyun Byun, Byeongjun Kim, Changhyeon Kim, Donggeon David Oh, H. Jin Kim

    Abstract: This study aims to design a motion/force controller for an aerial manipulator which guarantees the tracking of time-varying motion/force trajectories as well as the stability during the transition between free and contact motions. To this end, we model the force exerted on the end-effector as the Kelvin-Voigt linear model and estimate its parameters by recursive least-squares estimator. Then, the…

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: to be presented in 2023 IEEE International Conference on Robotics and Automation (ICRA), London, United Kingdom, 2023

  7. arXiv:2212.07026  [pdf, other]

    cs.LG cs.CV

    Improving group robustness under noisy labels using predictive uncertainty

    Authors: Dongpin Oh, Dae Lee, Jeunghyun Byun, Bonggun Shin

    Abstract: The standard empirical risk minimization (ERM) can underperform on certain minority groups (i.e., waterbirds in lands or landbirds in water) due to the spurious correlation between the input and its label. Several studies have improved the worst-group accuracy by focusing on the high-loss samples. The hypothesis behind this is that such high-loss samples are spurious-cue-free (SCF) sample…

    Submitted 13 December, 2022; originally announced December 2022.

  8. arXiv:2209.05972  [pdf, other]

    cs.CL cs.AI

    Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Pooling

    Authors: Dongsuk Oh, Yejin Kim, Hodong Lee, H. Howie Huang, Heuiseok Lim

    Abstract: Recent pre-trained language models (PLMs) achieved great success on many natural language processing tasks through learning linguistic features and contextualized sentence representation. Since attributes captured in stacked layers of PLMs are not clearly identified, straightforward approaches such as embedding the last layer are commonly preferred to derive sentence representations from PLMs. Thi…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022

  9. arXiv:2112.09368  [pdf, other]

    cs.LG stat.ML

    Improving evidential deep learning via multi-task learning

    Authors: Dongpin Oh, Bonggun Shin

    Abstract: The Evidential regression network (ENet) estimates a continuous target and its predictive uncertainty without costly Bayesian model averaging. However, it is possible that the target is inaccurately predicted due to the gradient shrinkage problem of the original loss function of the ENet, the negative log marginal likelihood (NLL) loss. In this paper, the objective is to improve the prediction acc…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI-2022

  10. arXiv:2112.08619  [pdf, other]

    cs.CL cs.AI

    Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

    Authors: Yoonna Jang, Jungwoo Lim, Yuna Hur, Dongsuk Oh, Suhyune Son, Yeonsoo Lee, Donghoon Shin, Seungryong Kim, Heuiseok Lim

    Abstract: Humans usually have conversations by making use of prior knowledge about a topic and background information of the people whom they are talking to. However, existing conversational agents and datasets do not consider such comprehensive information, and thus they have a limitation in generating the utterances where the knowledge and persona are fused properly. To address this issue, we introduce a…

    Submitted 16 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted paper at the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

  11. arXiv:2109.09041  [pdf, other]

    cs.RO eess.SY

    Online Distributed Trajectory Planning for Quadrotor Swarm with Feasibility Guarantee using Linear Safe Corridor

    Authors: Jungwon Park, Dabin Kim, Gyeong Chan Kim, Dahyun Oh, H. Jin Kim

    Abstract: This paper presents a new online multi-agent trajectory planning algorithm that guarantees to generate safe, dynamically feasible trajectories in a cluttered environment. The proposed algorithm utilizes a linear safe corridor (LSC) to formulate the distributed trajectory optimization problem with only feasible constraints, so it does not resort to slack variables or soft constraints to avoid optim…

    Submitted 3 January, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: 8 pages, RA-L 2022 under review

  12. arXiv:2107.00844  [pdf, ps, other]

    cs.LG physics.data-an

    Deep learning-based statistical noise reduction for multidimensional spectral data

    Authors: Younsik Kim, Dongjin Oh, Soonsang Huh, Dongjoon Song, Sunbeom Jeong, Junyoung Kwon, Minsoo Kim, Donghan Kim, Hanyoung Ryu, Jongkeun Jung, Wonshik Kyung, Byungmin Sohn, Suyoung Lee, Jounghoon Hyun, Yeonghoon Lee, Yeongkwan Kim, Changyoung Kim

    Abstract: In spectroscopic experiments, data acquisition in multi-dimensional phase space may require long acquisition time, owing to the large phase space volume to be covered. In such case, the limited time available for data acquisition can be a serious constraint for experiments in which multidimensional spectral data are acquired. Here, taking angle-resolved photoemission spectroscopy (ARPES) as an exa…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 8 pages, 8 figures

    Journal ref: Review of Scientific Instruments 92, 073901 (2021)

  13. arXiv:2105.01869  [pdf, other]

    cs.LG cs.IT

    Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression

    Authors: Baeseong Park, Se Jung Kwon, Daehwan Oh, Byeongwook Kim, Dongsoo Lee

    Abstract: Even though fine-grained pruning techniques achieve a high compression ratio, conventional sparsity representations (such as CSR) associated with irregular sparsity degrade parallelism significantly. Practical pruning methods, thus, usually lower pruning rates (by structured pruning) to improve parallelism. In this paper, we study fixed-to-fixed (lossless) encoding architecture/algorithm to suppor…

    Submitted 30 January, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: ICLR 2022 Accepted

  14. arXiv:2105.01868  [pdf, ps, other]

    cs.LG math.OC

    Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization

    Authors: Byeongwook Kim, Dongsoo Lee, Yeonju Ro, Yongkweon Jeon, Se Jung Kwon, Baeseong Park, Daehwan Oh

    Abstract: Various post-training uniform quantization methods have usually been studied based on convex optimization. As a result, most previous ones rely on the quantization error minimization and/or quadratic approximations. Such approaches are computationally efficient and reasonable when a large number of quantization bits are employed. When the number of quantization bits is relatively low, however, non…

    Submitted 5 May, 2021; originally announced May 2021.

  15. arXiv:2103.01152  [pdf, other]

    cond-mat.supr-con cs.GR

    PHIDL: Python CAD layout and geometry creation for nanolithography

    Authors: A. N. McCaughan, A. M. Tait, S. M. Buckley, D. M. Oh, J. T. Chiles, J. M. Shainline, S. W. Nam

    Abstract: Computer-aided design (CAD) has become a critical element in the creation of nanopatterned structures and devices. In particular, with the increased adoption of easy-to-learn programming languages like Python there has been a significant rise in the amount of lithographic geometries generated through scripting and programming. However, there are currently unaddressed gaps in usability for open-sou…

    Submitted 1 March, 2021; originally announced March 2021.

    Journal ref: J. Vac. Sci. Technol. B 39, 062601 (2021)

  16. arXiv:2011.06769  [pdf]

    cs.LG nlin.CD

    Toward the Fully Physics-Informed Echo State Network -- an ODE Approximator Based on Recurrent Artificial Neurons

    Authors: Dong Keun Oh

    Abstract: Inspired by recent theoretical arguments, physics-informed echo state network (ESN) is discussed on the attempt to train a reservoir model absolutely in physics-informed manner. As the plainest work on such a purpose, an ODE (ordinary differential equation) approximator is designed to replicate the solution in sequence with respect to the recurrent evaluations. On the principal invariance of diffe…

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 30 pages, 12 figures, research paper

    MSC Class: 68T27; 65L99

    ACM Class: I.2.8; J.2

  17. arXiv:2011.00766  [pdf, other]

    cs.CL cs.AI

    I Know What You Asked: Graph Path Learning using AMR for Commonsense Reasoning

    Authors: Jungwoo Lim, Dongsuk Oh, Yoonna Jang, Kisu Yang, Heuiseok Lim

    Abstract: CommonsenseQA is a task in which a correct answer is predicted through commonsense reasoning with pre-defined knowledge. Most previous works have aimed to improve the performance with distributed representation without considering the process of predicting the answer from the semantic representation of the question. To shed light upon the semantic interpretation of the question, we propose an AMR-…

    Submitted 5 November, 2020; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted to COLING 2020

  18. arXiv:2009.04703  [pdf, other]

    cs.CL cs.AI

    Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection

    Authors: Taesun Whang, Dongyub Lee, Dongsuk Oh, Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

    Abstract: In this paper, we study the task of selecting the optimal response given a user and system utterance history in retrieval-based multi-turn dialog systems. Recently, pre-trained language models (e.g., BERT, RoBERTa, and ELECTRA) showed significant improvements in various natural language processing tasks. This and similar response selection tasks can also be solved using such language models by for…

    Submitted 16 December, 2020; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: Accepted to AAAI 2021

  19. arXiv:2006.16166  [pdf, other]

    cs.CV

    Automatic Operating Room Surgical Activity Recognition for Robot-Assisted Surgery

    Authors: Aidean Sharghi, Helene Haugerud, Daniel Oh, Omid Mohareri

    Abstract: Automatic recognition of surgical activities in the operating room (OR) is a key technology for creating next generation intelligent surgical devices and workflow monitoring/support systems. Such systems can potentially enhance efficiency in the OR, resulting in lower costs and improved care delivery to the patients. In this paper, we investigate automatic surgical activity recognition in robot-as…

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'20)

  20. arXiv:1911.04015   

    cs.CL

    Word Sense Disambiguation using Knowledge-based Word Similarity

    Authors: Sunjae Kwon, Dongsuk Oh, Youngjoong Ko

    Abstract: In natural language processing, word-sense disambiguation (WSD) is an open problem concerned with identifying the correct sense of words in a particular context. To address this problem, we introduce a novel knowledge-based WSD system. We suggest the adoption of two methods in our system. First, we suggest a novel method to encode the word vector representation by considering the graphical semanti…

    Submitted 21 June, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Comments: Since we changed some hyper-parameters, experimental results must be changed. We will resubmit with the retest results

  21. arXiv:1908.04812  [pdf, other]

    cs.CL cs.LG

    An Effective Domain Adaptive Post-Training Method for BERT in Response Selection

    Authors: Taesun Whang, Dongyub Lee, Chanhee Lee, Kisu Yang, Dongsuk Oh, HeuiSeok Lim

    Abstract: We focus on multi-turn response selection in a retrieval-based dialog system. In this paper, we utilize the powerful pre-trained language model Bi-directional Encoder Representations from Transformer (BERT) for a multi-turn dialog system and propose a highly effective post-training method on domain-specific corpus. Although BERT is easily adopted to various NLP tasks and outperforms previous basel…

    Submitted 26 July, 2020; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: INTERSPEECH 2020

  22. arXiv:1811.02628  [pdf, other]

    cs.CV cs.LG stat.ML

    Learning Bone Suppression from Dual Energy Chest X-rays using Adversarial Networks

    Authors: Dong Yul Oh, Il Dong Yun

    Abstract: Suppressing bones on chest X-rays such as ribs and clavicle is often expected to improve pathologies classification. These bones can interfere with a broad range of diagnostic tasks on pulmonary disease except for musculoskeletal system. Current conventional method for acquisition of bone suppressed X-rays is dual energy imaging, which captures two radiographs at a very short interval with differe…

    Submitted 4 November, 2018; originally announced November 2018.

  23. arXiv:1003.4057  [pdf, ps, other]

    cs.IT

    Construction of optimal codes in deletion and insertion metric

    Authors: Hyun Kwang Kim, Joon Yop Lee, Dong Yeol Oh

    Abstract: We improve Levenshtein's upper bound for the cardinality of a code of length four that is capable of correcting single deletions over an alphabet of even size. We also illustrate that the new upper bound is sharp. Furthermore we construct an optimal perfect code that is capable of correcting single deletions for the same parameters.

    Submitted 22 March, 2010; originally announced March 2010.

    Comments: 20 pages, The material of this paper was presented in part at the 10th International Workshop on Algebraic and Combinatorial Coding Theory, Zvenigorod, Russia, September 2006

  24. arXiv:0810.3729   

    cs.IT cs.DM math.CO

    Optimal codes in deletion and insertion metric

    Authors: Hyun Kwang Kim, Joon Yop Lee, Dong Yeol Oh

    Abstract: We improve the upper bound of Levenshtein for the cardinality of a code of length 4 capable of correcting single deletions over an alphabet of even size. We also illustrate that the new upper bound is sharp. Furthermore we will construct an optimal perfect code capable of correcting single deletions for the same parameters.

    Submitted 22 March, 2010; v1 submitted 20 October, 2008; originally announced October 2008.

    Comments: 19 pages, The material of this paper was presented in part at the 10th International Workshop on Algebraic and Combinatorial Coding Theory, Zvenigorod, Russia, September 2006

    MSC Class: 94B60