Showing 1–50 of 63 results for author: Chung, T

Searching in archive cs.
  1. arXiv:2404.16212  [pdf, other]

    cs.CR cs.CV cs.LG

    An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape

    Authors: Sifat Muhammad Abdullah, Aravind Cheruvu, Shravya Kanchi, Taejoong Chung, Peng Gao, Murtuza Jadliwala, Bimal Viswanath

    Abstract: Deepfake or synthetic images produced using deep generative models pose serious risks to online platforms. This has triggered several research efforts to accurately detect deepfake images, achieving excellent performance on publicly available deepfake datasets. In this work, we study 8 state-of-the-art detectors and argue that they are far from being ready for deployment due to two recent developm…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted to IEEE S&P 2024; 19 pages, 10 figures

  2. arXiv:2404.14563  [pdf, other]

    cs.HC

    Exploring Algorithmic Explainability: Generating Explainable AI Insights for Personalized Clinical Decision Support Focused on Cannabis Intoxication in Young Adults

    Authors: Tongze Zhang, Tammy Chung, Anind Dey, Sang Won Bae

    Abstract: This study explores the possibility of facilitating algorithmic decision-making by combining interpretable artificial intelligence (XAI) techniques with sensor data, with the aim of providing researchers and clinicians with personalized analyses of cannabis intoxication behavior. SHAP analyzes the importance and quantifies the impact of specific factors such as environmental noise or heart rate, e…

    Submitted 29 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 2024 International Conference on Activity and Behavior Computing

  3. arXiv:2403.12945  [pdf, other]

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park, et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  4. arXiv:2403.10774  [pdf, other]

    cs.CL

    Detecting Bias in Large Language Models: Fine-tuned KcBERT

    Authors: J. K. Lee, T. M. Chung

    Abstract: The rapid advancement of large language models (LLMs) has enabled natural language processing capabilities similar to those of humans, and LLMs are being widely utilized across various societal domains such as education and healthcare. While the versatility of these models has increased, they have the potential to generate subjective and normative language, leading to discriminatory treatment or o…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 14 pages, 5 figures

  5. arXiv:2403.10764  [pdf, other]

    cs.CL cs.AI

    ECRC: Emotion-Causality Recognition in Korean Conversation for GCN

    Authors: J. K. Lee, T. M. Chung

    Abstract: In this multi-task learning study on simultaneous analysis of emotions and their underlying causes in conversational contexts, deep neural network methods were employed to effectively process and train large labeled datasets. However, these approaches are typically limited to conducting context analyses across the entire corpus because they rely on one of the two methods: word- or sentence-level e…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 10 pages, 5 figures

  6. MENTOR: Multilingual tExt detectioN TOward leaRning by analogy

    Authors: Hsin-Ju Lin, Tsu-Chun Chung, Ching-Chun Hsiao, Pin-Yu Chen, Wei-Chen Chiu, Ching-Chun Huang

    Abstract: Text detection is frequently used in vision-based mobile robots when they need to interpret texts in their surroundings to perform a given task. For instance, delivery robots in multilingual cities need to be capable of doing multilingual text detection so that the robots can read traffic signs and road markings. Moreover, the target languages change from region to region, implying the need of eff…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures, published to IROS 2023

    Journal ref: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 2023, pp. 3248-3255

  7. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Machel Reid, Nikolay Savinov, Denis Teplyashin, Dmitry Lepikhin, Timothy Lillicrap, Jean-baptiste Alayrac, Radu Soricut, Angeliki Lazaridou, Orhan Firat, Julian Schrittwieser, Ioannis Antonoglou, Rohan Anil, Sebastian Borgeaud, Andrew Dai, Katie Millican, Ethan Dyer, Mia Glaese, Thibault Sottiaux, Benjamin Lee, Fabio Viola, Malcolm Reynolds, Yuanzhong Xu, James Molloy, et al. (683 additional authors not shown)

    Abstract: In this report, we present the latest model of the Gemini family, Gemini 1.5 Pro, a highly compute-efficient multimodal mixture-of-experts model capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. Gemini 1.5 Pro achieves near-perfect recall on long-context retrieval tasks across modalit…

    Submitted 25 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  8. arXiv:2402.18372  [pdf, other]

    cs.LG cs.AI cs.DC

    FedUV: Uniformity and Variance for Heterogeneous Federated Learning

    Authors: Ha Min Son, Moon-Hyun Kim, Tai-Myoung Chung, Chao Huang, Xin Liu

    Abstract: Federated learning is a promising framework to train neural networks with widely distributed data. However, performance degrades heavily with heterogeneously distributed data. Recent work has shown this is due to the final layer of the network being most prone to local bias, some finding success freezing the final layer as an orthogonal classifier. We investigate the training dynamics of the class…

    Submitted 1 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 11 pages, 4 figures, 5 tables, to appear at CVPR 2024

  9. Security and Privacy Issues and Solutions in Federated Learning for Digital Healthcare

    Authors: Hyejun Jeong, Tai-Myoung Chung

    Abstract: The advent of Federated Learning has enabled the creation of a high-performing model as if it had been trained on a considerable amount of data. A multitude of participants and a server cooperatively train a model without the need for data disclosure or collection. The healthcare industry, where security and privacy are paramount, can substantially benefit from this new learning paradigm, as data…

    Submitted 16 January, 2024; originally announced January 2024.

    Journal ref: International Conference on Future Data and Security Engineering (2022) 316-331

  10. arXiv:2401.07326  [pdf, other]

    eess.IV cs.CV

    Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound Diagnosis

    Authors: Dat T. Chung, Minh-Anh Dang, Mai-Anh Vu, Minh T. Nguyen, Thanh-Huy Nguyen, Vinh Q. Dinh

    Abstract: Breast Ultrasound plays a vital role in cancer diagnosis as a non-invasive and cost-effective approach. In recent years, with the development of deep learning, many CNN-based approaches have been widely researched in both tumor localization and cancer classification tasks. Even though previous single models achieved great performance in both tasks, these methods have some limitations in inference…

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 7 pages, 3 figures

  11. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1320 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 2 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  12. arXiv:2311.05600  [pdf, other]

    cs.RO eess.SY

    FogROS2-Sky: Optimizing Latency and Cost for Multi-Cloud Robot Applications

    Authors: Kaiyuan Chen, Kush Hari, Rohil Khare, Charlotte Le, Trinity Chung, Jaimyn Drake, Karthik Dharmarajan, Simeon Adebola, Jeffrey Ichnowski, John Kubiatowicz, Ken Goldberg

    Abstract: This paper studies the cost-performance tradeoffs in cloud robotics with heterogeneous cloud service providers, which have complex pricing models and varying application requirements. We present FogROS2-Sky, a cost-efficient open source robotics platform that offloads unmodified ROS2 applications to multiple cloud providers and enables fine-grained cost analysis for ROS2 applications' communicatio…

    Submitted 9 November, 2023; originally announced November 2023.

  13. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, Anthony Brohan, Antonin Raffin, et al. (256 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 2 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  14. arXiv:2310.08795  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Mitigating Bias for Question Answering Models by Tracking Bias Influence

    Authors: Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, Nanyun Peng

    Abstract: Models of various NLP tasks have been shown to exhibit stereotypes, and the bias in the question answering (QA) models is especially harmful as the output answers might be directly consumed by the end users. There have been datasets to evaluate bias in QA models, while bias mitigation technique for the QA models is still under-explored. In this work, we propose BMBI, an approach to mitigate the bi…

    Submitted 12 October, 2023; originally announced October 2023.

  15. arXiv:2309.13457  [pdf, other]

    cs.LG cs.CV physics.comp-ph physics.flu-dyn

    Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data

    Authors: Wai Tong Chung, Bassem Akoush, Pushan Sharma, Alex Tamkin, Ki Sung Jung, Jacqueline H. Chen, Jack Guo, Davy Brouzet, Mohsen Talei, Bruno Savard, Alexei Y. Poludnenko, Matthias Ihme

    Abstract: Analysis of compressible turbulent flows is essential for applications related to propulsion, energy generation, and the environment. Here, we present BLASTNet 2.0, a 2.2 TB network-of-datasets containing 744 full-domain samples from 34 high-fidelity direct numerical simulations, which addresses the current limited availability of 3D high-fidelity reacting and non-reacting compressible turbulent f…

    Submitted 27 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted in Adv. in Neural Information Processing Systems 36 (NeurIPS 2023). Link: https://nips.cc/virtual/2023/poster/73433 . 55 pages, 21 figures. Keywords: Super-resolution, 3D, Neural Scaling, Physics-informed Loss, Computational Fluid Dynamics, Partial Differential Equations, Turbulent Reacting Flows, Direct Numerical Simulation, Fluid Mechanics, Combustion, Computer Vision

  16. arXiv:2306.11825  [pdf, other]

    cs.CL

    On Compositionality and Improved Training of NADO

    Authors: Sidi Lu, Wenbo Zhao, Chenyang Tao, Arpit Gupta, Shanchan Wu, Tagyoung Chung, Nanyun Peng

    Abstract: NeurAlly-Decomposed Oracle (NADO) is a powerful approach for controllable generation with large language models. Differentiating from finetuning/prompt tuning, it has the potential to avoid catastrophic forgetting of the large base model and achieve guaranteed convergence to an entropy-maximized closed-form solution without significantly limiting the model capacity. Despite its success, several ch…

    Submitted 20 June, 2023; originally announced June 2023.

  17. arXiv:2306.06893  [pdf, other]

    cs.CV cs.AI

    In-context Cross-Density Adaptation on Noisy Mammogram Abnormalities Detection

    Authors: Huy T. Nguyen, Thinh B. Lam, Quan D. D. Tran, Minh T. Nguyen, Dat T. Chung, Vinh Q. Dinh

    Abstract: This paper investigates the impact of breast density distribution on the generalization performance of deep-learning models on mammography images using the VinDr-Mammo dataset. We explore the use of domain adaptation techniques, specifically Domain Adaptive Object Detection (DAOD) with the Noise Latent Transferability Exploration (NLTE) framework, to improve model performance across breast densiti…

    Submitted 12 June, 2023; originally announced June 2023.

  18. arXiv:2305.19228  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    Unsupervised Melody-to-Lyric Generation

    Authors: Yufei Tian, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Gunnar Sigurdsson, Chenyang Tao, Wenbo Zhao, Yiwen Chen, Tagyoung Chung, Jing Huang, Nanyun Peng

    Abstract: Automatic melody-to-lyric generation is a task in which song lyrics are generated to go with a given melody. It is of significant practical interest and more challenging than unconstrained lyric generation as the music imposes additional constraints onto the lyrics. The training data is limited as most songs are copyrighted, resulting in models that underfit the complicated cross-modal relationshi…

    Submitted 22 December, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: ACL 2023. arXiv admin note: substantial text overlap with arXiv:2305.07760

  19. arXiv:2305.07760  [pdf, other]

    cs.AI cs.CL cs.MM

    Unsupervised Melody-Guided Lyrics Generation

    Authors: Yufei Tian, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Gunnar Sigurdsson, Chenyang Tao, Wenbo Zhao, Tagyoung Chung, Jing Huang, Nanyun Peng

    Abstract: Automatic song writing is a topic of significant practical interest. However, its research is largely hindered by the lack of training data due to copyright concerns and challenged by its creative nature. Most noticeably, prior works often fall short of modeling the cross-modal correlation between melody and lyrics due to limited parallel data, hence generating lyrics that are less singable. Exist…

    Submitted 25 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Presented at AAAI23 CreativeAI workshop (Non-Archival). A later version is accepted to ACL23

  20. arXiv:2301.10915  [pdf, other]

    cs.CL cs.AI

    Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning

    Authors: Mingyu Derek Ma, Jiun-Yu Kao, Shuyang Gao, Arpit Gupta, Di Jin, Tagyoung Chung, Nanyun Peng

    Abstract: Dialogue state tracking (DST) is an important step in dialogue management to keep track of users' beliefs. Existing works fine-tune all language model (LM) parameters to tackle the DST task, which requires significant data and computing resources for training and hosting. The cost grows exponentially in the real-world deployment where dozens of fine-tuned LM are used for different domains and task…

    Submitted 29 May, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: In the INTERSPEECH 2023, and the Second Workshop on Efficient Natural Language and Speech Processing (ENLSP) at NeurIPS 2022

  21. arXiv:2212.01976  [pdf, other]

    cs.CR cs.AI

    FedCC: Robust Federated Learning against Model Poisoning Attacks

    Authors: Hyejun Jeong, Hamin Son, Seohu Lee, Jayun Hyun, Tai-Myoung Chung

    Abstract: Federated Learning has emerged to cope with rising concerns about privacy breaches in using Machine or Deep Learning models. This new paradigm allows the leverage of deep learning models in a distributed manner, enhancing privacy preservation. However, the server's blindness to local datasets introduces its vulnerability to model poisoning attacks and data heterogeneity, tampering with the global…

    Submitted 4 December, 2022; originally announced December 2022.

  22. arXiv:2211.08507  [pdf, other]

    cs.LG

    Decision-Aware Learning for Optimizing Health Supply Chains

    Authors: Tsai-Hsuan Chung, Vahid Rostami, Hamsa Bastani, Osbert Bastani

    Abstract: We study the problem of allocating limited supply of medical resources in developing countries, in particular, Sierra Leone. We address this problem by combining machine learning (to predict demand) with optimization (to optimize allocations). A key challenge is the need to align the loss function used to train the machine learning model with the decision loss associated with the downstream optimi…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 8 pages

  23. arXiv:2210.13522  [pdf, other]

    cs.CL

    Context-Situated Pun Generation

    Authors: Jiao Sun, Anjali Narayan-Chen, Shereen Oraby, Shuyang Gao, Tagyoung Chung, Jing Huang, Yang Liu, Nanyun Peng

    Abstract: Previous work on pun generation commonly begins with a given pun word (a pair of homophones for heterographic pun generation and a polyseme for homographic pun generation) and seeks to generate an appropriate pun. While this may enable efficient pun generation, we believe that a pun is most entertaining if it fits appropriately within a given context, e.g., a given situation or dialogue. In this w…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 main conference

  24. arXiv:2210.13513  [pdf, other]

    cs.CL

    ExPUNations: Augmenting Puns with Keywords and Explanations

    Authors: Jiao Sun, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Tagyoung Chung, Jing Huang, Yang Liu, Nanyun Peng

    Abstract: The tasks of humor understanding and generation are challenging and subjective even for humans, requiring commonsense and real-world knowledge to master. Puns, in particular, add the challenge of fusing that knowledge with the ability to interpret lexical-semantic ambiguity. In this paper, we present the ExPUNations (ExPUN) dataset, in which we augment an existing dataset of puns with detailed cro…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 main conference

  25. arXiv:2208.01916  [pdf, other]

    cs.CV

    N-RPN: Hard Example Learning for Region Proposal Networks

    Authors: MyeongAh Cho, Tae-young Chung, Hyeongmin Lee, Sangyoun Lee

    Abstract: The region proposal task is to generate a set of candidate regions that contain an object. In this task, it is most important to propose as many candidates of ground-truth as possible in a fixed number of proposals. In a typical image, however, there are too few hard negative examples compared to the vast number of easy negatives, so region proposal networks struggle to train on hard negatives. Be…

    Submitted 3 August, 2022; originally announced August 2022.

  26. arXiv:2207.12546  [pdf, other]

    cs.LG physics.flu-dyn

    The Bearable Lightness of Big Data: Towards Massive Public Datasets in Scientific Machine Learning

    Authors: Wai Tong Chung, Ki Sung Jung, Jacqueline H. Chen, Matthias Ihme

    Abstract: In general, large datasets enable deep learning models to perform with good accuracy and generalizability. However, massive high-fidelity simulation datasets (from molecular chemistry, astrophysics, computational fluid dynamics (CFD), etc.) can be challenging to curate due to dimensionality and storage constraints. Lossy compression algorithms can help mitigate limitations from storage, as long as…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in ICML 2022 2nd AI for Science Workshop. 10 pages, 8 figures

    Journal ref: ICML 2022 2nd AI for Science Workshop

  27. arXiv:2205.07319  [pdf]

    cs.SD cs.AI cs.LG eess.AS

    cMelGAN: An Efficient Conditional Generative Model Based on Mel Spectrograms

    Authors: Tracy Qian, Jackson Kaunismaa, Tony Chung

    Abstract: Analysing music in the field of machine learning is a very difficult problem with numerous constraints to consider. The nature of audio data, with its very high dimensionality and widely varying scales of structure, is one of the primary reasons why it is so difficult to model. There are many applications of machine learning in music, like classifying the mood of a piece of music, conditional…

    Submitted 15 May, 2022; originally announced May 2022.

  28. arXiv:2112.00407  [pdf, other]

    cs.LG cs.AI cs.DC

    Compare Where It Matters: Using Layer-Wise Regularization To Improve Federated Learning on Heterogeneous Data

    Authors: Ha Min Son, Moon Hyun Kim, Tai-Myoung Chung

    Abstract: Federated Learning is a widely adopted method to train neural networks over distributed data. One main limitation is the performance degradation that occurs when data is heterogeneously distributed. While many works have attempted to address this problem, these methods under-perform because they are founded on a limited understanding of neural networks. In this work, we verify that only certain im…

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 8 pages, 5 figures, 4 tables

  29. arXiv:2111.11576  [pdf, other]

    cs.LG cs.CL cs.CV

    Building Goal-Oriented Dialogue Systems with Situated Visual Context

    Authors: Sanchit Agarwal, Jan Jezabek, Arijit Biswas, Emre Barut, Shuyang Gao, Tagyoung Chung

    Abstract: Most popular goal-oriented dialogue agents are capable of understanding the conversational context. However, with the surge of virtual assistants with screens, the next generation of agents are required to also understand screen context in order to provide a proper interactive experience, and better understand users' goals. In this paper, we propose a novel multimodal conversational framework, wher…

    Submitted 22 November, 2021; originally announced November 2021.

  30. arXiv:2109.12211  [pdf, other]

    cs.CL

    Style Control for Schema-Guided Natural Language Generation

    Authors: Alicia Y. Tsai, Shereen Oraby, Vittorio Perera, Jiun-Yu Kao, Yuheng Du, Anjali Narayan-Chen, Tagyoung Chung, Dilek Hakkani-Tur

    Abstract: Natural Language Generation (NLG) for task-oriented dialogue systems focuses on communicating specific content accurately, fluently, and coherently. While these attributes are crucial for a successful dialogue, it is also desirable to simultaneously accomplish specific stylistic goals, such as response length, point-of-view, descriptiveness, sentiment, formality, and empathy. In this work, we focu…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted at the 3rd Workshop on NLP for ConvAI at EMNLP '21

  31. arXiv:2109.00202  [pdf, other]

    cs.LG cs.AI

    Federated Learning: Issues in Medical Application

    Authors: Joo Hun Yoo, Hyejun Jeong, Jaehyeok Lee, Tai-Myoung Chung

    Abstract: Since federated learning, which makes AI learning possible without moving local data around, was introduced by Google in 2017, it has been actively studied, particularly in the field of medicine. In fact, the idea of machine learning in AI without collecting data from local clients is very attractive because data remain in local sites. However, federated learning techniques still have various op…

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 20 pages, 3 figures, 1 table, submitted to FDSE2021

  32. arXiv:2108.04551  [pdf, other]

    cs.LG cs.AI cs.CR cs.DC

    ABC-FL: Anomalous and Benign client Classification in Federated Learning

    Authors: Hyejun Jeong, Joonyong Hwang, Tai Myung Chung

    Abstract: Federated Learning is a distributed machine learning framework designed for data privacy preservation, i.e., local data remain private throughout the entire training and testing procedure. Federated Learning is gaining popularity because it allows one to use machine learning techniques while preserving privacy. However, it inherits the vulnerabilities and susceptibilities raised in deep learning te…

    Submitted 1 December, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

  33. arXiv:2108.01903  [pdf, other]

    cs.LG cs.AI

    Personalized Federated Learning with Clustering: Non-IID Heart Rate Variability Data Application

    Authors: Joo Hun Yoo, Ha Min Son, Hyejun Jeong, Eun-Hye Jang, Ah Young Kim, Han Young Yu, Hong Jin Jeon, Tai-Myoung Chung

    Abstract: While machine learning techniques are being applied to various fields for their exceptional ability to find complex relations in large datasets, the strengthening of regulations on data ownership and privacy is causing increasing difficulty in its application to medical data. In light of this, Federated Learning has recently been proposed as a solution to train on private data without breach of co…

    Submitted 10 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 6 pages with two columns, 4 figures, 3 tables

  34. arXiv:2107.01152  [pdf, other]

    stat.ML cs.AI cs.CV cs.IT cs.LG

    Simpler, Faster, Stronger: Breaking The log-K Curse On Contrastive Learners With FlatNCE

    Authors: Junya Chen, Zhe Gan, Xuan Li, Qing Guo, Liqun Chen, Shuyang Gao, Tagyoung Chung, Yi Xu, Belinda Zeng, Wenlian Lu, Fan Li, Lawrence Carin, Chenyang Tao

    Abstract: InfoNCE-based contrastive representation learners, such as SimCLR, have been tremendously successful in recent years. However, these contrastive schemes are notoriously resource demanding, as their effectiveness breaks down with small-batch training (i.e., the log-K curse, where K is the batch size). In this work, we reveal mathematically why contrastive learners fail in the small-batch-size reg…

    Submitted 2 July, 2021; originally announced July 2021.

  35. arXiv:2105.00402  [pdf, other]

    eess.IV cs.CV

    AG-CUResNeSt: A Novel Method for Colon Polyp Segmentation

    Authors: Dinh Viet Sang, Tran Quang Chung, Phan Ngoc Lan, Dao Viet Hang, Dao Van Long, Nguyen Thi Thuy

    Abstract: Colorectal cancer is among the most common malignancies and can develop from high-risk colon polyps. Colonoscopy is an effective screening tool to detect and remove polyps, especially in the case of precancerous lesions. However, the missing rate in clinical practice is relatively high due to many factors. The procedure could benefit greatly from using AI models for automatic polyp segmentation, w…

    Submitted 1 March, 2022; v1 submitted 2 May, 2021; originally announced May 2021.

  36. arXiv:2104.09088  [pdf, other]

    cs.CL cs.LG

    Alexa Conversations: An Extensible Data-driven Approach for Building Task-oriented Dialogue Systems

    Authors: Anish Acharya, Suranjit Adhikari, Sanchit Agarwal, Vincent Auvray, Nehal Belgamwar, Arijit Biswas, Shubhra Chandra, Tagyoung Chung, Maryam Fazel-Zarandi, Raefer Gabriel, Shuyang Gao, Rahul Goel, Dilek Hakkani-Tur, Jan Jezabek, Abhay Jha, Jiun-Yu Kao, Prakash Krishnan, Peter Ku, Anuj Goyal, Chien-Wei Lin, Qing Liu, Arindam Mandal, Angeliki Metallinou, Vishal Naik, Yi Pan, et al. (6 additional authors not shown)

    Abstract: Traditional goal-oriented dialogue systems rely on various components such as natural language understanding, dialogue state tracking, policy learning and response generation. Training each component requires annotations which are hard to obtain for every new domain, limiting scalability of such systems. Similarly, rule-based dialogue systems require extensive writing and maintenance of rules and…

    Submitted 19 April, 2021; originally announced April 2021.

    Journal ref: NAACL 2021 System Demonstrations Track

  37. arXiv:2103.06397  [pdf, other]

    physics.flu-dyn cs.LG stat.ML

    Interpretable Data-driven Methods for Subgrid-scale Closure in LES for Transcritical LOX/GCH4 Combustion

    Authors: Wai Tong Chung, Aashwin Ananda Mishra, Matthias Ihme

    Abstract: Many practical combustion systems such as those in rockets, gas turbines, and internal combustion engines operate under high pressures that surpass the thermodynamic critical limit of fuel-oxidizer mixtures. These conditions require the consideration of complex fluid behaviors that pose challenges for numerical simulations, casting doubts on the validity of existing subgrid-scale (SGS) models in l…

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: 22 pages, 13 figures

    Journal ref: Combustion and Flame 2021

  38. arXiv:2101.06779  [pdf, other]

    cs.CL

    Few Shot Dialogue State Tracking using Meta-learning

    Authors: Saket Dingliwal, Bill Gao, Sanchit Agarwal, Chien-Wei Lin, Tagyoung Chung, Dilek Hakkani-Tur

    Abstract: Dialogue State Tracking (DST) forms a core component of automated chatbot based systems designed for specific goals like hotel, taxi reservation, tourist information, etc. With the increasing need to deploy such systems in new domains, solving the problem of zero/few-shot DST has become necessary. There has been a rising trend for learning to transfer knowledge from resource-rich domains to unknow…

    Submitted 5 April, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: To appear in EACL 2021

  39. arXiv:2011.08243  [pdf, other]

    cs.CL cs.AI cs.LG

    Dialog Simulation with Realistic Variations for Training Goal-Oriented Conversational Systems

    Authors: Chien-Wei Lin, Vincent Auvray, Daniel Elkind, Arijit Biswas, Maryam Fazel-Zarandi, Nehal Belgamwar, Shubhra Chandra, Matt Zhao, Angeliki Metallinou, Tagyoung Chung, Charlie Shucheng Zhu, Suranjit Adhikari, Dilek Hakkani-Tur

    Abstract: Goal-oriented dialog systems enable users to complete specific goals like requesting information about a movie or booking a ticket. Typically the dialog system pipeline contains multiple ML models, including natural language understanding, state tracking and action prediction (policy learning). These models are trained through a combination of supervised or reinforcement learning methods and there…

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: To be presented at Human in the Loop Dialogue Systems Workshop, NeurIPS 2020

  40. arXiv:2010.13424  [pdf, other]

    cs.CV

    Multi-object tracking with self-supervised associating network

    Authors: Tae-young Chung, Heansung Lee, Myeong Ah Cho, Suhwan Cho, Sangyoun Lee

    Abstract: Multi-Object Tracking (MOT) is a task that has a lot of potential for development, and there are still many problems to be solved. In the traditional tracking-by-detection paradigm, there has been a lot of work on feature-based object re-identification methods. However, these methods suffer from a lack of training data. For labeling multi-object tracking dataset, every detection in a video sequenc…

    Submitted 26 October, 2020; originally announced October 2020.

  41. arXiv:2009.04023  [pdf, other]

    physics.flu-dyn cs.LG physics.comp-ph stat.AP

    Data-assisted combustion simulations with dynamic submodel assignment using random forests

    Authors: Wai Tong Chung, Aashwin Ananda Mishra, Nikolaos Perakis, Matthias Ihme

    Abstract: In this investigation, we outline a data-assisted approach that employs random forest classifiers for local and dynamic combustion submodel assignment in turbulent-combustion simulations. This method is applied in simulations of a single-element GOX/GCH4 rocket combustor; a priori as well as a posteriori assessments are conducted to (i) evaluate the accuracy and adjustability of the classifier for…

    Submitted 17 January, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

    Comments: Accepted version; 23 pages, 12 figures

    Journal ref: Combustion and Flame 227 (2021) 172-185

  42. arXiv:2006.01791  [pdf, other]

    cs.LG stat.ML

    SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better Regularization

    Authors: A. F. M. Shahab Uddin, Mst. Sirazam Monira, Wheemyung Shin, TaeChoong Chung, Sung-Ho Bae

    Abstract: Advanced data augmentation strategies have widely been studied to improve the generalization ability of deep learning models. Regional dropout is one of the popular solutions that guides the model to focus on less discriminative parts by randomly removing image regions, resulting in improved regularization. However, such information removal is undesirable. On the other hand, recent strategies sugg…

    Submitted 27 July, 2021; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 12 pages, 5 figures, 5 tables

    MSC Class: 68T07 ACM Class: I.2; I.4

    Journal ref: International Conference On Learning Representations (ICLR) 2021

  43. arXiv:2005.05480  [pdf, other]

    cs.CL

    Schema-Guided Natural Language Generation

    Authors: Yuheng Du, Shereen Oraby, Vittorio Perera, Minmin Shen, Anjali Narayan-Chen, Tagyoung Chung, Anu Venkatesh, Dilek Hakkani-Tur

    Abstract: Neural network based approaches to data-to-text natural language generation (NLG) have gained popularity in recent years, with the goal of generating a natural language prompt that accurately realizes an input meaning representation. To facilitate the training of neural network models, researchers created large datasets of paired utterances and their meaning representations. However, the creation…

    Submitted 4 November, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: Accepted as a long paper at INLG 2020

  44. arXiv:2005.03759  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    DeePore: a deep learning workflow for rapid and comprehensive characterization of porous materials

    Authors: Arash Rabbani, Masoud Babaei, Reza Shams, Ying Da Wang, Traiwit Chung

    Abstract: DeePore is a deep learning workflow for rapid estimation of a wide range of porous material properties based on the binarized micro-tomography images. By combining naturally occurring porous textures we generated 17700 semi-real 3-D micro-structures of porous geo-materials with size of 256^3 voxels and 30 physical properties of each sample are calculated using physical simulations on the correspon…

    Submitted 10 October, 2020; v1 submitted 3 May, 2020; originally announced May 2020.

    Journal ref: Advances in Water Resources, 2020, 103787

  45. arXiv:2004.11675  [pdf, other]

    physics.flu-dyn cs.LG stat.ML

    ML-LBM: Machine Learning Aided Flow Simulation in Porous Media

    Authors: Ying Da Wang, Traiwit Chung, Ryan T. Armstrong, Peyman Mostaghimi

    Abstract: Simulation of fluid flow in porous media has many applications, from the micro-scale (cell membranes, filters, rocks) to macro-scale (groundwater, hydrocarbon reservoirs, and geothermal) and beyond. Direct simulation of flow in porous media requires significant computational resources to solve within reasonable timeframes. An integrated method combining predictions of fluid flow (fast, limited acc…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: 23 pages, 16 figures

  46. arXiv:2004.05827  [pdf, other]

    cs.CL cs.AI cs.LG

    From Machine Reading Comprehension to Dialogue State Tracking: Bridging the Gap

    Authors: Shuyang Gao, Sanchit Agarwal, Tagyoung Chung, Di Jin, Dilek Hakkani-Tur

    Abstract: Dialogue state tracking (DST) is at the heart of task-oriented dialogue systems. However, the scarcity of labeled data is an obstacle to building accurate and robust state tracking systems that work across a variety of domains. Existing approaches generally require some dialogue data with state information and their ability to generalize to unknown domains is limited. In this paper, we propose usi…

    Submitted 13 April, 2020; originally announced April 2020.

  47. arXiv:2002.03651  [pdf, other]

    cs.CV

    CRVOS: Clue Refining Network for Video Object Segmentation

    Authors: Suhwan Cho, MyeongAh Cho, Tae-young Chung, Heansung Lee, Sangyoun Lee

    Abstract: The encoder-decoder based methods for semi-supervised video object segmentation (Semi-VOS) have received extensive attention due to their superior performances. However, most of them have complex intermediate networks which generate strong specifiers to be robust against challenging scenarios, and this is quite inefficient when dealing with relatively simple scenarios. To solve this problem, we pr…

    Submitted 2 June, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: ICIP 2020. Code: https://github.com/suhwan-cho/CRVOS

  48. arXiv:2001.02090  [pdf, other]

    cs.CV

    AD-VO: Scale-Resilient Visual Odometry Using Attentive Disparity Map

    Authors: Joosung Lee, Sangwon Hwang, Kyungjae Lee, Woo Jin Kim, Junhyeop Lee, Tae-young Chung, Sangyoun Lee

    Abstract: Visual odometry is an essential part of the localization module in SLAM systems. However, previous methods require tuning the system to adapt to environment changes. In this paper, we propose a learning-based approach for frame-to-frame monocular visual odometry estimation. The proposed network is trained only on disparity maps, not only to cover the environment changes but also to solve the scale pr…

    Submitted 7 January, 2020; originally announced January 2020.

    Comments: 5 pages, 5 figures, 2018.02 papers

  49. arXiv:1910.00458  [pdf, other]

    cs.CL cs.LG

    MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension

    Authors: Di Jin, Shuyang Gao, Jiun-Yu Kao, Tagyoung Chung, Dilek Hakkani-tur

    Abstract: Machine Reading Comprehension (MRC) for question answering (QA), which aims to answer a question given the relevant context passages, is an important way to test the ability of intelligence systems to understand human language. Multiple-Choice QA (MCQA) is one of the most difficult tasks in MRC because it often requires more advanced reading comprehension skills such as logical reasoning, summariz…

    Submitted 18 November, 2019; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Accepted by AAAI 2020

  50. arXiv:1908.01946  [pdf, other]

    cs.CL cs.LG

    Dialog State Tracking: A Neural Reading Comprehension Approach

    Authors: Shuyang Gao, Abhishek Sethi, Sanchit Agarwal, Tagyoung Chung, Dilek Hakkani-Tur

    Abstract: Dialog state tracking is used to estimate the current belief state of a dialog given all the preceding conversation. Machine reading comprehension, on the other hand, focuses on building systems that read passages of text and answer questions that require some understanding of passages. We formulate dialog state tracking as a reading comprehension task to answer the question…

    Submitted 14 August, 2019; v1 submitted 6 August, 2019; originally announced August 2019.

    Comments: 10 pages, to appear in Special Interest Group on Discourse and Dialogue (SIGDIAL) 2019 (ORAL)