Showing 1–50 of 540 results for author: Ji, H

  1. arXiv:2405.20015  [pdf, other]

    cs.AI cs.CL

    Efficient LLM-Jailbreaking by Introducing Visual Modality

    Authors: Zhenxing Niu, Yuyao Sun, Haodong Ren, Haoxuan Ji, Quan Wang, Xiaoke Ma, Gang Hua, Rong Jin

    Abstract: This paper focuses on jailbreaking attacks against large language models (LLMs), eliciting them to generate objectionable content in response to harmful user queries. Unlike previous LLM-jailbreaks that directly orient to LLMs, our approach begins by constructing a multimodal large language model (MLLM) through the incorporation of a visual module into the target LLM. Subsequently, we conduct an e…

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.17303  [pdf, other]

    astro-ph.SR

    High-Resolution Observation and Magnetic Modeling of a Solar Minifilament: the Formation, Eruption and Failing Mechanisms

    Authors: Weilin Teng, Yingna Su, Rui Liu, Jialin Chen, Yanjie Liu, Jun Dai, Wenda Cao, Jinhua Shen, Haisheng Ji

    Abstract: Minifilaments are widespread small-scale structures in the solar atmosphere. To better understand their formation and eruption mechanisms, we investigate the entire life of a sigmoidal minifilament located below a large quiescent filament observed by BBSO/GST on 2015 August 3. The Hα structure initially appears as a group of arched threads, then transforms into two J-shaped arcades, and finally fo…

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2405.15028  [pdf, other]

    cs.CL cs.IR

    AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings

    Authors: Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar

    Abstract: Ranking is a fundamental and popular problem in search. However, existing ranking algorithms usually restrict the granularity of ranking to full passages or require a specific dense index for each desired level of granularity. Such lack of flexibility in granularity negatively affects many applications that can benefit from more granular ranking, such as sentence-level ranking for open-domain ques…

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.14203  [pdf, other]

    cs.LG cs.AI physics.chem-ph

    GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices

    Authors: Thao Nguyen, Tiara Torres-Flores, Changhyun Hwang, Carl Edwards, Ying Diao, Heng Ji

    Abstract: This paper presents a novel approach for predicting Power Conversion Efficiency (PCE) of Organic Photovoltaic (OPV) devices, called GLaD: synergizing molecular Graphs and Language Descriptors for enhanced PCE prediction. Due to the lack of high-quality experimental data, we collect a dataset consisting of 500 pairs of OPV donor and acceptor molecules along with their corresponding PCE values, whic…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: In progress

  5. arXiv:2405.13179  [pdf, other]

    cs.CL

    RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

    Authors: Yuelyu Ji, Zhuochun Li, Rui Meng, Sonish Sivarajkumar, Yanshan Wang, Zeshui Yu, Hui Ji, Yushui Han, Hanyu Zeng, Daqing He

    Abstract: This paper introduces the RAG-RLRC-LaySum framework, designed to make complex biomedical research understandable to laymen through advanced Natural Language Processing (NLP) techniques. Our Retrieval Augmented Generation (RAG) solution, enhanced by a reranking method, utilizes multiple knowledge sources to ensure the precision and pertinence of lay summaries. Additionally, our Reinforcement Learni…

    Submitted 27 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  6. arXiv:2405.13005  [pdf]

    cs.CL cs.AI cs.SI

    Understanding the Rare Inflammatory Disease Using Large Language Models and Social Media Data

    Authors: Nan Miles Xi, Hong-Long Ji, Lin Wang

    Abstract: Sarcoidosis is a rare inflammatory disease characterized by the formation of granulomas in various organs. The disease presents diagnostic and treatment challenges due to its diverse manifestations and unpredictable nature. In this study, we employed a Large Language Model (LLM) to analyze sarcoidosis-related discussions on the social media platform Reddit. Our findings underscore the efficacy of…

    Submitted 12 May, 2024; originally announced May 2024.

  7. arXiv:2405.05481  [pdf, other]

    quant-ph

    Achieving millisecond coherence fluxonium through overlap Josephson junctions

    Authors: Fei Wang, Kannan Lu, Huijuan Zhan, Lu Ma, Feng Wu, Hantao Sun, Hao Deng, Yang Bai, Feng Bao, Xu Chang, Ran Gao, Xun Gao, Guicheng Gong, Lijuan Hu, Ruizi Hu, Honghong Ji, Xizheng Ma, Liyong Mao, Zhijun Song, Chengchun Tang, Hongcheng Wang, Tenghui Wang, Ziang Wang, Tian Xia, Hongxin Xu, et al. (10 additional authors not shown)

    Abstract: Fluxonium qubits are recognized for their high coherence times and high operation fidelities, attributed to their unique design incorporating over 100 Josephson junctions per superconducting loop. However, this complexity poses significant fabrication challenges, particularly in achieving high yield and junction uniformity with traditional methods. Here, we introduce an overlap process for Josephs…

    Submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2405.04602  [pdf, other]

    cs.SE

    An Empirical Study of Kotlin-Java Interactions

    Authors: Qiong Feng, Huan Ji, Xiaotian Ma, Peng Liang

    Abstract: Background: Since Google introduced Kotlin as an official programming language for developing Android apps in 2017, Kotlin has gained widespread adoption in Android development. The interoperability of Java and Kotlin's design nature allows them to coexist and interact with each other smoothly within a project. Aims: However, there is limited research on how Java and Kotlin interact with each othe…

    Submitted 7 May, 2024; originally announced May 2024.

  9. arXiv:2405.03446  [pdf, other]

    cs.CR

    SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence

    Authors: Hangyuan Ji, Jian Yang, Linzheng Chai, Chaoren Wei, Liqun Yang, Yunlong Duan, Yunli Wang, Tianzhen Sun, Hongcheng Guo, Tongliang Li, Changyu Ren, Zhoujun Li

    Abstract: To address the increasing complexity and frequency of cybersecurity incidents emphasized by the recent cybersecurity threat reports with over 10 billion instances, cyber threat intelligence (CTI) plays a critical role in the modern cybersecurity landscape by offering the insights required to understand and combat the constantly evolving nature of cyber threats. Inspired by the powerful capability…

    Submitted 6 May, 2024; originally announced May 2024.

  10. arXiv:2404.17512  [pdf, other]

    math.PR

    On the spectral edge of non-Hermitian random matrices

    Authors: Andrew Campbell, Giorgio Cipolloni, László Erdős, Hong Chang Ji

    Abstract: For general non-Hermitian random matrices $X$ and deterministic deformation matrices $A$, we prove that the local eigenvalue statistics of $A+X$ close to the typical edge points of its spectrum are universal. Furthermore, we show that under natural assumptions on $A$ the spectrum of $A+X$ does not have outliers at a distance larger than the natural fluctuation scale of the eigenvalues. As a conseq…

    Submitted 6 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 51 pages

    MSC Class: 15B52; 60B20

  11. arXiv:2404.16792  [pdf, other]

    cs.LG cs.AI cs.CL

    Weak-to-Strong Extrapolation Expedites Alignment

    Authors: Chujie Zheng, Ziqi Wang, Heng Ji, Minlie Huang, Nanyun Peng

    Abstract: The open-source community is experiencing a surge in the release of large language models (LLMs) that are trained to follow instructions and align with human preference. However, further training to improve them still requires expensive computational resources and data annotations. Is it possible to bypass additional training and cost-effectively acquire better-aligned models? Inspired by the lite…

    Submitted 22 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Add theoretical explanation and more evaluation results

  12. arXiv:2404.12666  [pdf, other]

    cs.DC cs.CR cs.ET

    A Survey on Federated Analytics: Taxonomy, Enabling Techniques, Applications and Open Issues

    Authors: Zibo Wang, Haichao Ji, Yifei Zhu, Dan Wang, Zhu Han

    Abstract: The escalating influx of data generated by networked edge devices, coupled with the growing awareness of data privacy, has promoted a transformative shift in computing paradigms from centralized data processing to privacy-preserved distributed data processing. Federated analytics (FA) is an emerging technique to support collaborative data analytics among diverse data owners without centralizing th…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: This survey has been submitted to IEEE Communications Surveys & Tutorials

  13. arXiv:2404.12135  [pdf, other]

    cs.MA cs.CR cs.DC

    mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

    Authors: Wei Zhang, Hongcheng Guo, Jian Yang, Yi Zhang, Chaoran Yan, Zhoujin Tian, Hangyuan Ji, Zhoujun Li, Tongliang Li, Tieqiao Zheng, Chao Chen, Yi Liang, Xu Shi, Liangfan Zheng, Bo Zhang

    Abstract: The escalating complexity of micro-services architecture in cloud-native technologies poses significant challenges for maintaining system stability and efficiency. To conduct root cause analysis (RCA) and resolution of alert events, we propose a pioneering framework, multi-Agent Blockchain-inspired Collaboration for root cause analysis in micro-services architecture (mABC), to revolutionize the AI…

    Submitted 3 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  14. arXiv:2404.06479  [pdf, other]

    cs.CL cs.AI cs.CV

    Text-Based Reasoning About Vector Graphics

    Authors: Zhenhailong Wang, Joy Hsu, Xingyao Wang, Kuan-Hao Huang, Manling Li, Jiajun Wu, Heng Ji

    Abstract: While large multimodal models excel in broad vision-language benchmarks, they often struggle with tasks requiring precise perception of low-level visual details, such as comparing line lengths or solving simple mazes. In particular, this failure mode persists in question-answering tasks about vector graphics -- images composed purely of 2D objects and shapes. To address this challenge, we propose…

    Submitted 24 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Project page: https://mikewangwzhl.github.io/VDLM/

  15. arXiv:2404.01652  [pdf, other]

    cs.CL cs.AI

    Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization

    Authors: Zixuan Zhang, Revanth Gangi Reddy, Kevin Small, Tong Zhang, Heng Ji

    Abstract: Open-domain Question Answering (OpenQA) aims at answering factual questions with an external large-scale knowledge corpus. However, real-world knowledge is not static; it updates and evolves continually. Such a dynamic characteristic of knowledge poses a vital challenge for these models, as the trained models need to constantly adapt to the latest information to make sure that the answers remain a…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024 Findings

  16. arXiv:2403.19081  [pdf]

    physics.optics

    Surface variation analysis of freeform optical systems over surface frequency bands for prescribed wavefront errors

    Authors: Rundong Fan, Shili Wei, Huiru JI, Zhuang Qian, Hao Tan, Yan Mo, Donglin MA

    Abstract: The surface errors of freeform surfaces reflect the manufacturing complexities and significantly impact the feasibility of processing designed optical systems. With multiple degrees of freedom, freeform surfaces pose challenges in surface tolerance analysis in the field. Nevertheless, current research has neglected the influence of surface slopes on the directions of ray propagation. A sudden alte…

    Submitted 27 March, 2024; originally announced March 2024.

  17. arXiv:2403.18671  [pdf, other]

    cs.CL cs.LG

    Fact Checking Beyond Training Set

    Authors: Payam Karisani, Heng Ji

    Abstract: Evaluating the veracity of everyday claims is time consuming and in some cases requires domain expertise. We empirically demonstrate that the commonly used fact checking pipeline, known as the retriever-reader, suffers from performance deterioration when it is trained on the labeled data from one domain and used in another domain. Afterwards, we delve into each component of the pipeline and propos…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: NAACL 2024

  18. arXiv:2403.16823  [pdf, ps, other]

    eess.SY cs.LG

    Resource and Mobility Management in Hybrid LiFi and WiFi Networks: A User-Centric Learning Approach

    Authors: Han Ji, Xiping Wu

    Abstract: Hybrid light fidelity (LiFi) and wireless fidelity (WiFi) networks (HLWNets) are an emerging indoor wireless communication paradigm, which combines the advantages of the capacious optical spectra of LiFi and ubiquitous coverage of WiFi. Meanwhile, load balancing (LB) becomes a key challenge in resource management for such hybrid networks. The existing LB methods are mostly network-centric, relying…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 12 figures, 3 tables, submitted to IEEE TWC

  19. arXiv:2403.12027  [pdf, other]

    cs.CL cs.AI cs.CV

    From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

    Abstract: Data visualization in the form of charts plays a pivotal role in data analysis, offering critical insights and aiding in informed decision-making. Automatic chart understanding has witnessed significant advancements with the rise of large foundation models in recent years. Foundation models, such as large language models, have revolutionized various natural language processing tasks and are increa…

    Submitted 25 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  20. arXiv:2403.08069  [pdf]

    cond-mat.str-el cond-mat.mtrl-sci

    Noncentrosymmetric Triangular Magnet CaMnTeO$_6$: Strong Quantum Fluctuations and Role of s0 vs. s2 Electronic States in Competing Exchange Interactions

    Authors: Xudong Huai, Emmanuel Acheampong, Erich Delles, Michał J. Winiarski, Maurice Sorolla II, Lila Nassar, Mingli Liang, Caleb Ramette, Huiwen Ji, Allen Scheie, Stuart Calder, Martin Mourigal, Thao T. Tran

    Abstract: Noncentrosymmetric triangular magnets offer a unique platform for realizing strong quantum fluctuations. However, designing these quantum materials remains an open challenge attributable to a knowledge gap in the tunability of competing exchange interactions at the atomic level. Here, we create a new noncentrosymmetric triangular S = 3/2 magnet CaMnTeO$_6$ based on careful chemical and physical co…

    Submitted 12 March, 2024; originally announced March 2024.

  21. arXiv:2403.06093  [pdf, other]

    cs.CV

    Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors

    Authors: Haoxuanye Ji, Pengpeng Liang, Erkang Cheng

    Abstract: Multi-camera-based 3D object detection has made notable progress in the past several years. However, we observe that there are cases (e.g. faraway regions) in which popular 2D object detectors are more reliable than state-of-the-art 3D detectors. In this paper, to improve the performance of query-based 3D object detectors, we present a novel query generating approach termed QAF2D, which infers 3D…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  22. arXiv:2403.05159  [pdf, other]

    cs.CV

    LVIC: Multi-modality segmentation by Lifting Visual Info as Cue

    Authors: Zichao Dong, Bowen Pang, Xufeng Huang, Hang Ji, Xin Zhan, Junbo Chen

    Abstract: Multi-modality fusion has proven an effective method for 3D perception for autonomous driving. However, most current multi-modality fusion pipelines for LiDAR semantic segmentation have complicated fusion mechanisms. Point painting is a quite straightforward method which directly binds LiDAR points with visual information. Unfortunately, previous point-painting-like methods suffer from projection e…

    Submitted 8 March, 2024; originally announced March 2024.

  23. arXiv:2403.00791  [pdf, other]

    cs.CL cs.AI q-bio.BM q-bio.QM

    $\textit{L+M-24}$: Building a Dataset for Language + Molecules @ ACL 2024

    Authors: Carl Edwards, Qingyun Wang, Lawrence Zhao, Heng Ji

    Abstract: Language-molecule models have emerged as an exciting direction for molecular discovery and understanding. However, training these models is challenging due to the scarcity of molecule-language pair datasets. At this point, datasets have been released which are 1) small and scraped from existing databases, 2) large but noisy and constructed by performing entity linking on the scientific literature,…

    Submitted 22 February, 2024; originally announced March 2024.

    Comments: The dataset, finetuned baselines, and evaluation code are released publicly at https://github.com/language-plus-molecules/LPM-24-Dataset through https://huggingface.co/language-plus-molecules

  24. arXiv:2402.19275  [pdf, other]

    eess.SY cs.LG

    Adaptive Testing Environment Generation for Connected and Automated Vehicles with Dense Reinforcement Learning

    Authors: Jingxuan Yang, Ruoxuan Bai, Haoyuan Ji, Yi Zhang, Jianming Hu, Shuo Feng

    Abstract: The assessment of safety performance plays a pivotal role in the development and deployment of connected and automated vehicles (CAVs). A common approach involves designing testing scenarios based on prior knowledge of CAVs (e.g., surrogate models), conducting tests in these scenarios, and subsequently evaluating CAVs' safety performances. However, substantial differences between CAVs and the prio…

    Submitted 29 February, 2024; originally announced February 2024.

  25. arXiv:2402.18077  [pdf, ps, other]

    astro-ph.SR

    Locating heating channels of the solar corona in a plage region with the aid of high-resolution 10830 Å filtergrams

    Authors: Parida Hashim, Fangyu Xu, Ya Wang, Weijie Men, Jinhua Shen, Yingna Su, Jianping Li, Zhenyu Jin, Haisheng Ji

    Abstract: In this paper, with a set of high-resolution He I 10830 Å filtergrams, we select an area in a plage, very likely an EUV moss area, as an interface layer to follow the clues of coronal heating channels down to the photosphere. The filtergrams are obtained from the 1-meter aperture New Vacuum Solar Telescope (NVST). We make a distinction between the darker and the brighter regions in the selected ar…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: ApJ accepted for publication. 11 pages, 7 figures

  26. arXiv:2402.16315  [pdf, other]

    cs.CV cs.CL

    Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models

    Authors: Jeonghwan Kim, Heng Ji

    Abstract: Recent advances in instruction-tuned Large Vision-Language Models (LVLMs) have imbued the models with the ability to generate high-level, image-grounded explanations with ease. While such capability is largely attributed to the rich world knowledge contained within the Large Language Models (LLMs), our work reveals their shortcomings in fine-grained visual categorization (FGVC) across six differen…

    Submitted 11 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  27. arXiv:2402.15796  [pdf]

    cs.AI cs.HC

    Construction and application of artificial intelligence crowdsourcing map based on multi-track GPS data

    Authors: Yong Wang, Yanlin Zhou, Huan Ji, Zheng He, Xinyu Shen

    Abstract: In recent years, the rapid development of high-precision map technology combined with artificial intelligence has ushered in a new development opportunity in the field of intelligent vehicles. High-precision map technology is an important guarantee for intelligent vehicles to achieve autonomous driving. However, due to the lack of research on high-precision map technology, it is difficult to ratio…

    Submitted 24 February, 2024; originally announced February 2024.

  28. arXiv:2402.14312  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.EP astro-ph.GA

    The Jiao Tong University Spectroscopic Telescope Project

    Authors: JUST Team, Chengze Liu, Ying Zu, Fabo Feng, Zhaoyu Li, Yu Yu, Hua Bai, Xiangqun Cui, Bozhong Gu, Yizhou Gu, Jiaxin Han, Yonghui Hou, Zhongwen Hu, Hangxin Ji, Yipeng Jing, Wei Li, Zhaoxiang Qi, Xianyu Tan, Cairang Tian, Dehua Yang, Xiangyan Yuan, Chao Zhai, Congcong Zhang, Jun Zhang, Haotong Zhang, et al. (6 additional authors not shown)

    Abstract: The Jiao Tong University Spectroscopic Telescope (JUST) is a 4.4-meter f/6.0 segmented-mirror telescope dedicated to spectroscopic observations. The JUST primary mirror is composed of 18 hexagonal segments, each with a diameter of 1.1 m. JUST provides two Nasmyth platforms for placing science instruments. One Nasmyth focus fits a field of view of 10 arcmin and the other has an extended field of vie…

    Submitted 29 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 28 pages, 6 figures

  29. arXiv:2402.14221  [pdf, other]

    cs.DC

    Towards singular optimality in the presence of local initial knowledge

    Authors: Hongyan Ji, Sriram V. Pemmaraju

    Abstract: The Knowledge Till rho CONGEST model is a variant of the classical CONGEST model of distributed computing in which each vertex v has initial knowledge of the radius-rho ball centered at v. The most commonly studied variants of the CONGEST model are KT0 CONGEST in which nodes initially know nothing about their neighbors and KT1 CONGEST in which nodes initially know the IDs of all their neighbors. I…

    Submitted 22 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  30. arXiv:2402.11943  [pdf, other]

    cs.CL

    LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation

    Authors: Keyang Xuan, Li Yi, Fan Yang, Ruochen Wu, Yi R. Fung, Heng Ji

    Abstract: The rise of multimodal misinformation on social platforms poses significant challenges for individuals and societies. Its increased credibility and broader impact compared to textual misinformation make detection complex, requiring robust reasoning across diverse media types and profound knowledge for accurate verification. The emergence of Large Vision Language Model (LVLM) offers a potential sol…

    Submitted 19 February, 2024; originally announced February 2024.

  31. arXiv:2402.11324  [pdf, other]

    cs.CL

    EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries

    Authors: Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Heng Ji

    Abstract: The dynamic nature of real-world information necessitates efficient knowledge editing (KE) in large language models (LLMs) for knowledge updating. However, current KE approaches, which typically operate on (subject, relation, object) triples, ignore the contextual information and the relation among different knowledge. Such editing methods could thus encounter an uncertain editing boundary, leavin…

    Submitted 17 February, 2024; originally announced February 2024.

  32. arXiv:2402.11060  [pdf, other]

    cs.CL cs.AI cs.IR

    Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

    Authors: Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan, ChengXiang Zhai, Heng Ji

    Abstract: The increasing demand for personalized interactions with large language models (LLMs) calls for the development of methodologies capable of accurately and efficiently identifying user opinions and preferences. Retrieval augmentation emerges as an effective strategy, as it can accommodate a vast number of users without the costs from fine-tuning. Existing research, however, has largely focused on e…

    Submitted 16 February, 2024; originally announced February 2024.

  33. arXiv:2402.10980  [pdf, other]

    physics.chem-ph cs.AI cs.CE cs.LG

    ChemReasoner: Heuristic Search over a Large Language Model's Knowledge Space using Quantum-Chemical Feedback

    Authors: Henry W. Sprueill, Carl Edwards, Khushbu Agarwal, Mariefel V. Olarte, Udishnu Sanyal, Conrad Johnston, Hongbin Liu, Heng Ji, Sutanay Choudhury

    Abstract: The discovery of new catalysts is essential for the design of new and more efficient chemical processes in order to transition to a sustainable future. We introduce an AI-guided computational screening framework unifying linguistic reasoning with quantum-chemistry based feedback from 3D atomistic representations. Our approach formulates catalyst discovery as an uncertain environment where an agent…

    Submitted 6 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 8 pages, accepted by ICML 2024

  34. arXiv:2402.09463  [pdf]

    eess.IV

    Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

    Authors: Kelly Payette, Céline Steger, Roxane Licandro, Priscille de Dumast, Hongwei Bran Li, Matthew Barkovich, Liu Li, Maik Dannecker, Chen Chen, Cheng Ouyang, Niccolò McConnell, Alina Miron, Yongmin Li, Alena Uus, Irina Grigorescu, Paula Ramirez Gilliland, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Haoyu Wang, Ziyan Huang, Jin Ye, Mireia Alenyà, Valentin Comte, Oscar Camara, et al. (42 additional authors not shown)

    Abstract: Segmentation is a critical step in analyzing the developing human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648

  35. arXiv:2402.09369  [pdf, other]

    cs.CL

    Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking

    Authors: Yi Fung, Ruining Zhao, Jae Doo, Chenkai Sun, Heng Ji

    Abstract: Pretrained large language models have revolutionized many applications but still face challenges related to cultural bias and a lack of cultural commonsense knowledge crucial for guiding cross-culture communication and interactions. Recognizing the shortcomings of existing methods in capturing the diverse and rich cultures across the world, this paper introduces a novel approach for massively mult…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: preprint

  36. arXiv:2402.07401  [pdf, other]

    cs.CL

    Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate

    Authors: Kyungha Kim, Sangyun Lee, Kung-Hsiang Huang, Hou Pong Chan, Manling Li, Heng Ji

    Abstract: Fact-checking research has extensively explored verification but less so the generation of natural-language explanations, crucial for user trust. While Large Language Models (LLMs) excel in text generation, their capability for producing faithful explanations in fact-checking remains underexamined. Our study investigates LLMs' ability to generate such explanations, finding that zero-shot prompts o…

    Submitted 11 February, 2024; originally announced February 2024.

  37. arXiv:2402.07016  [pdf, other]

    cs.AI

    REALM: RAG-Driven Enhancement of Multimodal Electronic Health Records Analysis via Large Language Models

    Authors: Yinghao Zhu, Changyu Ren, Shiyun Xie, Shukai Liu, Hangyuan Ji, Zixiang Wang, Tao Sun, Long He, Zhoujun Li, Xi Zhu, Chengwei Pan

    Abstract: The integration of multimodal Electronic Health Records (EHR) data has significantly improved clinical predictive capabilities. Leveraging clinical notes and multivariate time-series EHR, existing models often lack the medical context relevant to clinical tasks, prompting the incorporation of external knowledge, particularly from the knowledge graph (KG). Previous approaches with KG knowledge have…

    Submitted 10 February, 2024; originally announced February 2024.

  38. arXiv:2402.06193  [pdf, other]

    astro-ph.SR physics.plasm-ph

    Experimental study of Alfvén wave reflection from an Alfvén-speed gradient relevant to the solar coronal holes

    Authors: Sayak Bose, Jason M. TenBarge, Troy Carter, Michael Hahn, Hantao Ji, James Juno, Daniel Wolf Savin, Shreekrishna Tripathi, Stephen Vincena

    Abstract: We report the first experimental detection of a reflected Alfvén wave from an Alfvén-speed gradient under conditions similar to those in coronal holes. The experiments were conducted in the Large Plasma Device at the University of California, Los Angeles. We present the experimentally measured dependence of the coefficient of reflection versus the wave inhomogeneity parameter, i.e., the ratio of t…

    Submitted 9 February, 2024; originally announced February 2024.

  39. arXiv:2402.06190  [pdf, other]

    cs.CV cs.LG

    Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain

    Authors: Amin Karimi Monsefi, Payam Karisani, Mengxi Zhou, Stacey Choi, Nathan Doble, Heng Ji, Srinivasan Parthasarathy, Rajiv Ramnath

    Abstract: Standard modern machine-learning-based imaging methods have faced challenges in medical applications due to the high cost of dataset construction and, thereby, the limited labeled training data available. Additionally, upon deployment, these methods are usually used to process a large volume of data on a daily basis, imposing a high maintenance cost on medical facilities. In this paper, we introdu…

    Submitted 9 February, 2024; originally announced February 2024.

  40. arXiv:2402.05923  [pdf, other]

    math.OC math.NA physics.flu-dyn

    Mean field control of droplet dynamics with high order finite element computations

    Authors: Guosheng Fu, Hangjie Ji, Will Pazner, Wuchen Li

    Abstract: Liquid droplet dynamics are widely used in biological and engineering applications, which contain complex interfacial instabilities and pattern formulation such as droplet merging, splitting, and transport. This paper studies a class of mean field control formulation towards these droplet dynamics. They are used to control and maintain the manipulation of droplets in applications. We first formula…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 26 pages, 13 figures

  41. arXiv:2402.03005  [pdf, other]

    cond-mat.supr-con

    Topological metal and high-order Dirac point in cubic Rashba model

    Authors: Haijiao Ji, Ning Zhang, Noah F. Q. Yuan

    Abstract: We investigate the properties of the two-dimensional model with Rashba-type spin-orbit coupling cubic in electron momentum. In the normal phase, edge states emerge on open boundaries. In the superconducting phase, edge states could evolve into gapped fermionic edge states. Applications to realistic materials of interface superconductors are also discussed.

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, 1 table

  42. arXiv:2402.02462  [pdf, ps, other]

    quant-ph

    Quantum teleportation based on the elegant joint measurement

    Authors: Dong Ding, Ming-Xing Yu, Ying-Qiu He, Hao-Sen Ji, Ting Gao, Feng-Li Yan

    Abstract: As a generalization of the well-known Bell state measurement (BSM), the elegant joint measurement (EJM) is a kind of novel two-qubit joint measurement, parameterized by a subtle phase factor $θ\in [0,π/2]$. We explore quantum teleportation based on the EJM, inspired by Gisin's idea that quantum entanglement not only provides quantum channel and also quantum joint measurement for quantum teleportat…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 8 pages, 3 figures

  43. arXiv:2402.01030  [pdf, other]

    cs.CL cs.AI

    Executable Code Actions Elicit Better LLM Agents

    Authors: Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji

    Abstract: Large Language Model (LLM) agents, capable of performing a broad range of actions, such as invoking tools and controlling robots, show great potential in tackling real-world challenges. LLM agents are typically prompted to produce actions by generating JSON or text in a pre-defined format, which is usually limited by constrained action space (e.g., the scope of pre-defined tools) and restricted fl…

    Submitted 23 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024; Code, data, model, and demo are available at https://github.com/xingyaoww/code-act

  44. arXiv:2402.00856  [pdf, other]

    cs.CL

    Towards Efficient and Exact Optimization of Language Model Alignment

    Authors: Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang

    Abstract: The alignment of language models with human preferences is vital for their application in real-world tasks. The problem is formulated as optimizing the model's policy to maximize the expected reward that reflects human preferences with minimal deviation from the initial policy. While considered as a straightforward solution, reinforcement learning (RL) suffers from high variance in policy updates,…

    Submitted 23 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 24 pages, 9 figures

  45. arXiv:2401.17923  [pdf, other]

    physics.plasm-ph

    Radiatively Cooled Magnetic Reconnection Experiments Driven by Pulsed Power

    Authors: R Datta, K Chandler, C E Myers, J P Chittenden, A J Crilly, C Aragon, D J Ampleford, J T Banasek, A Edens, W R Fox, S B Hansen, E C Harding, C A Jennings, H Ji, C C Kuranz, S V Lebedev, Q Looker, S G Patel, A J Porwitzky, G A Shipley, D A Uzdensky, D A Yager-Elorriaga, J D Hare

    Abstract: We present evidence for strong radiative cooling in a pulsed-power-driven magnetic reconnection experiment. Two aluminum exploding wire arrays, driven by a 20 MA peak current, 300 ns rise time pulse from the Z machine (Sandia National Laboratories), generate strongly-driven plasma flows ($M_A \approx 7$) with anti-parallel magnetic fields, which form a reconnection layer ($S_L \approx 120$) at the…

    Submitted 31 January, 2024; originally announced January 2024.

  46. arXiv:2401.16865  [pdf, other]

    cs.SE

    Depends-Kotlin: A Cross-Language Kotlin Dependency Extractor

    Authors: Qiong Feng, Xiaotian Ma, Huan Ji, Peng Liang

    Abstract: Since Google introduced Kotlin as an official programming language for developing Android apps in 2017, Kotlin has gained widespread adoption in Android development. However, compared to Java, there is limited support for Kotlin code dependency analysis, which is the foundation to software analysis. To bridge this gap, we developed Depends-Kotlin to extract entities and their dependencies in Kotli…

    Submitted 30 January, 2024; originally announced January 2024.

  47. arXiv:2401.15858  [pdf, other]

    astro-ph.SR

    Transverse oscillation of prominence and filament induced by an EUV wave from the farside of the Sun

    Authors: Yanjie Zhang, Qingmin Zhang, De-chao Song, Haisheng Ji

    Abstract: In this paper, we report our multi-angle observations of the transverse oscillation of a prominence and a filament induced by an EUV wave originating from the farside of the Sun on 2014 September 1. The prominence oscillation was simultaneously observed by both Atmospheric Imaging Assembly (AIA) onboard the Solar Dynamics Observatory (SDO) spacecraft and Extreme-UltraViolet Imager (EUVI) onboard t…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 12 pages, 11 figures, accepted for publication in ApJ

  48. arXiv:2401.14104  [pdf]

    q-bio.BM

    Label-free detection of exosomes from different cellular sources based on surface-enhanced Raman spectroscopy combined with machine learning models

    Authors: Yang Li, Xiaoming Lyu, Kuo Zhan, Haoyu Ji, Lei Qin, JianAn Huang

    Abstract: Exosomes are significant facilitators of inter-cellular communication that can unveil cell-cell interactions, signaling pathways, regulatory mechanisms and disease diagnostics. Nonetheless, current analysis required large amount of data for exosome identification that it hampers efficient and timely mechanism study and diagnostics. Here, we used a machine-learning assisted Surface-enhanced Raman s…

    Submitted 26 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 5 figures

  49. arXiv:2401.10472  [pdf, other]

    cs.CL

    Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences

    Authors: Hongyi Liu, Qingyun Wang, Payam Karisani, Heng Ji

    Abstract: Named entity recognition is a key component of Information Extraction (IE), particularly in scientific domains such as biomedicine and chemistry, where large language models (LLMs), e.g., ChatGPT, fall short. We investigate the applicability of transfer learning for enhancing a named entity recognition model trained in the biomedical domain (the source domain) to be used in the chemical domain (th…

    Submitted 31 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 21 pages; Accepted by the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies; Code, data, and resources are publicly available for research purposes: https://github.com/Lhtie/Bio-Domain-Transfer

  50. arXiv:2401.10189  [pdf, other]

    cs.CL cs.AI cs.LG

    Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction

    Authors: Qingyun Wang, Zixuan Zhang, Hongxiang Li, Xuan Liu, Jiawei Han, Huimin Zhao, Heng Ji

    Abstract: Fine-grained few-shot entity extraction in the chemical domain faces two unique challenges. First, compared with entity extraction tasks in the general domain, sentences from chemical papers usually contain more entities. Moreover, entity extraction models usually have difficulty extracting entities of long-tailed types. In this paper, we propose Chem-FINESE, a novel sequence-to-sequence (seq2seq)…

    Submitted 29 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 16 pages. Accepted by Findings of the Association for Computational Linguistics: EACL 2024. Code and resources are available at https://github.com/EagleW/Chem-FINESE