Skip to main content

Showing 1–50 of 79 results for author: Bollegala, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16807  [pdf, other

    cs.CL

    Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning

    Authors: Tianhui Zhang, Bei Peng, Danushka Bollegala

    Abstract: Generative Commonsense Reasoning (GCR) requires a model to reason about a situation using commonsense knowledge, while generating coherent sentences. Although the quality of the generated sentences is crucial, the diversity of the generation is equally important because it reflects the model's ability to use a range of commonsense knowledge facts. Large Language Models (LLMs) have shown proficienc… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 16 pages, 6 figures

  2. arXiv:2404.12059  [pdf, other

    cs.CL

    Constituents Correspond to Word Sequence Patterns among Sentences with Equivalent Predicate-Argument Structures: Unsupervised Constituency Parsing by Span Matching

    Authors: Junjie Chen, Xiangheng He, Danushka Bollegala, Yusuke Miyao

    Abstract: Unsupervised constituency parsing is about identifying word sequences that form a syntactic unit (i.e., constituents) in a target sentence. Linguists identify the constituent by evaluating a set of Predicate-Argument Structure (PAS) equivalent sentences where we find the constituent corresponds to frequent word sequences. However, such information is unavailable to previous parsing methods which i… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2403.18025  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Improving Pre-trained Language Model Sensitivity via Mask Specific losses: A case study on Biomedical NER

    Authors: Micheal Abaho, Danushka Bollegala, Gary Leeming, Dan Joyce, Iain E Buchan

    Abstract: Adapting language models (LMs) to novel domains is often achieved through fine-tuning a pre-trained LM (PLM) on domain-specific data. Fine-tuning introduces new knowledge into an LM, enabling it to comprehend and efficiently perform a target domain task. Fine-tuning can however be inadvertently insensitive if it ignores the wide array of disparities (e.g in word meaning) between source and target… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Paper alrerady accepted for publishing by the NAACL 2024 conference (main conference paper)

  4. arXiv:2403.14001  [pdf, other

    cs.CL

    Evaluating Unsupervised Dimensionality Reduction Methods for Pretrained Sentence Embeddings

    Authors: Gaifan Zhang, Yi Zhou, Danushka Bollegala

    Abstract: Sentence embeddings produced by Pretrained Language Models (PLMs) have received wide attention from the NLP community due to their superior performance when representing texts in numerous downstream applications. However, the high dimensionality of the sentence embeddings produced by PLMs is problematic when representing large numbers of sentences in memory- or compute-constrained devices. As a so… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  5. arXiv:2403.00226  [pdf, other

    cs.CL

    A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection

    Authors: Taichi Aida, Danushka Bollegala

    Abstract: Detecting temporal semantic changes of words is an important task for various NLP applications that must make time-sensitive predictions. Lexical Semantic Change Detection (SCD) task involves predicting whether a given target word, $w$, changes its meaning between two different text corpora, $C_1$ and $C_2$. For this purpose, we propose a supervised two-staged SCD method that uses existing Word-in… ▽ More

    Submitted 10 April, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

  6. arXiv:2402.14258  [pdf, other

    cs.CL

    Eagle: Ethical Dataset Given from Real Interactions

    Authors: Masahiro Kaneko, Danushka Bollegala, Timothy Baldwin

    Abstract: Recent studies have demonstrated that large language models (LLMs) have ethical-related problems such as social biases, lack of moral reasoning, and generation of offensive content. The existing evaluation metrics and methods to address these ethical challenges use datasets intentionally created by instructing humans to create instances including ethical problems. Therefore, the data does not refl… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  7. arXiv:2401.15585  [pdf, other

    cs.CL

    Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki, Timothy Baldwin

    Abstract: There exist both scalable tasks, like reading comprehension and fact-checking, where model performance improves with model size, and unscalable tasks, like arithmetic reasoning and symbolic reasoning, where model performance does not necessarily improve with model size. Large language models (LLMs) equipped with Chain-of-Thought (CoT) prompting are able to make accurate incremental predictions eve… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  8. arXiv:2401.08511  [pdf, other

    cs.CL

    The Gaps between Pre-train and Downstream Settings in Bias Evaluation and Debiasing

    Authors: Masahiro Kaneko, Danushka Bollegala, Timothy Baldwin

    Abstract: The output tendencies of Pre-trained Language Models (PLM) vary markedly before and after Fine-Tuning (FT) due to the updates to the model parameters. These divergences in output tendencies result in a gap in the social biases of PLMs. For example, there exits a low correlation between intrinsic bias scores of a PLM and its extrinsic bias scores under FT-based debiasing methods. Additionally, appl… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  9. arXiv:2310.12936  [pdf, other

    cs.CL

    A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models

    Authors: Yi Zhou, Jose Camacho-Collados, Danushka Bollegala

    Abstract: Various types of social biases have been reported with pretrained Masked Language Models (MLMs) in prior work. However, multiple underlying factors are associated with an MLM such as its model size, size of the training data, training objectives, the domain from which pretraining data is sampled, tokenization, and languages present in the pretrained corpora, to name a few. It remains unclear as to… ▽ More

    Submitted 22 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 main conference

  10. arXiv:2310.10400  [pdf, other

    cs.CL cs.AI cs.LG

    Can Word Sense Distribution Detect Semantic Changes of Words?

    Authors: Xiaohang Tang, Yi Zhou, Taichi Aida, Procheta Sen, Danushka Bollegala

    Abstract: Semantic Change Detection (SCD) of words is an important task for various NLP applications that must make time-sensitive predictions. Some words are used over time in novel ways to express new meanings, and these new meanings establish themselves as novel senses of existing words. On the other hand, Word Sense Disambiguation (WSD) methods associate ambiguous words with sense ids, depending on the… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP 2023

    Journal ref: EMNLP 2023

  11. arXiv:2310.10397  [pdf, other

    cs.CL

    $\textit{Swap and Predict}$ -- Predicting the Semantic Changes in Words across Corpora by Context Swapping

    Authors: Taichi Aida, Danushka Bollegala

    Abstract: Meanings of words change over time and across domains. Detecting the semantic changes of words is an important task for various NLP applications that must make time-sensitive predictions. We consider the problem of predicting whether a given target word, $w$, changes its meaning between two different text corpora, $\mathcal{C}_1$ and $\mathcal{C}_2$. For this purpose, we propose… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP2023

  12. arXiv:2309.10551  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings

    Authors: Danushka Bollegala, Shuichi Otake, Tomoya Machide, Ken-ichi Kawarabayashi

    Abstract: We propose a Neighbourhood-Aware Differential Privacy (NADP) mechanism considering the neighbourhood of a word in a pretrained static word embedding space to determine the minimal amount of noise required to guarantee a specified privacy level. We first construct a nearest neighbour graph over the words using their embeddings, and factorise it into a set of connected components (i.e. neighbourhood… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted to IJCNLP-AACL 2023

  13. arXiv:2309.09092  [pdf, other

    cs.CL

    The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: Pre-trained language models trained on large-scale data have learned serious levels of social biases. Consequently, various methods have been proposed to debias pre-trained models. Debiasing methods need to mitigate only discriminatory bias information from the pre-trained models, while retaining information that is useful for the downstream tasks. In previous research, whether useful information… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: IJCNLP-AACL 2023

  14. arXiv:2309.07251  [pdf, other

    cs.CL

    In-Contextual Gender Bias Suppression for Large Language Models

    Authors: Daisuke Oba, Masahiro Kaneko, Danushka Bollegala

    Abstract: Despite their impressive performance in a wide range of NLP tasks, Large Language Models (LLMs) have been reported to encode worrying-levels of gender biases. Prior work has proposed debiasing methods that require human labelled examples, data augmentation and fine-tuning of LLMs, which are computationally costly. Moreover, one might not even have access to the model parameters for performing debi… ▽ More

    Submitted 20 February, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: EACL 2024 Findings - Long Paper

  15. arXiv:2309.06363  [pdf, other

    cs.CL

    Learning to Predict Concept Ordering for Common Sense Generation

    Authors: Tianhui Zhang, Danushka Bollegala, Bei Peng

    Abstract: Prior work has shown that the ordering in which concepts are shown to a commonsense generator plays an important role, affecting the quality of the generated sentence. However, it remains a challenge to determine the optimal ordering of a given set of concepts such that a natural sentence covering all the concepts could be generated from a pretrained generator. To understand the relationship betwe… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 10 pages

  16. arXiv:2307.07358  [pdf, other

    cs.RO

    Learn from Incomplete Tactile Data: Tactile Representation Learning with Masked Autoencoders

    Authors: Guanqun Cao, Jiaqi Jiang, Danushka Bollegala, Shan Luo

    Abstract: The missing signal caused by the objects being occluded or an unstable sensor is a common challenge during data collection. Such missing signals will adversely affect the results obtained from the data, and this issue is observed more frequently in robotic tactile perception. In tactile perception, due to the limited working space and the dynamic environment, the contact between the tactile sensor… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper is accepted at IROS 2023

  17. arXiv:2306.12705  [pdf, other

    cs.RO

    Multimodal Zero-Shot Learning for Tactile Texture Recognition

    Authors: Guanqun Cao, Jiaqi Jiang, Danushka Bollegala, Min Li, Shan Luo

    Abstract: Tactile sensing plays an irreplaceable role in robotic material recognition. It enables robots to distinguish material properties such as their local geometry and textures, especially for materials like textiles. However, most tactile recognition methods can only classify known materials that have been touched and trained with tactile data, yet cannot classify unknown materials that are not traine… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Under review at Robotics and Autonomous Systems

  18. arXiv:2305.19092  [pdf, other

    cs.CL

    Together We Make Sense -- Learning Meta-Sense Embeddings from Pretrained Static Sense Embeddings

    Authors: Haochen Luo, Yi Zhou, Danushka Bollegala

    Abstract: Sense embedding learning methods learn multiple vectors for a given ambiguous word, corresponding to its different word senses. For this purpose, different methods have been proposed in prior work on sense embedding learning that use different sense inventories, sense-tagged corpora and learning methods. However, not all existing sense embeddings cover all senses of ambiguous words equally well du… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL 2023

  19. arXiv:2305.16372  [pdf, other

    cs.LG cs.AI physics.chem-ph

    Metrics for quantifying isotropy in high dimensional unsupervised clustering tasks in a materials context

    Authors: Samantha Durdy, Michael W. Gaultois, Vladimir Gusev, Danushka Bollegala, Matthew J. Rosseinsky

    Abstract: Clustering is a common task in machine learning, but clusters of unlabelled data can be hard to quantify. The application of clustering algorithms in chemistry is often dependant on material representation. Ascertaining the effects of different representations, clustering algorithms, or data transformations on the resulting clusters is difficult due to the dimensionality of these data. We present… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 31 pages, 6 figures

  20. arXiv:2305.10610  [pdf, other

    cs.CL

    Solving Cosine Similarity Underestimation between High Frequency Words by L2 Norm Discounting

    Authors: Saeth Wannasuphoprasit, Yi Zhou, Danushka Bollegala

    Abstract: Cosine similarity between two words, computed using their contextualised token embeddings obtained from masked language models (MLMs) such as BERT has shown to underestimate the actual similarity between those words (Zhou et al., 2022). This similarity underestimation problem is particularly severe for highly frequent words. Although this problem has been noted in prior work, no solution has been… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 7 pages, 5 figures. To be published in the Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, 9-14 July 2023, Toronto, Canada

  21. arXiv:2305.08654  [pdf, other

    cs.CL

    Unsupervised Semantic Variation Prediction using the Distribution of Sibling Embeddings

    Authors: Taichi Aida, Danushka Bollegala

    Abstract: Languages are dynamic entities, where the meanings associated with words constantly change with time. Detecting the semantic variation of words is an important task for various NLP applications that must make time-sensitive predictions. Existing work on semantic variation prediction have predominantly focused on comparing some form of an averaged contextualised representation of a target word comp… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Findings of ACL2023

  22. arXiv:2302.05619  [pdf, other

    cs.CL cs.AI

    Evaluating the Robustness of Discrete Prompts

    Authors: Yoichi Ishibashi, Danushka Bollegala, Katsuhito Sudoh, Satoshi Nakamura

    Abstract: Discrete prompts have been used for fine-tuning Pre-trained Language Models for diverse NLP tasks. In particular, automatic methods that generate discrete prompts from a small set of training instances have reported superior performance. However, a closer look at the learnt prompts reveals that they contain noisy and counter-intuitive lexical constructs that would not be encountered in manually-wr… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted at EACL 2023

  23. arXiv:2301.12074  [pdf, other

    cs.CL

    Comparing Intrinsic Gender Bias Evaluation Measures without using Human Annotated Examples

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: Numerous types of social biases have been identified in pre-trained language models (PLMs), and various intrinsic bias evaluation measures have been proposed for quantifying those social biases. Prior works have relied on human annotated examples to compare existing intrinsic bias evaluation measures. However, this approach is not easily adaptable to different languages nor amenable to large scale… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: EACL 2023

  24. arXiv:2301.06826  [pdf, other

    cs.RO

    Vis2Hap: Vision-based Haptic Rendering by Cross-modal Generation

    Authors: Guanqun Cao, Jiaqi Jiang, Ningtao Mao, Danushka Bollegala, Min Li, Shan Luo

    Abstract: To assist robots in teleoperation tasks, haptic rendering which allows human operators access a virtual touch feeling has been developed in recent years. Most previous haptic rendering methods strongly rely on data collected by tactile sensors. However, tactile data is not widely available for robots due to their limited reachable space and the restrictions of tactile sensors. To eliminate the nee… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: This paper is accepted at ICRA 2023

  25. arXiv:2210.14815  [pdf, other

    cs.CL

    On the Curious Case of $\ell_2$ norm of Sense Embeddings

    Authors: Yi Zhou, Danushka Bollegala

    Abstract: We show that the $\ell_2$ norm of a static sense embedding encodes information related to the frequency of that sense in the training corpus used to learn the sense embeddings. This finding can be seen as an extension of a previously known relationship for word embeddings to sense embeddings. Our experimental results show that, in spite of its simplicity, the $\ell_2$ norm of sense embeddings is a… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of EMNLP 2022

  26. arXiv:2210.02938  [pdf, other

    cs.CL

    Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: We study the relationship between task-agnostic intrinsic and task-specific extrinsic social bias evaluation measures for Masked Language Models (MLMs), and find that there exists only a weak correlation between these two types of evaluation measures. Moreover, we find that MLMs debiased using different methods still re-learn social biases during fine-tuning on downstream tasks. We identify the so… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: COLING 2022

  27. arXiv:2208.10734  [pdf, other

    cs.CL

    Learning Dynamic Contextualised Word Embeddings via Template-based Temporal Adaptation

    Authors: Xiaohang Tang, Yi Zhou, Danushka Bollegala

    Abstract: Dynamic contextualised word embeddings (DCWEs) represent the temporal semantic variations of words. We propose a method for learning DCWEs by time-adapting a pretrained Masked Language Model (MLM) using time-sensitive templates. Given two snapshots $C_1$ and $C_2$ of a corpus taken respectively at two distinct timestamps $T_1$ and $T_2$, we first propose an unsupervised method to select (a) \emph{… ▽ More

    Submitted 13 June, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted to ACL 2023

  28. arXiv:2206.08841  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Random projections and Kernelised Leave One Cluster Out Cross-Validation: Universal baselines and evaluation tools for supervised machine learning for materials properties

    Authors: Samantha Durdy, Michael Gaultois, Vladimir Gusev, Danushka Bollegala, Matthew J. Rosseinsky

    Abstract: With machine learning being a popular topic in current computational materials science literature, creating representations for compounds has become common place. These representations are rarely compared, as evaluating their performance - and the performance of the algorithms that they are used with - is non-trivial. With many materials datasets containing bias and skew caused by the research pro… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 16 pages including references, 9 figures

  29. arXiv:2205.09867  [pdf, other

    cs.CL

    Gender Bias in Meta-Embeddings

    Authors: Masahiro Kaneko, Danushka Bollegala, Naoaki Okazaki

    Abstract: Different methods have been proposed to develop meta-embeddings from a given set of source embeddings. However, the source embeddings can contain unfair gender-related biases, and how these influence the meta-embeddings has not been studied yet. We study the gender bias in meta-embeddings created under three different settings: (1) meta-embedding multiple sources without performing any debiasing (… ▽ More

    Submitted 6 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Findings of EMNLP 2022

  30. arXiv:2205.00551  [pdf, other

    cs.CL

    Gender Bias in Masked Language Models for Multiple Languages

    Authors: Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki

    Abstract: Masked Language Models (MLMs) pre-trained by predicting masked tokens on large corpora have been used successfully in natural language processing tasks for a variety of languages. Unfortunately, it was reported that MLMs also learn discriminative biases regarding attributes such as gender and race. Because most studies have focused on MLMs in English, the bias of MLMs in other languages has rarely… ▽ More

    Submitted 4 May, 2022; v1 submitted 1 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  31. arXiv:2204.13097  [pdf, other

    cs.CL

    Learning to Borrow -- Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion

    Authors: Huda Hakami, Mona Hakami, Angrosh Mandya, Danushka Bollegala

    Abstract: Prior work on integrating text corpora with knowledge graphs (KGs) to improve Knowledge Graph Embedding (KGE) have obtained good performance for entities that co-occur in sentences in text corpora. Such sentences (textual mentions of entity-pairs) are represented as Lexicalised Dependency Paths (LDPs) between two entities. However, it is not possible to represent relations between entities that do… ▽ More

    Submitted 28 April, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Accepted in NAACL 2022

  32. arXiv:2204.12386  [pdf, other

    cs.CL cs.AI cs.LG

    Learning Meta Word Embeddings by Unsupervised Weighted Concatenation of Source Embeddings

    Authors: Danushka Bollegala

    Abstract: Given multiple source word embeddings learnt using diverse algorithms and lexical resources, meta word embedding learning methods attempt to learn more accurate and wide-coverage word embeddings. Prior work on meta-embedding has repeatedly discovered that simple vector concatenation of the source embeddings to be a competitive baseline. However, it remains unclear as to why and when simple vec… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-2022)

  33. arXiv:2204.11660  [pdf, ps, other

    cs.CL cs.AI cs.LG

    A Survey on Word Meta-Embedding Learning

    Authors: Danushka Bollegala, James O'Neill

    Abstract: Meta-embedding (ME) learning is an emerging approach that attempts to learn more accurate word embeddings given existing (source) word embeddings as the sole input. Due to their ability to incorporate semantics from multiple source embeddings in a compact manner with superior performance, ME learning has gained popularity among practitioners in NLP. To the best of our knowledge, there exist no… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-2022)

    Journal ref: Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI-2022)

  34. arXiv:2204.07746  [pdf, ps, other

    cs.CL

    Unsupervised Attention-based Sentence-Level Meta-Embeddings from Contextualised Language Models

    Authors: Keigo Takahashi, Danushka Bollegala

    Abstract: A variety of contextualised language models have been proposed in the NLP community, which are trained on diverse corpora to produce numerous Neural Language Models (NLMs). However, different NLMs have reported different levels of performances in downstream NLP applications when used as text representations. We propose a sentence-level meta-embedding learning method that takes independently traine… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

  35. arXiv:2204.03489  [pdf, other

    cs.CL cs.LG

    Position-based Prompting for Health Outcome Generation

    Authors: M. Abaho, D. Bollegala, P. Williamson, S. Dodd

    Abstract: Probing Pre-trained Language Models (PLMs) using prompts has indirectly implied that language models (LMs) can be treated as knowledge bases. To this end, this phenomena has been effective especially when these LMs are fine-tuned towards not just data of a specific domain, but also to the style or linguistic pattern of the prompts themselves. We observe that, satisfying a particular linguistic pat… ▽ More

    Submitted 30 March, 2022; originally announced April 2022.

  36. arXiv:2203.07523  [pdf, other

    cs.CL

    Sense Embeddings are also Biased--Evaluating Social Biases in Static and Contextualised Sense Embeddings

    Authors: Yi Zhou, Masahiro Kaneko, Danushka Bollegala

    Abstract: Sense embedding learning methods learn different embeddings for the different senses of an ambiguous word. One sense of an ambiguous word might be socially biased while its other senses remain unbiased. In comparison to the numerous prior work evaluating the social biases in pretrained word embeddings, the biases in sense embeddings have been relatively understudied. We create a benchmark dataset… ▽ More

    Submitted 16 March, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  37. Assessment of contextualised representations in detecting outcome phrases in clinical trials

    Authors: Micheal Abaho, Danushka Bollegala, Paula R Williamson, Susanna Dodd

    Abstract: Automating the recognition of outcomes reported in clinical trials using machine learning has a huge potential of speeding up access to evidence necessary in healthcare decision-making. Prior research has however acknowledged inadequate training corpora as a challenge for the Outcome detection (OD) task. Additionally, several contextualized representations like BERT and ELMO have achieved unparall… ▽ More

    Submitted 13 March, 2022; v1 submitted 13 February, 2022; originally announced March 2022.

    Journal ref: European Journal of Biomedical Informatics, 2021

  38. arXiv:2110.02204  [pdf, other

    cs.CL

    Learning Sense-Specific Static Embeddings using Contextualised Word Embeddings as a Proxy

    Authors: Yi Zhou, Danushka Bollegala

    Abstract: Contextualised word embeddings generated from Neural Language Models (NLMs), such as BERT, represent a word with a vector that considers the semantics of the target word as well its context. On the other hand, static word embeddings such as GloVe represent words by relatively low-dimensional, memory- and compute-efficient vectors but are not sensitive to the different senses of the word. We propos… ▽ More

    Submitted 6 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted to PACLIC 35

  39. arXiv:2106.08007  [pdf, other

    cs.CL cs.AI cs.LG

    Unsupervised Abstractive Opinion Summarization by Generating Sentences with Tree-Structured Topic Guidance

    Authors: Masaru Isonuma, Junichiro Mori, Danushka Bollegala, Ichiro Sakata

    Abstract: This paper presents a novel unsupervised abstractive summarization method for opinionated texts. While the basic variational autoencoder-based models assume a unimodal Gaussian prior for the latent code of sentences, we alternate it with a recursive Gaussian mixture, where each mixture component corresponds to the latent code of a topic sentence and is mixed by a tree-structured topic distribution… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: accepted to TACL, pre-MIT Press publication version

  40. Backretrieval: An Image-Pivoted Evaluation Metric for Cross-Lingual Text Representations Without Parallel Corpora

    Authors: Mikhail Fain, Niall Twomey, Danushka Bollegala

    Abstract: Cross-lingual text representations have gained popularity lately and act as the backbone of many tasks such as unsupervised machine translation and cross-lingual information retrieval, to name a few. However, evaluation of such representations is difficult in the domains beyond standard benchmarks due to the necessity of obtaining domain-specific parallel language data across different pairs of la… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: SIGIR 2021

  41. arXiv:2104.07789  [pdf, other

    cs.CL cs.AI cs.LG

    Detect and Classify -- Joint Span Detection and Classification for Health Outcomes

    Authors: Michael Abaho, Danushka Bollegala, Paula Williamson, Susanna Dodd

    Abstract: A health outcome is a measurement or an observation used to capture and assess the effect of a treatment. Automatic detection of health outcomes from text would undoubtedly speed up access to evidence necessary in healthcare decision making. Prior work on outcome detection has modelled this task as either (a) a sequence labelling task, where the goal is to detect which text spans describe health o… ▽ More

    Submitted 10 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  42. arXiv:2104.07496  [pdf, other

    cs.CL

    Unmasking the Mask -- Evaluating Social Biases in Masked Language Models

    Authors: Masahiro Kaneko, Danushka Bollegala

    Abstract: Masked Language Models (MLMs) have shown superior performances in numerous downstream NLP tasks when used as text encoders. Unfortunately, MLMs also demonstrate significantly worrying levels of social biases. We show that the previously proposed evaluation metrics for quantifying the social biases in MLMs are problematic due to following reasons: (1) prediction accuracy of the masked tokens itself… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  43. arXiv:2104.06893  [pdf, other

    cs.CL cs.AI cs.LG

    I Wish I Would Have Loved This One, But I Didn't -- A Multilingual Dataset for Counterfactual Detection in Product Reviews

    Authors: James O'Neill, Polina Rozenshtein, Ryuichi Kiryo, Motoko Kubota, Danushka Bollegala

    Abstract: Counterfactual statements describe events that did not or cannot take place. We consider the problem of counterfactual detection (CFD) in product reviews. For this purpose, we annotate a multilingual CFD dataset from Amazon product reviews covering counterfactual statements written in English, German, and Japanese languages. The dataset is unique as it contains counterfactuals in multiple language… ▽ More

    Submitted 15 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021

  44. arXiv:2102.06603  [pdf, other

    cs.LG cs.CV

    Semantically-Conditioned Negative Samples for Efficient Contrastive Learning

    Authors: James O' Neill, Danushka Bollegala

    Abstract: Negative sampling is a limiting factor w.r.t. the generalization of metric-learned neural networks. We show that uniform negative sampling provides little information about the class boundaries and thus propose three novel techniques for efficient negative sampling: drawing negative samples from (1) the top-$k$ most semantically similar classes, (2) the top-$k$ most semantically similar samples an… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  45. arXiv:2101.10070  [pdf, other

    cs.CL

    RelWalk A Latent Variable Model Approach to Knowledge Graph Embedding

    Authors: Danushka Bollegala, Huda Hakami, Yuichi Yoshida, Ken-ichi Kawarabayashi

    Abstract: Embedding entities and relations of a knowledge graph in a low-dimensional space has shown impressive performance in predicting missing links between entities. Although progresses have been achieved, existing methods are heuristically motivated and theoretical understanding of such embeddings is comparatively underdeveloped. This paper extends the random walk model (Arora et al., 2016a) of word em… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted in EACL 2021

  46. arXiv:2101.09525  [pdf, other

    cs.CL

    Dictionary-based Debiasing of Pre-trained Word Embeddings

    Authors: Masahiro Kaneko, Danushka Bollegala

    Abstract: Word embeddings trained on large corpora have shown to encode high levels of unfair discriminatory gender, racial, religious and ethnic biases. In contrast, human-written dictionaries describe the meanings of words in a concise, objective and an unbiased manner. We propose a method for debiasing pre-trained word embeddings using dictionaries, without requiring access to the original training r… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: EACL 2021

  47. arXiv:2101.09523  [pdf, other

    cs.CL

    Debiasing Pre-trained Contextualised Embeddings

    Authors: Masahiro Kaneko, Danushka Bollegala

    Abstract: In comparison to the numerous debiasing methods proposed for the static non-contextualised word embeddings, the discriminative biases in contextualised embeddings have received relatively little attention. We propose a fine-tuning method that can be applied at token- or sentence-levels to debias pre-trained contextualised embeddings. Our proposed method can be applied to any pre-trained contextual… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: EACL 2021

  48. arXiv:2101.09313  [pdf, other

    cs.CL cs.LG

    $k$-Neighbor Based Curriculum Sampling for Sequence Prediction

    Authors: James O' Neill, Danushka Bollegala

    Abstract: Multi-step ahead prediction in language models is challenging due to the discrepancy between training and test time processes. At test time, a sequence predictor is required to make predictions given past predictions as the input, instead of the past targets that are provided during training. This difference, known as exposure bias, can lead to the compounding of errors along a generated sequence… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:1809.05916

  49. arXiv:2010.13094  [pdf, other

    cs.CL

    Autoencoding Improves Pre-trained Word Embeddings

    Authors: Masahiro Kaneko, Danushka Bollegala

    Abstract: Prior work investigating the geometry of pre-trained word embeddings have shown that word embeddings to be distributed in a narrow cone and by centering and projecting using principal component vectors one can increase the accuracy of a given set of pre-trained word embeddings. However, theoretically, this post-processing step is equivalent to applying a linear autoencoder to minimise the squared… ▽ More

    Submitted 27 October, 2020; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: COLING 2020

  50. arXiv:2008.04442  [pdf, other

    cs.RO cs.CV

    Spatio-temporal Attention Model for Tactile Texture Recognition

    Authors: Guanqun Cao, Yi Zhou, Danushka Bollegala, Shan Luo

    Abstract: Recently, tactile sensing has attracted great interest in robotics, especially for facilitating exploration of unstructured environments and effective manipulation. A detailed understanding of the surface textures via tactile sensing is essential for many of these tasks. Previous works on texture recognition using camera based tactile sensors have been limited to treating all regions in one tactil… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: 7 pages, accepted by International Conference on Intelligent Robots and Systems 2020