Skip to main content

Showing 1–20 of 20 results for author: Konrád, J

Searching in archive cs. Search in all archives.
.
  1. Estimating Distances Between People using a Single Overhead Fisheye Camera with Application to Social-Distancing Oversight

    Authors: Zhangchi Lu, Mertcan Cokbas, Prakash Ishwar, Jansuz Konrad

    Abstract: Unobtrusive monitoring of distances between people indoors is a useful tool in the fight against pandemics. A natural resource to accomplish this are surveillance cameras. Unlike previous distance estimation methods, we use a single, overhead, fisheye camera with wide area coverage and propose two approaches. One method leverages a geometric model of the fisheye lens, whereas the other method uses… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Journal ref: In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP (2023), pages 528-535

  2. arXiv:2212.11477  [pdf, other

    cs.CV

    Spatio-Visual Fusion-Based Person Re-Identification for Overhead Fisheye Images

    Authors: Mertcan Cokbas, Prakash Ishwar, Janusz Konrad

    Abstract: Person re-identification (PRID) has been thoroughly researched in typical surveillance scenarios where various scenes are monitored by side-mounted, rectilinear-lens cameras. To date, few methods have been proposed for fisheye cameras mounted overhead and their performance is lacking. In order to close this performance gap, we propose a multi-feature framework for fisheye PRID where we combine dee… ▽ More

    Submitted 25 April, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

  3. Flowstorm: Open-Source Platform with Hybrid Dialogue Architecture

    Authors: Jan Pichl, Petr Marek, Jakub Konrád, Petr Lorenc, Ondřej Kobza, Tomáš Zajíček, Jan Šedivý

    Abstract: This paper presents a conversational AI platform called Flowstorm. Flowstorm is an open-source SaaS project suitable for creating, running, and analyzing conversational applications. Thanks to the fast and fully automated build process, the dialogues created within the platform can be executed in seconds. Furthermore, we propose a novel dialogue architecture that uses a combination of tree structu… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Journal ref: NAACL Demo Track (2022) 39-45

  4. arXiv:2210.01582  [pdf, other

    cs.CV

    FRIDA: Fisheye Re-Identification Dataset with Annotations

    Authors: Mertcan Cokbas, John Bolognino, Janusz Konrad, Prakash Ishwar

    Abstract: Person re-identification (PRID) from side-mounted rectilinear-lens cameras is a well-studied problem. On the other hand, PRID from overhead fisheye cameras is new and largely unstudied, primarily due to the lack of suitable image datasets. To fill this void, we introduce the "Fisheye Re-IDentification Dataset with Annotations" (FRIDA), with 240k+ bounding-box annotations of people, captured by 3 t… ▽ More

    Submitted 19 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 8 pages

  5. arXiv:2204.10849  [pdf, other

    cs.CL

    Metric Learning and Adaptive Boundary for Out-of-Domain Detection

    Authors: Petr Lorenc, Tommaso Gargiani, Jan Pichl, Jakub Konrád, Petr Marek, Ondřej Kobza, Jan Šedivý

    Abstract: Conversational agents are usually designed for closed-world environments. Unfortunately, users can behave unexpectedly. Based on the open-world environment, we often encounter the situation that the training and test data are sampled from different distributions. Then, data from different distributions are called out-of-domain (OOD). A robust conversational agent needs to react to these OOD uttera… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to The 27th International Conference on Natural Language & Information Systems (NLDB) 2022

  6. arXiv:2109.07968  [pdf, other

    cs.CL cs.AI

    Alquist 4.0: Towards Social Intelligence Using Generative Models and Dialogue Personalization

    Authors: Jakub Konrád, Jan Pichl, Petr Marek, Petr Lorenc, Van Duy Ta, Ondřej Kobza, Lenka Hýlová, Jan Šedivý

    Abstract: The open domain-dialogue system Alquist has a goal to conduct a coherent and engaging conversation that can be considered as one of the benchmarks of social intelligence. The fourth version of the system, developed within the Alexa Prize Socialbot Grand Challenge 4, brings two main innovations. The first addresses coherence, and the second addresses the engagingness of the conversation. For innova… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: 20 pages

  7. arXiv:2104.10454  [pdf, other

    cs.CL cs.AI cs.LG

    Text Summarization of Czech News Articles Using Named Entities

    Authors: Petr Marek, Štěpán Müller, Jakub Konrád, Petr Lorenc, Jan Pichl, Jan Šedivý

    Abstract: The foundation for the research of summarization in the Czech language was laid by the work of Straka et al. (2018). They published the SumeCzech, a large Czech news-based summarization dataset, and proposed several baseline approaches. However, it is clear from the achieved results that there is a large space for improvement. In our work, we focus on the impact of named entities on the summarizat… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Journal ref: The Prague Bulletin of Mathematical Linguistics 2021 116

  8. arXiv:2101.09585  [pdf, other

    cs.CV

    BSUV-Net 2.0: Spatio-Temporal Data Augmentations for Video-Agnostic Supervised Background Subtraction

    Authors: M. Ozan Tezcan, Prakash Ishwar, Janusz Konrad

    Abstract: Background subtraction (BGS) is a fundamental video processing task which is a key component of many applications. Deep learning-based supervised algorithms achieve very good perforamnce in BGS, however, most of these algorithms are optimized for either a specific video or a group of videos, and their performance decreases dramatically when applied to unseen videos. Recently, several papers addres… ▽ More

    Submitted 24 February, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

  9. Do We Need Online NLU Tools?

    Authors: Petr Lorenc, Petr Marek, Jan Pichl, Jakub Konrád, Jan Šedivý

    Abstract: The intent recognition is an essential algorithm of any conversational AI application. It is responsible for the classification of an input message into meaningful classes. In many bot development platforms, we can configure the NLU pipeline. Several intent recognition services are currently available as an API, or we choose from many open-source alternatives. However, there is no comparison of in… ▽ More

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 8 pages, 9 tables

  10. arXiv:2011.03261  [pdf, other

    cs.CL

    Alquist 3.0: Alexa Prize Bot Using Conversational Knowledge Graph

    Authors: Jan Pichl, Petr Marek, Jakub Konrád, Petr Lorenc, Van Duy Ta, Jan Šedivý

    Abstract: The third version of the open-domain dialogue system Alquist developed within the Alexa Prize 2020 competition is designed to conduct coherent and engaging conversations on popular topics. The main novel contribution is the introduction of a system leveraging an innovative approach based on a conversational knowledge graph and adjacency pairs. The conversational knowledge graph allows the system t… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  11. arXiv:2011.03259  [pdf, other

    cs.CL

    Alquist 2.0: Alexa Prize Socialbot Based on Sub-Dialogue Models

    Authors: Jan Pichl, Petr Marek, Jakub Konrád, Martin Matulík, Jan Šedivý

    Abstract: This paper presents the second version of the dialogue system named Alquist competing in Amazon Alexa Prize 2018. We introduce a system leveraging ontology-based topic structure called topic nodes. Each of the nodes consists of several sub-dialogues, and each sub-dialogue has its own LSTM-based model for dialogue management. The sub-dialogues can be triggered according to the topic hierarchy or a… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  12. arXiv:2005.11623  [pdf, other

    cs.CV

    RAPiD: Rotation-Aware People Detection in Overhead Fisheye Images

    Authors: Zhihao Duan, M. Ozan Tezcan, Hayato Nakamura, Prakash Ishwar, Janusz Konrad

    Abstract: Recent methods for people detection in overhead, fisheye images either use radially-aligned bounding boxes to represent people, assuming people always appear along image radius or require significant pre-/post-processing which radically increases computational complexity. In this work, we develop an end-to-end rotation-aware people detection method, named RAPiD, that detects people using arbitrari… ▽ More

    Submitted 23 May, 2020; originally announced May 2020.

    Comments: CVPR 2020 OmniCV Workshop paper extended version

  13. arXiv:2004.05685  [pdf, other

    cs.CV

    Low-Resolution Overhead Thermal Tripwire for Occupancy Estimation

    Authors: Mertcan Cokbas, Prakash Ishwar, Janusz Konrad

    Abstract: Smart buildings use occupancy sensing for various tasks ranging from energy-efficient HVAC and lighting to space-utilization analysis and emergency response. We propose a people counting system which uses a low-resolution thermal sensor. Unlike previous people-counting systems based on thermal sensors, we use an overhead tripwire configuration at entryways to detect and track transient entries or… ▽ More

    Submitted 5 May, 2020; v1 submitted 12 April, 2020; originally announced April 2020.

  14. VAE/WGAN-Based Image Representation Learning For Pose-Preserving Seamless Identity Replacement In Facial Images

    Authors: Hiroki Kawai, Jiawei Chen, Prakash Ishwar, Janusz Konrad

    Abstract: We present a novel variational generative adversarial network (VGAN) based on Wasserstein loss to learn a latent representation from a face image that is invariant to identity but preserves head-pose information. This facilitates synthesis of a realistic face image with the same head pose as a given input image, but with a different identity. One application of this network is in privacy-sensitive… ▽ More

    Submitted 1 March, 2020; originally announced March 2020.

    Comments: 6 pages, 5 figures, 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP)

    Journal ref: 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP)

  15. arXiv:1907.11371  [pdf, other

    cs.CV

    BSUV-Net: A Fully-Convolutional Neural Network for Background Subtraction of Unseen Videos

    Authors: M. Ozan Tezcan, Prakash Ishwar, Janusz Konrad

    Abstract: Background subtraction is a basic task in computer vision and video processing often applied as a pre-processing step for object tracking, people recognition, etc. Recently, a number of successful background-subtraction algorithms have been proposed, however nearly all of the top-performing ones are supervised. Crucially, their success relies upon the availability of some annotated frames of the t… ▽ More

    Submitted 14 January, 2020; v1 submitted 25 July, 2019; originally announced July 2019.

    Comments: 10 pages

  16. arXiv:1906.09313  [pdf, other

    cs.CV

    A Cyclically-Trained Adversarial Network for Invariant Representation Learning

    Authors: Jiawei Chen, Janusz Konrad, Prakash Ishwar

    Abstract: Recent studies show that deep neural networks are vulnerable to adversarial examples which can be generated via certain types of transformations. Being robust to a desired family of adversarial attacks is then equivalent to being invariant to a family of transformations. Learning invariant representations then naturally emerges as an important goal to achieve which we explore in this paper within… ▽ More

    Submitted 16 April, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

  17. arXiv:1804.06705  [pdf, other

    cs.CL

    Alquist: The Alexa Prize Socialbot

    Authors: Jan Pichl, Petr Marek, Jakub Konrád, Martin Matulík, Hoang Long Nguyen, Jan Šedivý

    Abstract: This paper describes a new open domain dialogue system Alquist developed as part of the Alexa Prize competition for the Amazon Echo line of products. The Alquist dialogue system is designed to conduct a coherent and engaging conversation on popular topics. We are presenting a hybrid system combining several machine learning and rule based approaches. We discuss and describe the Alquist pipeline, d… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

  18. arXiv:1803.07100  [pdf, ps, other

    cs.CV

    VGAN-Based Image Representation Learning for Privacy-Preserving Facial Expression Recognition

    Authors: Jiawei Chen, Janusz Konrad, Prakash Ishwar

    Abstract: Reliable facial expression recognition plays a critical role in human-machine interactions. However, most of the facial expression analysis methodologies proposed to date pay little or no attention to the protection of a user's privacy. In this paper, we propose a Privacy-Preserving Representation-Learning Variational Generative Adversarial Network (PPRL-VGAN) to learn an image representation that… ▽ More

    Submitted 7 September, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

  19. arXiv:1610.03898  [pdf, other

    cs.CV

    Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions

    Authors: Jiawei Chen, Jonathan Wu, Janusz Konrad, Prakash Ishwar

    Abstract: Deep convolutional neural networks (ConvNets) have been recently shown to attain state-of-the-art performance for action recognition on standard-resolution videos. However, less attention has been paid to recognition performance at extremely low resolutions (eLR) (e.g., 16 x 12 pixels). Reliable action recognition using eLR cameras would address privacy concerns in various application environments… ▽ More

    Submitted 5 October, 2018; v1 submitted 12 October, 2016; originally announced October 2016.

  20. arXiv:0910.2917  [pdf, ps, other

    cs.CV

    Behavior Subtraction

    Authors: P. M. Jodoin, V. Saligrama, J. Konrad

    Abstract: Background subtraction has been a driving engine for many computer vision and video analytics tasks. Although its many variants exist, they all share the underlying assumption that photometric scene properties are either static or exhibit temporal stationarity. While this works in some applications, the model fails when one is interested in discovering {\it changes in scene dynamics} rather than… ▽ More

    Submitted 15 October, 2009; originally announced October 2009.