Skip to main content

Showing 1–10 of 10 results for author: Araslanov, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16818  [pdf, other

    cs.CV

    Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals

    Authors: Oliver Hahn, Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

    Abstract: Unsupervised semantic segmentation aims to automatically partition images into semantically meaningful regions by identifying global categories within an image corpus without any form of annotation. Building upon recent advances in self-supervised representation learning, we focus on how to leverage these large pre-trained models for the downstream task of unsupervised segmentation. We present Pri… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Code: https://github.com/visinf/primaps

  2. arXiv:2404.03778  [pdf, other

    cs.CV

    Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

    Authors: Simon Weber, Barış Zöngür, Nikita Araslanov, Daniel Cremers

    Abstract: Hierarchy is a natural representation of semantic taxonomies, including the ones routinely used in image segmentation. Indeed, recent work on semantic segmentation reports improved accuracy from supervised training leveraging hierarchical label structures. Encouraged by these results, we revisit the fundamental assumptions behind that work. We postulate and then empirically verify that the reasons… ▽ More

    Submitted 15 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2212.10368  [pdf, other

    cs.CV

    Masked Event Modeling: Self-Supervised Pretraining for Event Cameras

    Authors: Simon Klenk, David Bonello, Lukas Koestler, Nikita Araslanov, Daniel Cremers

    Abstract: Event cameras asynchronously capture brightness changes with low latency, high temporal resolution, and high dynamic range. However, annotation of event data is a costly and laborious process, which limits the use of deep learning methods for classification and other semantic tasks with the event modality. To reduce the dependency on labeled event data, we introduce Masked Event Modeling (MEM), a… ▽ More

    Submitted 23 December, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: To appear at WACV 2024. Code: https://github.com/tum-vision/mem

  4. arXiv:2208.05788  [pdf, other

    cs.CV

    Semantic Self-adaptation: Enhancing Generalization with a Single Sample

    Authors: Sherwin Bahmani, Oliver Hahn, Eduard Zamfir, Nikita Araslanov, Daniel Cremers, Stefan Roth

    Abstract: The lack of out-of-domain generalization is a critical weakness of deep networks for semantic segmentation. Previous studies relied on the assumption of a static model, i. e., once the training process is complete, model parameters remain fixed at test time. In this work, we challenge this premise with a self-adaptive approach for semantic segmentation that adjusts the inference process to each in… ▽ More

    Submitted 13 December, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Published in TMLR (July 2023) | OpenReview: https://openreview.net/forum?id=ILNqQhGbLx | Code: https://github.com/visinf/self-adaptive | Video: https://youtu.be/s4DG65ic0EA

  5. arXiv:2111.06265  [pdf, other

    cs.CV cs.LG

    Dense Unsupervised Learning for Video Segmentation

    Authors: Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

    Abstract: We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, a naive scheme to train su… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: To appear at NeurIPS*2021. Code: https://github.com/visinf/dense-ulearn-vos

  6. arXiv:2105.00097  [pdf, other

    cs.CV cs.LG

    Self-supervised Augmentation Consistency for Adapting Semantic Segmentation

    Authors: Nikita Araslanov, Stefan Roth

    Abstract: We propose an approach to domain adaptation for semantic segmentation that is both practical and highly accurate. In contrast to previous work, we abandon the use of computationally involved adversarial objectives, network ensembles and style transfer. Instead, we employ standard data augmentation techniques $-$ photometric noise, flipping and scaling $-$ and ensure consistency of the semantic pre… ▽ More

    Submitted 30 April, 2021; originally announced May 2021.

    Comments: To appear at CVPR 2021. Code: https://github.com/visinf/da-sac

  7. arXiv:2005.08104  [pdf, other

    cs.CV cs.LG

    Single-Stage Semantic Segmentation from Image Labels

    Authors: Nikita Araslanov, Stefan Roth

    Abstract: Recent years have seen a rapid growth in new approaches improving the accuracy of semantic segmentation in a weakly supervised setting, i.e. with only image-level labels available for training. However, this has come at the cost of increased model complexity and sophisticated multi-stage training procedures. This is in contrast to earlier work that used only a single stage $-$ training one segment… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: To appear at CVPR 2020; minor corrections in Eq. (9). Code: https://github.com/visinf/1-stage-wseg

  8. arXiv:1909.12400  [pdf, other

    cs.CV

    Markov Decision Process for Video Generation

    Authors: Vladyslav Yushchenko, Nikita Araslanov, Stefan Roth

    Abstract: We identify two pathological cases of temporal inconsistencies in video generation: video freezing and video looping. To better quantify the temporal diversity, we propose a class of complementary metrics that are effective, easy to implement, data agnostic, and interpretable. Further, we observe that current state-of-the-art models are trained on video samples of fixed length thereby inhibiting l… ▽ More

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: To appear at 2019 ICCV Workshop on Large Scale Holistic Video Understanding

  9. arXiv:1904.05126  [pdf, other

    cs.CV cs.LG

    Actor-Critic Instance Segmentation

    Authors: Nikita Araslanov, Constantin Rothkopf, Stefan Roth

    Abstract: Most approaches to visual scene analysis have emphasised parallel processing of the image elements. However, one area in which the sequential nature of vision is apparent, is that of segmenting multiple, potentially similar and partially occluded objects in a scene. In this work, we revisit the recurrent formulation of this challenging problem in the context of reinforcement learning. Motivated by… ▽ More

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: To appear at CVPR 2019

  10. NimbRo Rescue: Solving Disaster-Response Tasks through Mobile Manipulation Robot Momaro

    Authors: Max Schwarz, Tobias Rodehutskors, David Droeschel, Marius Beul, Michael Schreiber, Nikita Araslanov, Ivan Ivanov, Christian Lenz, Jan Razlaw, Sebastian Schüller, David Schwarz, Angeliki Topalidou-Kyniazopoulou, Sven Behnke

    Abstract: Robots that solve complex tasks in environments too dangerous for humans to enter are desperately needed, e.g. for search and rescue applications. We describe our mobile manipulation robot Momaro, with which we participated successfully in the DARPA Robotics Challenge. It features a unique locomotion design with four legs ending in steerable wheels, which allows it both to drive omnidirectionally… ▽ More

    Submitted 15 October, 2018; v1 submitted 2 October, 2018; originally announced October 2018.

    Journal ref: Journal of Field Robotics 34(2): 400-425 (2017)