Author: "Wołczyk, Maciej" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wołczyk, Maciej"' showing total 34 results

Start Over Author "Wołczyk, Maciej"

34 results on '"Wołczyk, Maciej"'

1. BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Author: Paglieri, Davide, Cupiał, Bartłomiej, Coward, Samuel, Piterbarg, Ulyana, Wolczyk, Maciej, Khan, Akbir, Pignatelli, Eduardo, Kuciński, Łukasz, Pinto, Lerrel, Fergus, Rob, Foerster, Jakob Nicolaus, Parker-Holder, Jack, and Rocktäschel, Tim
Subjects: Computer Science - Artificial Intelligence
Abstract: Large Language Models (LLMs) and Vision Language Models (VLMs) possess extensive knowledge and exhibit promising reasoning abilities; however, they still struggle to perform well in complex, dynamic environments. Real-world tasks require handling intricate interactions, advanced spatial reasoning, long-term planning, and continuous exploration of new strategies-areas in which we lack effective methodologies for comprehensively evaluating these capabilities. To address this gap, we introduce BALROG, a novel benchmark designed to assess the agentic capabilities of LLMs and VLMs through a diverse set of challenging games. Our benchmark incorporates a range of existing reinforcement learning environments with varying levels of difficulty, including tasks that are solvable by non-expert humans in seconds to extremely challenging ones that may take years to master (e.g., the NetHack Learning Environment). We devise fine-grained metrics to measure performance and conduct an extensive evaluation of several popular open-source and closed-source LLMs and VLMs. Our findings indicate that while current models achieve partial success in the easier games, they struggle significantly with more challenging tasks. Notably, we observe severe deficiencies in vision-based decision-making, as models perform worse when visual representations of the environments are provided. We release BALROG as an open and user-friendly benchmark to facilitate future research and development in the agentic community., Comment: Preprint, under review
Published: 2024

2. Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models

Author: Góral, Gracjan, Ziarko, Alicja, Nauman, Michal, and Wołczyk, Maciej
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Visual perspective-taking (VPT), the ability to understand the viewpoint of another person, enables individuals to anticipate the actions of other people. For instance, a driver can avoid accidents by assessing what pedestrians see. Humans typically develop this skill in early childhood, but it remains unclear whether the recently emerging Vision Language Models (VLMs) possess such capability. Furthermore, as these models are increasingly deployed in the real world, understanding how they perform nuanced tasks like VPT becomes essential. In this paper, we introduce two manually curated datasets, Isle-Bricks and Isle-Dots for testing VPT skills, and we use it to evaluate 12 commonly used VLMs. Across all models, we observe a significant performance drop when perspective-taking is required. Additionally, we find performance in object detection tasks is poorly correlated with performance on VPT tasks, suggesting that the existing benchmarks might not be sufficient to understand this problem. The code and the dataset will be available at https://sites.google.com/view/perspective-taking
Published: 2024

3. State Soup: In-Context Skill Learning, Retrieval and Mixing

Author: Pióro, Maciej, Wołczyk, Maciej, Pascanu, Razvan, von Oswald, Johannes, and Sacramento, João
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: A new breed of gated-linear recurrent neural networks has reached state-of-the-art performance on a range of sequence modeling problems. Such models naturally handle long sequences efficiently, as the cost of processing a new input is independent of sequence length. Here, we explore another advantage of these stateful sequence models, inspired by the success of model merging through parameter interpolation. Building on parallels between fine-tuning and in-context learning, we investigate whether we can treat internal states as task vectors that can be stored, retrieved, and then linearly combined, exploiting the linearity of recurrence. We study this form of fast model merging on Mamba-2.8b, a pretrained recurrent model, and present preliminary evidence that simple linear state interpolation methods suffice to improve next-token perplexity as well as downstream in-context learning task performance.
Published: 2024

4. AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale

Author: Pardyl, Adam, Wronka, Michał, Wołczyk, Maciej, Adamczewski, Kamil, Trzciński, Tomasz, and Zieliński, Bartosz
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Active Visual Exploration (AVE) is a task that involves dynamically selecting observations (glimpses), which is critical to facilitate comprehension and navigation within an environment. While modern AVE methods have demonstrated impressive performance, they are constrained to fixed-scale glimpses from rigid grids. In contrast, existing mobile platforms equipped with optical zoom capabilities can capture glimpses of arbitrary positions and scales. To address this gap between software and hardware capabilities, we introduce AdaGlimpse. It uses Soft Actor-Critic, a reinforcement learning algorithm tailored for exploration tasks, to select glimpses of arbitrary position and scale. This approach enables our model to rapidly establish a general awareness of the environment before zooming in for detailed analysis. Experimental results demonstrate that AdaGlimpse surpasses previous methods across various visual tasks while maintaining greater applicability in realistic AVE scenarios., Comment: ECCV 2024
Published: 2024
Full Text: View/download PDF

5. Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

Author: Wołczyk, Maciej, Cupiał, Bartłomiej, Ostaszewski, Mateusz, Bortkiewicz, Michał, Zając, Michał, Pascanu, Razvan, Kuciński, Łukasz, and Miłoś, Piotr
Subjects: Computer Science - Machine Learning
Abstract: Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained capabilities, as recently showcased by the successful applications of foundation models. However, fine-tuning reinforcement learning (RL) models remains a challenge. This work conceptualizes one specific cause of poor transfer, accentuated in the RL setting by the interplay between actions and observations: forgetting of pre-trained capabilities. Namely, a model deteriorates on the state subspace of the downstream task not visited in the initial phase of fine-tuning, on which the model behaved well due to pre-training. This way, we lose the anticipated transfer benefits. We identify conditions when this problem occurs, showing that it is common and, in many cases, catastrophic. Through a detailed empirical analysis of the challenging NetHack and Montezuma's Revenge environments, we show that standard knowledge retention techniques mitigate the problem and thus allow us to take full advantage of the pre-trained capabilities. In particular, in NetHack, we achieve a new state-of-the-art for neural models, improving the previous best score from $5$K to over $10$K points in the Human Monk scenario., Comment: ICML 2024 Spotlight
Published: 2024

6. AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale

Author: Pardyl, Adam, Wronka, Michał, Wołczyk, Maciej, Adamczewski, Kamil, Trzciński, Tomasz, Zieliński, Bartosz, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Leonardis, Aleš, editor, Ricci, Elisa, editor, Roth, Stefan, editor, Russakovsky, Olga, editor, Sattler, Torsten, editor, and Varol, Gül, editor
Published: 2025
Full Text: View/download PDF

7. Discovering modular solutions that generalize compositionally

Author: Schug, Simon, Kobayashi, Seijin, Akram, Yassir, Wołczyk, Maciej, Proca, Alexandra, von Oswald, Johannes, Pascanu, Razvan, Sacramento, João, and Steger, Angelika
Subjects: Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing
Abstract: Many complex tasks can be decomposed into simpler, independent parts. Discovering such underlying compositional structure has the potential to enable compositional generalization. Despite progress, our most powerful systems struggle to compose flexibly. It therefore seems natural to make models more modular to help capture the compositional nature of many tasks. However, it is unclear under which circumstances modular systems can discover hidden compositional structure. To shed light on this question, we study a teacher-student setting with a modular teacher where we have full control over the composition of ground truth modules. This allows us to relate the problem of compositional generalization to that of identification of the underlying modules. In particular we study modularity in hypernetworks representing a general class of multiplicative interactions. We show theoretically that identification up to linear transformation purely from demonstrations is possible without having to learn an exponential number of module combinations. We further demonstrate empirically that under the theoretically identified conditions, meta-learning from finite data can discover modular policies that generalize compositionally in a number of complex environments., Comment: Published as a conference paper at ICLR 2024; Code available at https://github.com/smonsays/modular-hyperteacher
Published: 2023

8. The Effectiveness of World Models for Continual Reinforcement Learning

Author: Kessler, Samuel, Ostaszewski, Mateusz, Bortkiewicz, Michał, Żarski, Mateusz, Wołczyk, Maciej, Parker-Holder, Jack, Roberts, Stephen J., and Miłoś, Piotr
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: World models power some of the most efficient reinforcement learning algorithms. In this work, we showcase that they can be harnessed for continual learning - a situation when the agent faces changing environments. World models typically employ a replay buffer for training, which can be naturally extended to continual learning. We systematically study how different selective experience replay methods affect performance, forgetting, and transfer. We also provide recommendations regarding various modeling options for using world models. The best set of choices is called Continual-Dreamer, it is task-agnostic and utilizes the world model for continual exploration. Continual-Dreamer is sample efficient and outperforms state-of-the-art task-agnostic continual reinforcement learning methods on Minigrid and Minihack benchmarks., Comment: Accepted at CoLLAs 2023, 21 pages, 15 figures
Published: 2022

9. Disentangling Transfer in Continual Reinforcement Learning

Author: Wołczyk, Maciej, Zając, Michał, Pascanu, Razvan, Kuciński, Łukasz, and Miłoś, Piotr
Subjects: Computer Science - Machine Learning
Abstract: The ability of continual learning systems to transfer knowledge from previously seen tasks in order to maximize performance on new tasks is a significant challenge for the field, limiting the applicability of continual learning solutions to realistic scenarios. Consequently, this study aims to broaden our understanding of transfer and its driving forces in the specific case of continual reinforcement learning. We adopt SAC as the underlying RL algorithm and Continual World as a suite of continuous control tasks. We systematically study how different components of SAC (the actor and the critic, exploration, and data) affect transfer efficacy, and we provide recommendations regarding various modeling options. The best set of choices, dubbed ClonEx-SAC, is evaluated on the recent Continual World benchmark. ClonEx-SAC achieves 87% final success rate compared to 80% of PackNet, the best method in the benchmark. Moreover, the transfer grows from 0.18 to 0.54 according to the metric provided by Continual World., Comment: Accepted at NeurIPS 2022
Published: 2022

10. Hebbian Continual Representation Learning

Author: Morawiecki, Paweł, Krutsylo, Andrii, Wołczyk, Maciej, and Śmieja, Marek
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Machine Learning
Abstract: Continual Learning aims to bring machine learning into a more realistic scenario, where tasks are learned sequentially and the i.i.d. assumption is not preserved. Although this setting is natural for biological systems, it proves very difficult for machine learning models such as artificial neural networks. To reduce this performance gap, we investigate the question whether biologically inspired Hebbian learning is useful for tackling continual challenges. In particular, we highlight a realistic and often overlooked unsupervised setting, where the learner has to build representations without any supervision. By combining sparse neural networks with Hebbian learning principle, we build a simple yet effective alternative (HebbCL) to typical neural network models trained via the gradient descent. Due to Hebbian learning, the network have easily interpretable weights, which might be essential in critical application such as security or healthcare. We demonstrate the efficacy of HebbCL in an unsupervised learning setting applied to MNIST and Omniglot datasets. We also adapt the algorithm to the supervised scenario and obtain promising results in the class-incremental learning.
Published: 2022

11. Continual Learning with Guarantees via Weight Interval Constraints

Author: Wołczyk, Maciej, Piczak, Karol J., Wójcik, Bartosz, Pustelnik, Łukasz, Morawiecki, Paweł, Tabor, Jacek, Trzciński, Tomasz, and Spurek, Przemysław
Subjects: Computer Science - Machine Learning
Abstract: We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of data, while reducing the negative impact of catastrophic forgetting, yet they do not provide any firm guarantees that network performance will not deteriorate uncontrollably over time. In this work, we show how to put bounds on forgetting by reformulating continual learning of a model as a continual contraction of its parameter space. To that end, we propose Hyperrectangle Training, a new training methodology where each task is represented by a hyperrectangle in the parameter space, fully contained in the hyperrectangles of the previous tasks. This formulation reduces the NP-hard CL problem back to polynomial time while providing full resilience against forgetting. We validate our claim by developing InterContiNet (Interval Continual Learning) algorithm which leverages interval arithmetic to effectively model parameter regions as hyperrectangles. Through experimental results, we show that our approach performs well in a continual learning setup without storing data from previous tasks., Comment: Short presentation at ICML 2022
Published: 2022

12. On the relationship between disentanglement and multi-task learning

Author: Maziarka, Łukasz, Nowak, Aleksandra, Wołczyk, Maciej, and Bedychaj, Andrzej
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: One of the main arguments behind studying disentangled representations is the assumption that they can be easily reused in different tasks. At the same time finding a joint, adaptable representation of data is one of the key challenges in the multi-task learning setting. In this paper, we take a closer look at the relationship between disentanglement and multi-task learning based on hard parameter sharing. We perform a thorough empirical study of the representations obtained by neural networks trained on automatically generated supervised tasks. Using a set of standard metrics we show that disentanglement appears naturally during the process of multi-task neural network training.
Published: 2021

13. SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

Author: Vitelli, Matt, Chang, Yan, Ye, Yawei, Wołczyk, Maciej, Osiński, Błażej, Niendorf, Moritz, Grimmett, Hugo, Huang, Qiangui, Jain, Ashesh, and Ondruska, Peter
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In this paper we present the first safe system for full control of self-driving vehicles trained from human demonstrations and deployed in challenging, real-world, urban environments. Current industry-standard solutions use rule-based systems for planning. Although they perform reasonably well in common scenarios, the engineering complexity renders this approach incompatible with human-level performance. On the other hand, the performance of machine-learned (ML) planning solutions can be improved by simply adding more exemplar data. However, ML methods cannot offer safety guarantees and sometimes behave unpredictably. To combat this, our approach uses a simple yet effective rule-based fallback layer that performs sanity checks on an ML planner's decisions (e.g. avoiding collision, assuring physical feasibility). This allows us to leverage ML to handle complex situations while still assuring the safety, reducing ML planner-only collisions by 95%. We train our ML planner on 300 hours of expert driving demonstrations using imitation learning and deploy it along with the fallback layer in downtown San Francisco, where it takes complete control of a real vehicle and navigates a wide variety of challenging urban driving scenarios.
Published: 2021

14. Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

Author: Scheel, Oliver, Bergamini, Luca, Wołczyk, Maciej, Osiński, Błażej, and Ondruska, Peter
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In this work we are the first to present an offline policy gradient method for learning imitative policies for complex urban driving from a large corpus of real-world demonstrations. This is achieved by building a differentiable data-driven simulator on top of perception outputs and high-fidelity HD maps of the area. It allows us to synthesize new driving experiences from existing demonstrations using mid-level representations. Using this simulator we then train a policy network in closed-loop employing policy gradients. We train our proposed method on 100 hours of expert demonstrations on urban roads and show that it learns complex driving policies that generalize well and can perform a variety of driving maneuvers. We demonstrate this in simulation as well as deploy our model to self-driving vehicles in the real-world. Our method outperforms previously demonstrated state-of-the-art for urban driving scenarios -- all this without the need for complex state perturbations or collecting additional on-policy data during training. We make code and data publicly available., Comment: CoRL 2021
Published: 2021

15. PluGeN: Multi-Label Conditional Generation From Pre-Trained Models

Author: Wołczyk, Maciej, Proszewska, Magdalena, Maziarka, Łukasz, Zięba, Maciej, Wielopolski, Patryk, Kurczab, Rafał, and Śmieja, Marek
Subjects: Computer Science - Machine Learning
Abstract: Modern generative models achieve excellent quality in a variety of tasks including image or text generation and chemical molecule modeling. However, existing methods often lack the essential ability to generate examples with requested properties, such as the age of the person in the photo or the weight of the generated molecule. Incorporating such additional conditioning factors would require rebuilding the entire architecture and optimizing the parameters from scratch. Moreover, it is difficult to disentangle selected attributes so that to perform edits of only one attribute while leaving the others unchanged. To overcome these limitations we propose PluGeN (Plugin Generative Network), a simple yet effective generative technique that can be used as a plugin to pre-trained generative models. The idea behind our approach is to transform the entangled latent representation using a flow-based module into a multi-dimensional space where the values of each attribute are modeled as an independent one-dimensional distribution. In consequence, PluGeN can generate new samples with desired attributes as well as manipulate labeled attributes of existing examples. Due to the disentangling of the latent representation, we are even able to generate samples with rare or unseen combinations of attributes in the dataset, such as a young person with gray hair, men with make-up, or women with beards. We combined PluGeN with GAN and VAE models and applied it to conditional generation and manipulation of images and chemical molecule modeling. Experiments demonstrate that PluGeN preserves the quality of backbone models while adding the ability to control the values of labeled attributes.
Published: 2021

16. Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

Author: Wołczyk, Maciej, Wójcik, Bartosz, Bałazy, Klaudia, Podolak, Igor, Tabor, Jacek, Śmieja, Marek, and Trzciński, Tomasz
Subjects: Computer Science - Machine Learning
Abstract: The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. However, if a particular IC does not decide to return an answer early, its predictions are discarded, with its computations effectively being wasted. To solve this issue, we introduce Zero Time Waste (ZTW), a novel approach in which each IC reuses predictions returned by its predecessors by (1) adding direct connections between ICs and (2) combining previous outputs in an ensemble-like manner. We conduct extensive experiments across various datasets and architectures to demonstrate that ZTW achieves a significantly better accuracy vs. inference time trade-off than other recently proposed early exit methods., Comment: Accepted at NeurIPS 2021
Published: 2021

17. Continual World: A Robotic Benchmark For Continual Reinforcement Learning

Author: Wołczyk, Maciej, Zając, Michał, Pascanu, Razvan, Kuciński, Łukasz, and Miłoś, Piotr
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Robotics
Abstract: Continual learning (CL) -- the ability to continuously learn, building on previously acquired knowledge -- is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks. Understanding the right trade-off is conceptually and computationally challenging, which we argue has led the community to overly focus on catastrophic forgetting. In response to these issues, we advocate for the need to prioritize forward transfer and propose Continual World, a benchmark consisting of realistic and meaningfully diverse robotic tasks built on top of Meta-World as a testbed. Following an in-depth empirical evaluation of existing CL methods, we pinpoint their limitations and highlight unique algorithmic challenges in the RL setting. Our benchmark aims to provide a meaningful and computationally inexpensive challenge for the community and thus help better understand the performance of existing and future solutions. Information about the benchmark, including the open-source code, is available at https://sites.google.com/view/continualworld., Comment: NeurIPS 2021
Published: 2021

18. On the Relationship Between Disentanglement and Multi-task Learning

Author: Maziarka, Łukasz, Nowak, Aleksandra, Wołczyk, Maciej, Bedychaj, Andrzej, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Amini, Massih-Reza, editor, Canu, Stéphane, editor, Fischer, Asja, editor, Guns, Tias, editor, Kralj Novak, Petra, editor, and Tsoumakas, Grigorios, editor
Published: 2023
Full Text: View/download PDF

19. Finding the Optimal Network Depth in Classification Tasks

Author: Wójcik, Bartosz, Wołczyk, Maciej, Bałazy, Klaudia, and Tabor, Jacek
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We develop a fast end-to-end method for training lightweight neural networks using multiple classifier heads. By allowing the model to determine the importance of each head and rewarding the choice of a single shallow classifier, we are able to detect and remove unneeded components of the network. This operation, which can be seen as finding the optimal depth of the model, significantly reduces the number of parameters and accelerates inference across different hardware processing units, which is not the case for many standard pruning methods. We show the performance of our method on multiple network architectures and datasets, analyze its optimization properties, and conduct ablation studies.
Published: 2020

20. Zero time waste in pre-trained early exit neural networks

Author: Wójcik, Bartosz, Przewiȩźlikowski, Marcin, Szatkowski, Filip, Wołczyk, Maciej, Bałazy, Klaudia, Krzepkowski, Bartłomiej, Podolak, Igor, Tabor, Jacek, Śmieja, Marek, and Trzciński, Tomasz
Published: 2023
Full Text: View/download PDF

21. Biologically-Inspired Spatial Neural Networks

Author: Wołczyk, Maciej, Tabor, Jacek, Śmieja, Marek, and Maszke, Szymon
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We introduce bio-inspired artificial neural networks consisting of neurons that are additionally characterized by spatial positions. To simulate properties of biological systems we add the costs penalizing long connections and the proximity of neurons in a two-dimensional space. Our experiments show that in the case where the network performs two different tasks, the neurons naturally split into clusters, where each cluster is responsible for processing a different task. This behavior not only corresponds to the biological systems, but also allows for further insight into interpretability or continual learning.
Published: 2019

22. SeGMA: Semi-Supervised Gaussian Mixture Auto-Encoder

Author: Śmieja, Marek, Wołczyk, Maciej, Tabor, Jacek, and Geiger, Bernhard C.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: We propose a semi-supervised generative model, SeGMA, which learns a joint probability distribution of data and their classes and which is implemented in a typical Wasserstein auto-encoder framework. We choose a mixture of Gaussians as a target distribution in latent space, which provides a natural splitting of data into clusters. To connect Gaussian components with correct classes, we use a small amount of labeled data and a Gaussian classifier induced by the target distribution. SeGMA is optimized efficiently due to the use of Cramer-Wold distance as a maximum mean discrepancy penalty, which yields a closed-form expression for a mixture of spherical Gaussian components and thus obviates the need of sampling. While SeGMA preserves all properties of its semi-supervised predecessors and achieves at least as good generative performance on standard benchmark data sets, it presents additional features: (a) interpolation between any pair of points in the latent space produces realistically-looking samples; (b) combining the interpolation property with disentangled class and style variables, SeGMA is able to perform a continuous style transfer from one class to another; (c) it is possible to change the intensity of class characteristics in a data point by moving the latent representation of the data point away from specific Gaussian components.
Published: 2019

23. Hypernetwork functional image representation

Author: Klocek, Sylwester, Maziarka, Łukasz, Wołczyk, Maciej, Tabor, Jacek, Nowak, Jakub, and Śmieja, Marek
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Motivated by the human way of memorizing images we introduce their functional representation, where an image is represented by a neural network. For this purpose, we construct a hypernetwork which takes an image and returns weights to the target network, which maps point from the plane (representing positions of the pixel) into its corresponding color in the image. Since the obtained representation is continuous, one can easily inspect the image at various resolutions and perform on it arbitrary continuous operations. Moreover, by inspecting interpolations we show that such representation has some properties characteristic to generative models. To evaluate the proposed mechanism experimentally, we apply it to image super-resolution problem. Despite using a single model for various scaling factors, we obtained results comparable to existing super-resolution methods.
Published: 2019
Full Text: View/download PDF

24. On the Relationship Between Disentanglement and Multi-task Learning

Author: Maziarka, Łukasz, primary, Nowak, Aleksandra, additional, Wołczyk, Maciej, additional, and Bedychaj, Andrzej, additional
Published: 2023
Full Text: View/download PDF

25. Finding the Optimal Network Depth in Classification Tasks

Author: Wójcik, Bartosz, Wołczyk, Maciej, Bałazy, Klaudia, Tabor, Jacek, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Hutter, Frank, editor, Kersting, Kristian, editor, Lijffijt, Jefrey, editor, and Valera, Isabel, editor
Published: 2021
Full Text: View/download PDF

26. Multi-Label Conditional Generation From Pre-Trained Models

Author: Proszewska, Magdalena, primary, Wołczyk, Maciej, additional, Zieba, Maciej, additional, Wielopolski, Patryk, additional, Maziarka, Łukasz, additional, and Śmieja, Marek, additional
Published: 2024
Full Text: View/download PDF

27. Finding the Optimal Network Depth in Classification Tasks

Author: Wójcik, Bartosz, primary, Wołczyk, Maciej, additional, Bałazy, Klaudia, additional, and Tabor, Jacek, additional
Published: 2021
Full Text: View/download PDF

28. Hypernetwork Functional Image Representation

Author: Klocek, Sylwester, primary, Maziarka, Łukasz, additional, Wołczyk, Maciej, additional, Tabor, Jacek, additional, Nowak, Jakub, additional, and Śmieja, Marek, additional
Published: 2019
Full Text: View/download PDF

29. Hebbian Continual Representation Learning

Author: Morawiecki, Pawel, primary, Krutsylo, Andrii, additional, Wołczyk, Maciej, additional, and Śmieja, Marek, additional
Published: 2023
Full Text: View/download PDF

30. PluGeN: Multi-Label Conditional Generation from Pre-trained Models

Author: Wołczyk, Maciej, primary, Proszewska, Magdalena, additional, Maziarka, Łukasz, additional, Zieba, Maciej, additional, Wielopolski, Patryk, additional, Kurczab, Rafał, additional, and Smieja, Marek, additional
Published: 2022
Full Text: View/download PDF

31. Continual World : a robotic benchmark for continual reinforcement learning

Author: Kucinski, Lukasz, Miłoś, Piotr, Pascanu, Razvan, Wołczyk, Maciej, and Zając, Michał
Abstract: Continual learning (CL) - the ability to continuously learn, building on previ ously acquired knowledge - is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks. Understanding the right trade-off is conceptually and computationally challenging, which we argue has led the community to overly focus on catastrophic forgetting. In response to these issues, we advocate for the need to prioritize forward transfer and propose Continual World, a benchmark consisting of realistic and meaningfully diverse robotic tasks built on top of Meta-World [54] as a testbed. Following an in-depth empirical evaluation of existing CL methods, we pinpoint their limitations and highlight unique algorithmic challenges in the RL setting. Our benchmark aims to provide a meaningful and computationally inexpensive challenge for the community and thus help better understand the performance of existing and future solutions. Information about the benchmark, including the open-source code, is available at https://sites.google.com/view/continualworld.
Published: 2022

32. Głębokie uczenie : wprowadzenie

Author: Tabor, Jacek, Śmieja, Marek, Struski, Łukasz, Spurek, Przemysław, and Wołczyk, Maciej
Published: 2022

33. Remember More by Recalling Less: Investigating the Role of Batch Size in Continual Learning with Experience Replay (Student Abstract)

Author: Wołczyk, Maciej, primary and Krutsylo, Andrii, additional
Published: 2021
Full Text: View/download PDF

34. Deep learning-based initialization for object packing

Author: Wołczyk, Maciej, primary
Published: 2018
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

34 results on '"Wołczyk, Maciej"'

1. BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

2. Seeing Through Their Eyes: Evaluating Visual Perspective Taking in Vision Language Models

3. State Soup: In-Context Skill Learning, Retrieval and Mixing

4. AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale

5. Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

6. AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale

7. Discovering modular solutions that generalize compositionally

8. The Effectiveness of World Models for Continual Reinforcement Learning

9. Disentangling Transfer in Continual Reinforcement Learning

10. Hebbian Continual Representation Learning

11. Continual Learning with Guarantees via Weight Interval Constraints

12. On the relationship between disentanglement and multi-task learning

13. SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies

14. Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

15. PluGeN: Multi-Label Conditional Generation From Pre-Trained Models

16. Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

17. Continual World: A Robotic Benchmark For Continual Reinforcement Learning

18. On the Relationship Between Disentanglement and Multi-task Learning

19. Finding the Optimal Network Depth in Classification Tasks

20. Zero time waste in pre-trained early exit neural networks

21. Biologically-Inspired Spatial Neural Networks

22. SeGMA: Semi-Supervised Gaussian Mixture Auto-Encoder

23. Hypernetwork functional image representation

24. On the Relationship Between Disentanglement and Multi-task Learning

25. Finding the Optimal Network Depth in Classification Tasks

26. Multi-Label Conditional Generation From Pre-Trained Models

27. Finding the Optimal Network Depth in Classification Tasks

28. Hypernetwork Functional Image Representation

29. Hebbian Continual Representation Learning

30. PluGeN: Multi-Label Conditional Generation from Pre-trained Models

31. Continual World : a robotic benchmark for continual reinforcement learning

32. Głębokie uczenie : wprowadzenie

33. Remember More by Recalling Less: Investigating the Role of Batch Size in Continual Learning with Experience Replay (Student Abstract)

34. Deep learning-based initialization for object packing

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

34 results on '"Wołczyk, Maciej"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources