11 results on '"Paulovich A"'
Search Results
2. HardVis: Visual Analytics to Handle Instance Hardness Using Undersampling and Oversampling Techniques.
- Author
-
Chatzimparmpas, A., Paulovich, F. V., and Kerren, A.
- Subjects
- *
MACHINE learning , *VISUAL analytics , *HARDNESS , *SYSTEMS design , *PREDICTIVE tests , *DATA distribution - Abstract
Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses challenges in many real‐world applications. Among a series of diverse techniques to solve this problem, sampling algorithms are regarded as an efficient solution. However, the problem is more fundamental, with many works emphasizing the importance of instance hardness. This issue refers to the significance of managing unsafe or potentially noisy instances that are more likely to be misclassified and serve as the root cause of poor classification performance. This paper introduces HardVis, a visual analytics system designed to handle instance hardness mainly in imbalanced classification scenarios. Our proposed system assists users in visually comparing different distributions of data types, selecting types of instances based on local characteristics that will later be affected by the active sampling method, and validating which suggestions from undersampling or oversampling techniques are beneficial for the ML model. Additionally, rather than uniformly undersampling/oversampling a specific class, we allow users to find and sample easy and difficult to classify training instances from all classes. Users can explore subsets of data from different perspectives to decide all those parameters, while HardVis keeps track of their steps and evaluates the model's predictive performance in a test set separately. The end result is a well‐balanced data set that boosts the predictive power of the ML model. The efficacy and effectiveness of HardVis are demonstrated with a hypothetical usage scenario and a use case. Finally, we also look at how useful our system is based on feedback we received from ML experts. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
3. User-driven Feature Space Transformation
- Author
-
G. M. H. Mamani, Fernando V. Paulovich, Francisco M. Fatore, and Luis Gustavo Nonato
- Subjects
Transformation (function) ,Theoretical computer science ,SIMPLE (military communications protocol) ,Human–computer interaction ,Feature (computer vision) ,Computer science ,Feature vector ,Relevance (information retrieval) ,Space (commercial competition) ,Computer Graphics and Computer-Aided Design ,Interactive visualization ,Image retrieval - Abstract
Interactive visualization systems for exploring and manipulating high-dimensional feature spaces have experienced a substantial progress in the last few years. State-of-art methods rely on solid mathematical and computational foundations that enable sophisticated and flexible interactive tools. Current methods are even capable of modifying data attributes during interaction, highlighting regions of potential interest in the feature space, and building visualizations that bring out the relevance of attributes. However, those methodologies rely on complex and non-intuitive interfaces that hamper the free handling of the feature spaces. Moreover, visualizing how neighborhood structures are affected during the space manipulation is also an issue for existing methods. This paper presents a novel visualization-assisted methodology for interacting and transforming data attributes embedded in feature spaces. The proposed approach relies on a combination of multidimensional projections and local transformations to provide an interactive mechanism for modifying attributes. Besides enabling a simple and intuitive visual layout, our approach allows the user to easily observe the changes in neighborhood structures during interaction. The usefulness of our methodology is shown in an application geared to image retrieval.
- Published
- 2013
- Full Text
- View/download PDF
4. Semantic Wordification of Document Collections
- Author
-
Luis Gustavo Nonato, Guilherme P. Telles, Rosane Minghim, Franklina Maria Bragion de Toledo, and Fernando V. Paulovich
- Subjects
Set (abstract data type) ,Information retrieval ,Computer science ,Point cloud ,Tag cloud ,Representation (mathematics) ,Projection (set theory) ,Computer Graphics and Computer-Aided Design ,Word (computer architecture) ,Visualization - Abstract
Word clouds have become one of the most widely accepted visual resources for document analysis and visualization, motivating the development of several methods for building layouts of keywords extracted from textual data. Existing methods are effective to demonstrate content, but are not capable of preserving semantic relationships among keywords while still linking the word cloud to the underlying document groups that generated them. Such representation is highly desirable for exploratory analysis of document collections. In this paper we present a novel approach to build document clouds, named ProjCloud that aim at solving both semantical layouts and linking with document sets. ProjCloud generates a semantically consistent layout from a set of documents. Through a multidimensional projection, it is possible to visualize the neighborhood relationship between highly related documents and their corresponding word clouds simultaneously. Additionally, we propose a new algorithm for building word clouds inside polygons, which employs spectral sorting to maintain the semantic relationship among words. The effectiveness and flexibility of our methodology is confirmed when comparisons are made to existing methods. The technique automatically constructs projection based layouts the user may choose to examine in the form of the point clouds or corresponding word clouds, allowing a high degree of control over the exploratory process. © 2012 Wiley Periodicals, Inc.
- Published
- 2012
- Full Text
- View/download PDF
5. Employing 2D Projections for Fast Visual Exploration of Large Fiber Tracking Data
- Author
-
Fernando V. Paulovich, Rosane Minghim, Jorge Poco, and Danilo Medeiros Eler
- Subjects
Flexibility (engineering) ,Computer science ,Fiber (mathematics) ,business.industry ,Feature vector ,Representation (systemics) ,Computer vision ,Artificial intelligence ,Object (computer science) ,Projection (set theory) ,business ,Curvature ,Computer Graphics and Computer-Aided Design - Abstract
Fiber tracts detection is an increasingly common technology for diagnosis and also understanding of brain function. Although tools for tracing and presenting brain fibers are advanced, it is still difficult for physicians or students to explore the dataset in 3D due to their intricate topology. In this work we present a visual exploration approach for fiber tracts data aimed at supporting exploration of such data. The work employs a local, precise and fast 2D multidimensional projection technique that allows a large number of fibers to be handled simultaneously and to select groups of bundled fibers for further exploration. In this approach, a DTI feature dataset, including curvature as well as spatial features, is projected on a 2D or 3D view. By handling groups formed in this view, exploration is linked to corresponding brain fibers in object space. The link exists in both directions and fibers selected in object space are also mapped to feature space. Our approach also allows users to modify the projection, controlling and improving, if necessary, the definition of groups of fibers for small and large datasets, due to the local nature of the projection. Compared to other related work, the method presented here is faster for creating visual representations, making it possible to explore complete sets of fibers tracts up to 250K fibers, which was not possible previously. Additionally, the ability to change configuration of the feature space representation adds a high degree of flexibility to the process. © 2012 Wiley Periodicals, Inc.
- Published
- 2012
- Full Text
- View/download PDF
6. A Framework for Exploring Multidimensional Data with 3D Projections
- Author
-
Jorge Poco, Paul Rosenthal, Rosane Minghim, Fernando V. Paulovich, Ronak Etemadpour, Lars Linsen, Tran Van Long, and Maria Cristina Ferreira de Oliveira
- Subjects
Similarity (geometry) ,Computer science ,business.industry ,Feature vector ,Visual space ,Machine learning ,computer.software_genre ,Computer Graphics and Computer-Aided Design ,Rendering (computer graphics) ,Visualization ,Data point ,Projection (mathematics) ,Artificial intelligence ,Data mining ,business ,computer ,Spatial analysis - Abstract
Visualization of high-dimensional data requires a mapping to a visual space. Whenever the goal is to preserve similarity relations a frequent strategy is to use 2D projections, which afford intuitive interactive exploration, e.g., by users locating and selecting groups and gradually drilling down to individual objects. In this paper, we propose a framework for projecting high-dimensional data to 3D visual spaces, based on a generalization of the Least-Square Projection (LSP). We compare projections to 2D and 3D visual spaces both quantitatively and through a user study considering certain exploration tasks. The quantitative analysis confirms that 3D projections outperform 2D projections in terms of precision. The user study indicates that certain tasks can be more reliably and confidently answered with 3D projections. Nonetheless, as 3D projections are displayed on 2D screens, interaction is more difficult. Therefore, we incorporate suitable interaction functionalities into a framework that supports 3D transformations, predefined optimal 2D views, coordinated 2D and 3D views, and hierarchical 3D cluster definition and exploration. For visually encoding data clusters in a 3D setup, we employ color coding of projected data points as well as four types of surface renderings. A second user study evaluates the suitability of these visual encodings. Several examples illustrate the framework's applicability for both visual exploration of multidimensional abstract (non-spatial) data as well as the feature space of multi-variate spatial data.
- Published
- 2011
- Full Text
- View/download PDF
7. User-driven Feature Space Transformation
- Author
-
Mamani, G. M. H., primary, Fatore, F. M., additional, Nonato, L. G., additional, and Paulovich, F. V., additional
- Published
- 2013
- Full Text
- View/download PDF
8. Employing 2D Projections for Fast Visual Exploration of Large Fiber Tracking Data
- Author
-
Poco, Jorge, primary, Eler, Danilo M., additional, Paulovich, Fernando V., additional, and Minghim, Rosane, additional
- Published
- 2012
- Full Text
- View/download PDF
9. Semantic Wordification of Document Collections
- Author
-
Paulovich, Fernando V., primary, Toledo, Franklina M. B., additional, Telles, Guilherme P., additional, Minghim, Rosane, additional, and Nonato, Luis Gustavo, additional
- Published
- 2012
- Full Text
- View/download PDF
10. A Framework for Exploring Multidimensional Data with 3D Projections
- Author
-
Poco, J., primary, Etemadpour, R., additional, Paulovich, F.V., additional, Long, T.V., additional, Rosenthal, P., additional, Oliveira, M.C.F., additional, Linsen, L., additional, and Minghim, R., additional
- Published
- 2011
- Full Text
- View/download PDF
11. Piece wise Laplacian‐based Projection for Interactive Data Exploration and Organization
- Author
-
Paulovich, F.V., primary, Eler, D.M., additional, Poco, J., additional, Botha, C.P., additional, Minghim, R., additional, and Nonato, L.G., additional
- Published
- 2011
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.