8 results on '"Andy S. Alic"'
Search Results
2. D4.1 Initial Repository Deployment
- Author
-
Ignacio Blanquer, Damià Segrelles, Pau Lozano, Sergio López Huguet, and Andy S. Alic
- Subjects
Cancer Images, Cloud, Data Repositories - Abstract
The first version of the CHAIMELEON repository deployed includes a set of resources and services that fulfil the most important use cases identified. A cloud-based architecture has been defined, implemented and deployed on UPV’s cloud premises. The architecture components are described using an infrastructure-as-code approach so it can be deployed in any other cloud offering supported. The CHAIMELEON repository is defined, taking into account privacy and security by design on the access to the data stored. This is implemented around three main concepts: the user role, which defines the actions and permissions of the different actors interacting with the repository; the dataset, which is a citable set of medical images and associated clinical data; and the processing applications, which are applications that run in the platform to access and process the data from the datasets. The CHAIMELEON repository has undergone a Data Protection Impact Assessment (DPIA) risk analysis which led to several refinements and improvements. With this scenario in mind, the initial version of the CHAIMELEON repository provides the management of the life-cycle of the datasets, the access to the data and its processing using efficient and user-friendly applications from a catalogue. This functionality is demonstrated in two videos showing the interactions to register in the platform and access the data (https://www.youtube.com/watch?v=cIyiAPl9ezw&t=398s) and several operations concerning the management of the life cycle (https://www.youtube.com/watch?v=oH4glJNxPDc&t=2s).
- Published
- 2023
- Full Text
- View/download PDF
3. Tracer: A multi-blockchain traceability system with applications in healthcare
- Author
-
Sergio López-Huguet, Pau Lozano, J. Damian Segrelles Quilis, Ignacio Blanquer Espert, and Andy S Alic
- Published
- 2022
- Full Text
- View/download PDF
4. A cloud-based framework for machine learning workloads and applications
- Author
-
Viet Tran, Doina Cristina Duma, Stefan Dlugolinsky, L. Lloret Iglesias, V. Kozlov, Zděnek Šustr, Marica Antonacci, Wolfgang zu Castell, Germán Moltó, Pawel Wolniewicz, Andy S. Alic, Miguel Caballer, Jorge Gomes, Marcin Plociennik, Jesús Marco de Lucas, Ignacio Heredia Cacha, Isabel Campos Plasencia, Giang Nguyen, Mario David, Álvaro López García, Marcus Hardt, Keiichi Ito, Alessandro Costantini, Giacinto Donvito, Pablo Orviz Fernández, European Commission, López García, Álvaro [0000-0002-0013-4602], Marco, Jesús [0000-0001-7914-8494], Lloret Iglesias, Lara [0000-0002-0157-4765], Campos, Isabel [0000-0002-9350-0383], Heredia, Ignacio [0000-0001-6317-7100], Orviz, Pablo [0000-0002-2473-6405], López García, Álvaro, Marco, Jesús, Lloret Iglesias, Lara, Campos, Isabel, Heredia, Ignacio, and Orviz, Pablo
- Subjects
0209 industrial biotechnology ,Service (systems architecture) ,General Computer Science ,Cover (telecommunications) ,Computer science ,Cloud computing ,02 engineering and technology ,Machine learning ,computer.software_genre ,Set (abstract data type) ,03 medical and health sciences ,distributed computing ,020901 industrial engineering & automation ,CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL ,General Materials Science ,Serverless architectures ,DevOps ,030304 developmental biology ,0303 health sciences ,business.industry ,Deep learning ,DATA processing & computer science ,General Engineering ,deep learning ,Computers and information processing ,Distributed computing ,Cloud Computing ,Computers And Information Processing ,Deep Learning ,Distributed Computing ,Machine Learning ,Serverless Architectures ,machine learning ,serverless architectures ,computers and information processing ,Artificial intelligence ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,ddc:004 ,business ,computer ,lcsh:TK1-9971 - Abstract
In this paper we propose a distributed architecture to provide machine learning practitioners with a set of tools and cloud services that cover the whole machine learning development cycle: ranging from the models creation, training, validation and testing to the models serving as a service, sharing and publication. In such respect, the DEEP-Hybrid-DataCloud framework allows transparent access to existing e-Infrastructures, effectively exploiting distributed resources for the most compute-intensive tasks coming from the machine learning development cycle. Moreover, it provides scientists with a set of Cloud-oriented services to make their models publicly available, by adopting a serverless architecture and a DevOps approach, allowing an easy share, publish and deploy of the developed models., This work was supported by the project DEEP-Hybrid-DataCloud ‘‘Designing and Enabling E-infrastructures for intensive Processing in a Hybrid DataCloud’’ that has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant 777435.
- Published
- 2020
5. Easy One-Step Amplification and Labeling Procedure for Copy Number Variation Detection
- Author
-
Carmen Ivorra, Ana B. García-García, Andy S. Alic, José M. Juanes, José T. Real, Sergio Martínez-Hervás, Pablo Marin, Maria D. Olivares, Veronica Gonzalez-Albert, Blanca Navarro, Alicia Serrano, Laura Olivares, Felipe J. Chaves, Veronica Lendinez, and Sebastian Blesa
- Subjects
0301 basic medicine ,DNA Copy Number Variations ,Clinical Biochemistry ,Computational biology ,Polymerase Chain Reaction ,03 medical and health sciences ,0302 clinical medicine ,Humans ,Multiplex ,Multiplex ligation-dependent probe amplification ,Copy-number variation ,In Situ Hybridization, Fluorescence ,Fluorescent Dyes ,Chemistry ,Biochemistry (medical) ,Sequence Analysis, DNA ,Amplicon ,Chromosome 17 (human) ,MSH6 ,DNA sequencer ,030104 developmental biology ,Receptors, LDL ,MSH2 ,030220 oncology & carcinogenesis ,DNA Probes ,Multiplex Polymerase Chain Reaction - Abstract
Background The specific characteristics of copy number variations (CNVs) require specific methods of detection and characterization. We developed the Easy One-Step Amplification and Labeling procedure for CNV detection (EOSAL-CNV), a new method based on proportional amplification and labeling of amplicons in 1 PCR. Methods We used tailed primers for specific amplification and a pair of labeling probes (only 1 labeled) for amplification and labeling of all amplicons in just 1 reaction. Products were loaded directly onto a capillary DNA sequencer for fragment sizing and quantification. Data obtained could be analyzed by Microsoft Excel spreadsheet or EOSAL-CNV analysis software. We developed the protocol using the LDLR (low density lipoprotein receptor) gene including 23 samples with 8 different CNVs. After optimizing the protocol, it was used for genes in the following multiplexes: BRCA1 (BRCA1 DNA repair associated), BRCA2 (BRCA2 DNA repair associated), CHEK2 (checkpoint kinase 2), MLH1 (mutL homolog 1) plus MSH6 (mutS homolog 6), MSH2 (mutS homolog 2) plus EPCAM (epithelial cell adhesion molecule) and chromosome 17 (especially the TP53 [tumor protein 53] gene). We compared our procedure with multiplex ligation-dependent probe amplification (MLPA). Results The simple procedure for CNV detection required 150 min, with 240 samples, EOSAL-CNV excluded the presence of CNVs in all controls, and in all cases, results were identical using MLPA and EOSAL-CNV. Analysis of the 17p region in tumor samples showed 100% similarity between fluorescent in situ hybridization and EOSAL-CNV. Conclusions EOSAL-CNV allowed reliable, fast, easy detection and characterization of CNVs. It provides an alternative to targeted analysis methods such as MLPA.
- Published
- 2019
6. MuffinInfo: HTML5-Based Statistics Extractor from Next-Generation Sequencing Data
- Author
-
Ignacio Blanquer and Andy S. Alic
- Subjects
0301 basic medicine ,FASTQ format ,Computer science ,computer.software_genre ,03 medical and health sciences ,Software ,Histogram ,Statistics ,Escherichia coli ,Genetics ,Molecular Biology ,HTML5 ,business.industry ,Escherichia coli Proteins ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Sequence Analysis, DNA ,Ion semiconductor sequencing ,Computational Mathematics ,030104 developmental biology ,Computational Theory and Mathematics ,Modeling and Simulation ,A priori and a posteriori ,Nanopore sequencing ,Data mining ,Line (text file) ,business ,computer - Abstract
Usually, the information known a priori about a newly sequenced organism is limited. Even resequencing the same organism can generate unpredictable output. We introduce MuffinInfo, a FastQ/Fasta/SAM information extractor implemented in HTML5 capable of offering insights into next-generation sequencing (NGS) data. Our new tool can run on any software or hardware environment, in command line or graphically, and in browser or standalone. It presents information such as average length, base distribution, quality scores distribution, k-mer histogram, and homopolymers analysis. MuffinInfo improves upon the existing extractors by adding the ability to save and then reload the results obtained after a run as a navigable file (also supporting saving pictures of the charts), by supporting custom statistics implemented by the user, and by offering user-adjustable parameters involved in the processing, all in one software. At the moment, the extractor works with all base space technologies such as Illumina, Roche, Ion Torrent, Pacific Biosciences, and Oxford Nanopore. Owing to HTML5, our software demonstrates the readiness of web technologies for mild intensive tasks encountered in bioinformatics.
- Published
- 2016
- Full Text
- View/download PDF
7. BIGSEA: A Big Data analytics platform for public transportation information
- Author
-
Dorgival Guedes, Sandro Fiore, Rosa M. Badia, Nazareno Andrade, Nádia P. Kozievitch, Walter Abrahão dos Santos, Tarciso Braz, Giovanni Aloisio, Paulo Silva, Marco Vieira, Danilo Ardagna, Fábio Morais, Nuno Antunes, Jussara M. Almeida, Daniele Lezzi, Demetrio Gomes Mestre, Andy S. Alic, Wagner Meira, Tânia Basso, Carlos Eduardo Santos Pires, Ignacio Blanquer, Matheus Maciel, Regina Moraes, Donatello Elia, Andrey Brito, Marco Lattuada, European Commission, Ministério da Ciência, Tecnologia e Inovação (Brasil), Almeida, Jussara [0000-0001-9142-2919], Antunes, Nuno [0000-0002-6044-4012], Ardagna, Danilo [0000-0003-4224-927X], Badia, Rosa M. [0000-0003-2941-5499], Braz, Tarciso [0000-0001-8620-3877], Lattuada, Marco [0000-0003-0062-6049], Lezzi, Daniele [0000-0001-5081-7244], Mestre, Demetrio [0000-0003-4727-3340], Moraes, Regina [0000-0003-0678-4777], Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors, Barcelona Supercomputing Center, Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions, Almeida, Jussara, Antunes, Nuno, Ardagna, Danilo, Badia, Rosa M., Braz, Tarciso, Lattuada, Marco, Lezzi, Daniele, Mestre, Demetrio, Moraes, Regina, Alic, A. S., Almeida, J., Aloisio, G., Andrade, N., Antunes, N., Ardagna, D., Badia, R. M., Basso, T., Blanquer, I., Braz, T., Brito, A., Elia, D., Fiore, S., Guedes, D., Lattuada, M., Lezzi, D., Maciel, M., Meira, W., Mestre, D., Moraes, R., Morais, F., Pires, C. E., Kozievitch, N. P., Santos, W. D., Silva, P., and Vieira, M.
- Subjects
Computació en núvol ,Computer Networks and Communications ,Computer science ,Performance ,Deployment ,Big data ,Library science ,Transport ,Transportation ,02 engineering and technology ,Workflows ,11. Sustainability ,CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL ,0202 electrical engineering, electronic engineering, information engineering ,Cloud computing ,European commission ,Informàtica::Arquitectura de computadors [Àrees temàtiques de la UPC] ,business.industry ,Macrodades ,020206 networking & telecommunications ,Workflow ,Work (electrical) ,Hardware and Architecture ,Software deployment ,Public transport ,020201 artificial intelligence & image processing ,business ,Software - Abstract
Analysis of public transportation data in large cities is a challenging problem. Managing data ingestion, data storage, data quality enhancement, modelling and analysis requires intensive computing and a non-trivial amount of resources. In EUBra-BIGSEA (Europe–Brazil Collaboration of Big Data Scientific Research Through Cloud-Centric Applications) we address such problems in a comprehensive and integrated way. EUBra-BIGSEA provides a platform for building up data analytic workflows on top of elastic cloud services without requiring skills related to either programming or cloud services. The approach combines cloud orchestration, Quality of Service and automatic parallelisation on a platform that includes a toolbox for implementing privacy guarantees and data quality enhancement as well as advanced services for sentiment analysis, traffic jam estimation and trip recommendation based on estimated crowdedness., The work shown in this article has been funded jointly by European Commission under the Cooperation Programme, Horizon2020 grant agreement No 690116 (EUBra-BIGSEA) and the Min-istériode Ciência,Tecnologiae Inovação(MCTI) from Brazil
- Published
- 2019
8. Objective review ofde novostand-alone error correction methods for NGS data
- Author
-
Andy S. Alic, Joaquín Dopazo, David Ruzafa, and Ignacio Blanquer
- Subjects
0301 basic medicine ,030102 biochemistry & molecular biology ,Burrows–Wheeler transform ,Computer science ,Suite ,Probabilistic logic ,computer.software_genre ,Biochemistry ,Field (computer science) ,Computer Science Applications ,03 medical and health sciences ,Computational Mathematics ,030104 developmental biology ,Materials Chemistry ,Data mining ,Physical and Theoretical Chemistry ,Suffix ,Error detection and correction ,Cluster analysis ,computer ,Reference genome - Abstract
The sequencing market has increased steadily over the last few years, with different approaches to read DNA information prone to different types of errors. Multiple studies demonstrated the impact of sequencing errors on different applications of next-generation sequencing (NGS), making error correction a fundamental initial step. Different methods in the literature use different approaches and fit different types of problems. We analyzed 50 methods divided into five main approaches (k-spectrum, suffix arrays, multiple-sequence alignment, read clustering, and probabilistic models). They are not published as a part of a suite (stand-alone), and target raw, unprocessed data without an existing reference genome (de novo). These correctors handle one or more sequencing technologies using the same or different approaches. They face general challenges (sometimes with specific traits for specific technologies) such as repetitive regions, uncalled bases, and ploidy. Even assessing their performance is a challenge in itself because of the approach taken by various authors, the unknown factor (de novo), and the behavior of the third-party tools employed in the benchmarks. This study aims to help the researcher in the field to advance the field of error correction, the educator to have a brief but comprehensive companion, and the bioinformatician to choose the right tool for the right job. WIREs Comput Mol Sci 2016, 6:111–146. doi: 10.1002/wcms.1239 For further resources related to this article, please visit the WIREs website.
- Published
- 2016
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.