8 results on '"Thales Francisco Mota Carvalho"'
Search Results
2. Iterated Local Search Based Heuristic for Scheduling Jobs on Unrelated Parallel Machines with Machine Deterioration Effect.
- Author
-
Vívian Ludimila Aguiar Santos, José Elías C. Arroyo, and Thales Francisco Mota Carvalho
- Published
- 2016
- Full Text
- View/download PDF
3. HISTÓRICO DAS MULHERES NA TECNOLOGIA DA INFORMAÇÃO E ANÁLISE DA PARTICIPAÇÃO FEMININA NOS CURSOS SUPERIORES DO BRASIL
- Author
-
Thales Francisco Mota Carvalho, Maria do Socorro Vieira Barreto, and Vívian Ludimila Aguiar Santos
- Published
- 2021
4. Multi-objective Iterated Local Search based on decomposition for job scheduling problems with machine deterioration effect
- Author
-
Vívian Ludimila Aguiar Santos, Luciana Assis, Frederico Guimarães, Miri Weiss Cohen, and Thales Francisco Mota Carvalho
- Subjects
Artificial Intelligence ,Control and Systems Engineering ,Electrical and Electronic Engineering - Published
- 2022
5. Fangorn Forest (F2): a machine learning approach to classify genes and genera in the family Geminiviridae
- Author
-
Thales Francisco Mota Carvalho, José Cleydson F. Silva, Elizabeth P. B. Fontes, and Fabio Ribeiro Cerqueira
- Subjects
Geminivirus ,machine learning ,0301 basic medicine ,Gene prediction ,Genus classification ,DNA, Satellite ,Biology ,lcsh:Computer applications to medicine. Medical informatics ,Machine learning ,computer.software_genre ,Biochemistry ,Genome ,Machine Learning ,Open Reading Frames ,User-Computer Interface ,03 medical and health sciences ,Structural Biology ,Genus ,Multilayer perceptron ,ORFS ,lcsh:QH301-705.5 ,Molecular Biology ,Internet ,Random Forest ,Support vector machines ,business.industry ,Methodology Article ,Applied Mathematics ,Plants ,Computer Science Applications ,Random forest ,Support vector machine ,Geminiviridae ,030104 developmental biology ,lcsh:Biology (General) ,ROC Curve ,Metagenomics ,Area Under Curve ,lcsh:R858-859.7 ,Gene classification ,Sequential minimal optimization ,Artificial intelligence ,business ,computer - Abstract
Background Geminiviruses infect a broad range of cultivated and non-cultivated plants, causing significant economic losses worldwide. The studies of the diversity of species, taxonomy, mechanisms of evolution, geographic distribution, and mechanisms of interaction of these pathogens with the host have greatly increased in recent years. Furthermore, the use of rolling circle amplification (RCA) and advanced metagenomics approaches have enabled the elucidation of viromes and the identification of many viral agents in a large number of plant species. As a result, determining the nomenclature and taxonomically classifying geminiviruses turned into complex tasks. In addition, the gene responsible for viral replication (particularly, the viruses belonging to the genus Mastrevirus) may be spliced due to the use of the transcriptional/splicing machinery in the host cells. However, the current tools have limitations concerning the identification of introns. Results This study proposes a new method, designated Fangorn Forest (F2), based on machine learning approaches to classify genera using an ab initio approach, i.e., using only the genomic sequence, as well as to predict and classify genes in the family Geminiviridae. In this investigation, nine genera of the family Geminiviridae and their related satellite DNAs were selected. We obtained two training sets, one for genus classification, containing attributes extracted from the complete genome of geminiviruses, while the other was made up to classify geminivirus genes, containing attributes extracted from ORFs taken from the complete genomes cited above. Three ML algorithms were applied on those datasets to build the predictive models: support vector machines, using the sequential minimal optimization training approach, random forest (RF), and multilayer perceptron. RF demonstrated a very high predictive power, achieving 0.966, 0.964, and 0.995 of precision, recall, and area under the curve (AUC), respectively, for genus classification. For gene classification, RF could reach 0.983, 0.983, and 0.998 of precision, recall, and AUC, respectively. Conclusions Therefore, Fangorn Forest is proven to be an efficient method for classifying genera of the family Geminiviridae with high precision and effective gene prediction and classification. The method is freely accessible at www.geminivirus.org:8080/geminivirusdw/discoveryGeminivirus.jsp. Electronic supplementary material The online version of this article (10.1186/s12859-017-1839-x) contains supplementary material, which is available to authorized users.
- Published
- 2017
6. Rama: a machine learning approach for ribosomal protein prediction in plants
- Author
-
José Cleydson F. Silva, Iara P. Calil, Fabio Ribeiro Cerqueira, Elizabeth P. B. Fontes, and Thales Francisco Mota Carvalho
- Subjects
0301 basic medicine ,Ribosomal Proteins ,lcsh:Medicine ,Biology ,Machine learning ,computer.software_genre ,Ribosome ,Article ,Machine Learning ,03 medical and health sciences ,Rama ,Ribosomal protein ,lcsh:Science ,Plant Proteins ,Multidisciplinary ,business.industry ,lcsh:R ,RNA-Binding Proteins ,Translation (biology) ,030104 developmental biology ,lcsh:Q ,Artificial intelligence ,business ,computer ,Function (biology) - Abstract
Ribosomal proteins (RPs) play a fundamental role within all type of cells, as they are major components of ribosomes, which are essential for translation of mRNAs. Furthermore, these proteins are involved in various physiological and pathological processes. The intrinsic biological relevance of RPs motivated advanced studies for the identification of unrevealed RPs. In this work, we propose a new computational method, termed Rama, for the prediction of RPs, based on machine learning techniques, with a particular interest in plants. To perform an effective classification, Rama uses a set of fundamental attributes of the amino acid side chains and applies a two-step procedure to classify proteins with unknown function as RPs. The evaluation of the resultant predictive models showed that Rama could achieve mean sensitivity, precision, and specificity of 0.91, 0.91, and 0.82, respectively. Furthermore, a list of proteins that have no annotation in Phytozome v.10, and are annotated as RPs in Phytozome v.12, were correctly classified by our models. Additional computational experiments have also shown that Rama presents high accuracy to differentiate ribosomal proteins from RNA-binding proteins. Finally, two novel proteins of Arabidopsis thaliana were validated in biological experiments. Rama is freely available at http://inctipp.bioagro.ufv.br:8080/Rama.
- Published
- 2017
7. Geminivirus data warehouse: a database enriched with machine learning approaches
- Author
-
Roberto Ramos Sobrinho, Michihito Deguchi, Anésia A. Santos, José Cleydson F. Silva, Elizabeth P. B. Fontes, Pedro Marcus Pereira Vidigal, Marcos Fernando Basso, Fabyano Fonseca e Silva, Francisco Murilo Zerbini, Otávio J. B. Brustolini, Welison A. Pereira, Renildes Lúcio Ferreira Fontes, Maximiller Dal-Bianco, Thales Francisco Mota Carvalho, and Fabio Ribeiro Cerqueira
- Subjects
0301 basic medicine ,Computer science ,computer.software_genre ,Biochemistry ,Genome ,Transcriptome ,Machine Learning ,chemistry.chemical_compound ,Knowledge discovery ,Structural Biology ,Genus ,Databases, Genetic ,lcsh:QH301-705.5 ,Phylogeny ,Genomic organization ,Database ,biology ,Applied Mathematics ,Plants ,Data warehouse ,Computer Science Applications ,Geminiviridae ,lcsh:R858-859.7 ,Geminivirus ,Algorithms ,Data Warehouse ,DNA, Single-Stranded ,Genomics ,Context (language use) ,Machine learning ,lcsh:Computer applications to medicine. Medical informatics ,03 medical and health sciences ,Open Reading Frames ,Phylogenetics ,Molecular Biology ,Data mining ,Random Forest ,business.industry ,Host (biology) ,Computational Biology ,biology.organism_classification ,030104 developmental biology ,chemistry ,lcsh:Biology (General) ,Vector (epidemiology) ,DNA, Viral ,Artificial intelligence ,business ,computer ,DNA ,Random forest - Abstract
Background The Geminiviridae family encompasses a group of single-stranded DNA viruses with twinned and quasi-isometric virions, which infect a wide range of dicotyledonous and monocotyledonous plants and are responsible for significant economic losses worldwide. Geminiviruses are divided into nine genera, according to their insect vector, host range, genome organization, and phylogeny reconstruction. Using rolling-circle amplification approaches along with high-throughput sequencing technologies, thousands of full-length geminivirus and satellite genome sequences were amplified and have become available in public databases. As a consequence, many important challenges have emerged, namely, how to classify, store, and analyze massive datasets as well as how to extract information or new knowledge. Data mining approaches, mainly supported by machine learning (ML) techniques, are a natural means for high-throughput data analysis in the context of genomics, transcriptomics, proteomics, and metabolomics. Results Here, we describe the development of a data warehouse enriched with ML approaches, designated geminivirus.org. We implemented search modules, bioinformatics tools, and ML methods to retrieve high precision information, demarcate species, and create classifiers for genera and open reading frames (ORFs) of geminivirus genomes. Conclusions The use of data mining techniques such as ETL (Extract, Transform, Load) to feed our database, as well as algorithms based on machine learning for knowledge extraction, allowed us to obtain a database with quality data and suitable tools for bioinformatics analysis. The Geminivirus Data Warehouse (geminivirus.org) offers a simple and user-friendly environment for information retrieval and knowledge discovery related to geminiviruses. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1646-4) contains supplementary material, which is available to authorized users.
- Published
- 2016
8. Iterated Local Search Based Heuristic for Scheduling Jobs on Unrelated Parallel Machines with Machine Deterioration Effect
- Author
-
Jose Elias Claudio Arroyo, Vívian Ludimila Aguiar Santos, and Thales Francisco Mota Carvalho
- Subjects
0209 industrial biotechnology ,Mathematical optimization ,021103 operations research ,Job shop scheduling ,business.industry ,Computer science ,Heuristic ,Iterated local search ,0211 other engineering and technologies ,02 engineering and technology ,Scheduling (computing) ,020901 industrial engineering & automation ,Local search (optimization) ,business ,Metaheuristic ,Random variable - Abstract
In this research, we study an unrelated parallel machine scheduling problem in which the jobs cause deterioration of the machines. This deterioration decreases the performance of the machines, therefore the processing times of the jobs are increased over time. The problem is to find the processing sequence of jobs on each machine in order to reduce the deterioration of the machines and consequently minimize the makespan. Given that the problem is NP-hard, we propose an Iterated Local Search (ILS) heuristic to obtain near-optimal solutions. In this work we combine ILS meta-heuristic with Random Variable Neighborhood Descent (RVND) local search. The performance of our heuristic is compared with the state-of-the-art meta-heuristic algorithm proposed in the literature for the problem under study. The results show that the our heuristic outperform the existing algorithm by a significant margin.
- Published
- 2016
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.