29 results on '"Mamana Mbiyavanga"'
Search Results
2. Assessing HLA imputation accuracy in a West African population.
- Author
-
Ruth Nanjala, Mamana Mbiyavanga, Suhaila Hashim, Santie de Villiers, and Nicola Mulder
- Subjects
Medicine ,Science - Abstract
The Human Leukocyte Antigen (HLA) region plays an important role in autoimmune and infectious diseases. HLA is a highly polymorphic region and thus difficult to impute. We, therefore, sought to evaluate HLA imputation accuracy, specifically in a West African population, since they are understudied and are known to harbor high genetic diversity. The study sets were selected from 315 Gambian individuals within the Gambian Genome Variation Project (GGVP) Whole Genome Sequence datasets. Two different arrays, Illumina Omni 2.5 and Human Hereditary and Health in Africa (H3Africa), were assessed for the appropriateness of their markers, and these were used to test several imputation panels and tools. The reference panels were chosen from the 1000 Genomes (1kg-All), 1000 Genomes African (1kg-Afr), 1000 Genomes Gambian (1kg-Gwd), H3Africa, and the HLA Multi-ethnic datasets. HLA-A, HLA-B, and HLA-C alleles were imputed using HIBAG, SNP2HLA, CookHLA, and Minimac4, and concordance rate was used as an assessment metric. The best performing tool was found to be HIBAG, with a concordance rate of 0.84, while the best performing reference panel was the H3Africa panel, with a concordance rate of 0.62. Minimac4 (0.75) was shown to increase HLA-B allele imputation accuracy compared to HIBAG (0.71), SNP2HLA (0.51) and CookHLA (0.17). The H3Africa and Illumina Omni 2.5 array performances were comparable, showing that genotyping arrays have less influence on HLA imputation in West African populations. The findings show that using a larger population-specific reference panel and the HIBAG tool improves the accuracy of HLA imputation in a West African population.
- Published
- 2023
- Full Text
- View/download PDF
3. The H3ABioNet helpdesk: an online bioinformatics resource, enhancing Africa’s capacity for genomics research
- Author
-
Judit Kumuthini, Lyndon Zass, Sumir Panji, Samson P. Salifu, Jonathan K. Kayondo, Victoria Nembaware, Mamana Mbiyavanga, Ajayi Olabode, Ali Kishk, Gordon Wells, Nicola J. Mulder, and as members of the Sustainability and Outreach Work Package of the H3ABioNet Consortium
- Subjects
Bioinformatics support ,Genomics support ,H3Africa ,H3ABioNet ,Helpdesk ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Currently, formal mechanisms for bioinformatics support are limited. The H3Africa Bioinformatics Network has implemented a public and freely available Helpdesk (HD), which provides generic bioinformatics support to researchers through an online ticketing platform. The following article reports on the H3ABioNet HD (H3A-HD)‘s development, outlining its design, management, usage and evaluation framework, as well as the lessons learned through implementation. Results The H3A-HD evaluated using automatically generated usage logs, user feedback and qualitative ticket evaluation. Evaluation revealed that communication methods, ticketing strategies and the technical platforms used are some of the primary factors which may influence the effectivity of HD. Conclusion To continuously improve the H3A-HD services, the resource should be regularly monitored and evaluated. The H3A-HD design, implementation and evaluation framework could be easily adapted for use by interested stakeholders within the Bioinformatics community and beyond.
- Published
- 2019
- Full Text
- View/download PDF
4. The Extent and Impact of Variation in ADME Genes in Sub-Saharan African Populations
- Author
-
Jorge E. B. da Rocha, Houcemeddine Othman, Gerrit Botha, Laura Cottino, David Twesigomwe, Samah Ahmed, Britt I. Drögemöller, Faisal M. Fadlelmola, Philip Machanick, Mamana Mbiyavanga, Sumir Panji, Galen E. B. Wright, Clement Adebamowo, Mogomotsi Matshaba, Michéle Ramsay, Gustave Simo, Martin C. Simuunza, Caroline T. Tiemessen, Sandra Baldwin, Mathias Chiano, Charles Cox, Annette S. Gross, Pamela Thomas, Francisco-Javier Gamo, and Scott Hazelhurst
- Subjects
ADME ,genetic diversity ,Africa ,pharmacogenomics ,CNV ,Therapeutics. Pharmacology ,RM1-950 - Abstract
Introduction: Investigating variation in genes involved in the absorption, distribution, metabolism, and excretion (ADME) of drugs are key to characterizing pharmacogenomic (PGx) relationships. ADME gene variation is relatively well characterized in European and Asian populations, but data from African populations are under-studied—which has implications for drug safety and effective use in Africa.Results: We identified significant ADME gene variation in African populations using data from 458 high-coverage whole genome sequences, 412 of which are novel, and from previously available African sequences from the 1,000 Genomes Project. ADME variation was not uniform across African populations, particularly within high impact coding variation. Copy number variation was detected in 116 ADME genes, with equal ratios of duplications/deletions. We identified 930 potential high impact coding variants, of which most are discrete to a single African population cluster. Large frequency differences (i.e., >10%) were seen in common high impact variants between clusters. Several novel variants are predicted to have a significant impact on protein structure, but additional functional work is needed to confirm the outcome of these for PGx use. Most variants of known clinical outcome are rare in Africa compared to European populations, potentially reflecting a clinical PGx research bias to European populations.Discussion: The genetic diversity of ADME genes across sub-Saharan African populations is large. The Southern African population cluster is most distinct from that of far West Africa. PGx strategies based on European variants will be of limited use in African populations. Although established variants are important, PGx must take into account the full range of African variation. This work urges further characterization of variants in African populations including in vitro and in silico studies, and to consider the unique African ADME landscape when developing precision medicine guidelines and tools for African populations.
- Published
- 2021
- Full Text
- View/download PDF
5. Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics
- Author
-
Shakuntala Baichoo, Yassine Souilmi, Sumir Panji, Gerrit Botha, Ayton Meintjes, Scott Hazelhurst, Hocine Bendou, Eugene de Beste, Phelelani T. Mpangase, Oussema Souiai, Mustafa Alghali, Long Yi, Brian D. O’Connor, Michael Crusoe, Don Armstrong, Shaun Aron, Fourie Joubert, Azza E. Ahmed, Mamana Mbiyavanga, Peter van Heusden, Lerato E. Magosi, Jennie Zermeno, Liudmila Sergeevna Mainzer, Faisal M. Fadlelmola, C. Victor Jongeneel, and Nicola Mulder
- Subjects
Workflows ,Pipeline ,Bioinformatics ,Africa ,Genomics ,Docker ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background The Pan-African bioinformatics network, H3ABioNet, comprises 27 research institutions in 17 African countries. H3ABioNet is part of the Human Health and Heredity in Africa program (H3Africa), an African-led research consortium funded by the US National Institutes of Health and the UK Wellcome Trust, aimed at using genomics to study and improve the health of Africans. A key role of H3ABioNet is to support H3Africa projects by building bioinformatics infrastructure such as portable and reproducible bioinformatics workflows for use on heterogeneous African computing environments. Processing and analysis of genomic data is an example of a big data application requiring complex interdependent data analysis workflows. Such bioinformatics workflows take the primary and secondary input data through several computationally-intensive processing steps using different software packages, where some of the outputs form inputs for other steps. Implementing scalable, reproducible, portable and easy-to-use workflows is particularly challenging. Results H3ABioNet has built four workflows to support (1) the calling of variants from high-throughput sequencing data; (2) the analysis of microbial populations from 16S rDNA sequence data; (3) genotyping and genome-wide association studies; and (4) single nucleotide polymorphism imputation. A week-long hackathon was organized in August 2016 with participants from six African bioinformatics groups, and US and European collaborators. Two of the workflows are built using the Common Workflow Language framework (CWL) and two using Nextflow. All the workflows are containerized for improved portability and reproducibility using Docker, and are publicly available for use by members of the H3Africa consortium and the international research community. Conclusion The H3ABioNet workflows have been implemented in view of offering ease of use for the end user and high levels of reproducibility and portability, all while following modern state of the art bioinformatics data processing protocols. The H3ABioNet workflows will service the H3Africa consortium projects and are currently in use. All four workflows are also publicly available for research scientists worldwide to use and adapt for their respective needs. The H3ABioNet workflows will help develop bioinformatics capacity and assist genomics research within Africa and serve to increase the scientific output of H3Africa and its Pan-African Bioinformatics Network.
- Published
- 2018
- Full Text
- View/download PDF
6. Proposed minimum information guideline for kidney disease—research and clinical data reporting: a cross-sectional study
- Author
-
Michael Thompson, Chirag Patel, Judit Kumuthini, Christiaan van Woerden, Andrew Mallett, Lyndon Zass, Melek Chaouch, Katherine Johnston, Mamana Mbiyavanga, Shakuntala Baichoo, Zahra Mungloo-Dilmohamud, and Nicola Mulder
- Subjects
Medicine - Abstract
Objective This project aimed to develop and propose a standardised reporting guideline for kidney disease research and clinical data reporting, in order to improve kidney disease data quality and integrity, and combat challenges associated with the management and challenges of ‘Big Data’.Methods A list of recommendations was proposed for the reporting guideline based on the systematic review and consolidation of previously published data collection and reporting standards, including PhenX measures and Minimal Information about a Proteomics Experiment (MIAPE). Thereafter, these recommendations were reviewed by domain-specialists using an online survey, developed in Research Electronic Data Capture (REDCap). Following interpretation and consolidation of the survey results, the recommendations were mapped to existing ontologies using Zooma, Ontology Lookup Service and the Bioportal search engine. Additionally, an associated eXtensible Markup Language schema was created for the REDCap implementation to increase user friendliness and adoption.Results The online survey was completed by 53 respondents; the majority of respondents were dual clinician-researchers (57%), based in Australia (35%), Africa (33%) and North America (22%). Data elements within the reporting standard were identified as participant-level, study-level and experiment-level information, further subdivided into essential or optional information.Conclusion The reporting guideline is readily employable for kidney disease research projects, and also adaptable for clinical utility. The adoption of the reporting guideline in kidney disease research can increase data quality and the value for long-term preservation, ensuring researchers gain the maximum benefit from their collected and generated data.
- Published
- 2019
- Full Text
- View/download PDF
7. Organizing and running bioinformatics hackathons within Africa: The H3ABioNet cloud computing experience [version 2; peer review: 2 approved, 1 approved with reservations]
- Author
-
Azza E. Ahmed, Phelelani T. Mpangase, Sumir Panji, Shakuntala Baichoo, Yassine Souilmi, Faisal M. Fadlelmola, Mustafa Alghali, Shaun Aron, Hocine Bendou, Eugene De Beste, Mamana Mbiyavanga, Oussema Souiai, Long Yi, Jennie Zermeno, Don Armstrong, Brian D. O'Connor, Liudmila Sergeevna Mainzer, Michael R. Crusoe, Ayton Meintjes, Peter Van Heusden, Gerrit Botha, Fourie Joubert, C. Victor Jongeneel, Scott Hazelhurst, and Nicola Mulder
- Subjects
Medicine ,Science - Abstract
The need for portable and reproducible genomics analysis pipelines is growing globally as well as in Africa, especially with the growth of collaborative projects like the Human Health and Heredity in Africa Consortium (H3Africa). The Pan-African H3Africa Bioinformatics Network (H3ABioNet) recognized the need for portable, reproducible pipelines adapted to heterogeneous computing environments, and for the nurturing of technical expertise in workflow languages and containerization technologies. Building on the network’s Standard Operating Procedures (SOPs) for common genomic analyses, H3ABioNet arranged its first Cloud Computing and Reproducible Workflows Hackathon in 2016, with the purpose of translating those SOPs into analysis pipelines able to run on heterogeneous computing environments and meeting the needs of H3Africa research projects. This paper describes the preparations for this hackathon and reflects upon the lessons learned about its impact on building the technical and scientific expertise of African researchers. The workflows developed were made publicly available in GitHub repositories and deposited as container images on Quay.io.
- Published
- 2019
- Full Text
- View/download PDF
8. Proposed Guideline for Minimum Information Stroke Research and Clinical Data Reporting
- Author
-
Judit Kumuthini, Lyndon Zass, Melek Chaouch, Michael Thompson, Paul Olowoyo, Mamana Mbiyavanga, Faniyan Moyinoluwalogo, Gordon Wells, Victornia Nembeware, Nicola J. Mulder, Mayowa Owolabi, and H3ABioNet Consortium’s Data and Standard Working Group as members of the H3Africa Consortium
- Subjects
Stroke ,minimum information requirement guideline ,standardization ,reporting guideline ,data reporting ,H3ABioNet ,Science (General) ,Q1-390 - Abstract
The management and analyses of large datasets is one of the grand challenges of modern biomedical research. Establishing methods to harmonise and standardise data collection, reporting, sharing and the employed data dictionaries, can support the resolution of these challenges whilst improving research quality, data quality and integrity, allowing sustainable knowledge transfer through re-usability, interoperability, reproducibility. The current project aimed to develop and propose a standardised reporting guideline for stroke research and clinical data reporting. Through systematic consolidation and harmonization of published data collection and reporting standards, several recommendations were drafted for the proposed guideline. These recommendations were reviewed by domain-researchers and clinicians using an online survey, developed in REDCap. The survey was completed by 20 international stroke-specialists, majority of respondents were based in Africa (10), followed by America, Europe and Australia (10). Of these respondents; the majority were working as dual clinician-researchers (57%) with more than 10 years’ experience in the field (78%). Data elements within the reporting standard were classified as participant-, study- and experiment-level information, further subdivided into essential or optional information, and defined using existing ontologies. The proposed reporting guideline can be employed for research utility and adapted for clinical utility as well. It is accompanied with an associated XML schema for REDCap implementation, to increase the user friendliness of data capturing, sharing, reporting and governance. Ultimately, the adoption of common reporting in stroke research has the potential to ensure that researchers gain the maximum benefit from their generated data and data collections.
- Published
- 2019
- Full Text
- View/download PDF
9. Assessing computational genomics skills: Our experience in the H3ABioNet African bioinformatics network.
- Author
-
C Victor Jongeneel, Ovokeraye Achinike-Oduaran, Ezekiel Adebiyi, Marion Adebiyi, Seun Adeyemi, Bola Akanle, Shaun Aron, Efejiro Ashano, Hocine Bendou, Gerrit Botha, Emile Chimusa, Ananyo Choudhury, Ravikiran Donthu, Jenny Drnevich, Oluwadamila Falola, Christopher J Fields, Scott Hazelhurst, Liesl Hendry, Itunuoluwa Isewon, Radhika S Khetani, Judit Kumuthini, Magambo Phillip Kimuda, Lerato Magosi, Liudmila Sergeevna Mainzer, Suresh Maslamoney, Mamana Mbiyavanga, Ayton Meintjes, Danny Mugutso, Phelelani Mpangase, Richard Munthali, Victoria Nembaware, Andrew Ndhlovu, Trust Odia, Adaobi Okafor, Olaleye Oladipo, Sumir Panji, Venesa Pillay, Gloria Rendon, Dhriti Sengupta, and Nicola Mulder
- Subjects
Biology (General) ,QH301-705.5 - Abstract
The H3ABioNet pan-African bioinformatics network, which is funded to support the Human Heredity and Health in Africa (H3Africa) program, has developed node-assessment exercises to gauge the ability of its participating research and service groups to analyze typical genome-wide datasets being generated by H3Africa research groups. We describe a framework for the assessment of computational genomics analysis skills, which includes standard operating procedures, training and test datasets, and a process for administering the exercise. We present the experiences of 3 research groups that have taken the exercise and the impact on their ability to manage complex projects. Finally, we discuss the reasons why many H3ABioNet nodes have declined so far to participate and potential strategies to encourage them to do so.
- Published
- 2017
- Full Text
- View/download PDF
10. 'Broadband' Bioinformatics Skills Transfer with the Knowledge Transfer Programme (KTP): Educational Model for Upliftment and Sustainable Development.
- Author
-
Emile R Chimusa, Mamana Mbiyavanga, Velaphi Masilela, and Judit Kumuthini
- Subjects
Biology (General) ,QH301-705.5 - Abstract
A shortage of practical skills and relevant expertise is possibly the primary obstacle to social upliftment and sustainable development in Africa. The "omics" fields, especially genomics, are increasingly dependent on the effective interpretation of large and complex sets of data. Despite abundant natural resources and population sizes comparable with many first-world countries from which talent could be drawn, countries in Africa still lag far behind the rest of the world in terms of specialized skills development. Moreover, there are serious concerns about disparities between countries within the continent. The multidisciplinary nature of the bioinformatics field, coupled with rare and depleting expertise, is a critical problem for the advancement of bioinformatics in Africa. We propose a formalized matchmaking system, which is aimed at reversing this trend, by introducing the Knowledge Transfer Programme (KTP). Instead of individual researchers travelling to other labs to learn, researchers with desirable skills are invited to join African research groups for six weeks to six months. Visiting researchers or trainers will pass on their expertise to multiple people simultaneously in their local environments, thus increasing the efficiency of knowledge transference. In return, visiting researchers have the opportunity to develop professional contacts, gain industry work experience, work with novel datasets, and strengthen and support their ongoing research. The KTP develops a network with a centralized hub through which groups and individuals are put into contact with one another and exchanges are facilitated by connecting both parties with potential funding sources. This is part of the PLOS Computational Biology Education collection.
- Published
- 2015
- Full Text
- View/download PDF
11. Consent Codes: Maintaining Consent in an Ever-expanding Open Science Ecosystem.
- Author
-
Stephanie O. M. Dyke, Kathleen Connor, Victoria Nembaware, Nchangwi S. Munung, Kathy Reinold, Giselle Kerry, Mamana Mbiyavanga, Lyndon Zass, Mauricio Moldes, Samir Das, John Mike Davis, Jordi Rambla De Argila, J. Dylan Spalding, Alan C. Evans, Nicola J. Mulder, and Jason Karamchandani
- Published
- 2023
- Full Text
- View/download PDF
12. ancGWAS: a post genome-wide association study method for interaction, pathway and ancestry analysis in homogeneous and admixed populations.
- Author
-
Emile R. Chimusa, Mamana Mbiyavanga, Gaston K. Mazandu, and Nicola J. Mulder
- Published
- 2016
- Full Text
- View/download PDF
13. A-DaGO-Fun: an adaptable Gene Ontology semantic similarity-based functional analysis tool.
- Author
-
Gaston K. Mazandu, Emile R. Chimusa, Mamana Mbiyavanga, and Nicola J. Mulder
- Published
- 2016
- Full Text
- View/download PDF
14. Performance and accuracy evaluation of reference panels for genotype imputation in sub-Saharan African populations
- Author
-
Dhriti Sengupta, Gerrit Botha, Ayton Meintjes, Mamana Mbiyavanga, Scott Hazelhurst, Nicola Mulder, Michèle Ramsay, and Ananyo Choudhury
- Subjects
Genetics ,Biochemistry, Genetics and Molecular Biology (miscellaneous) - Published
- 2023
- Full Text
- View/download PDF
15. Distinct Genetic Loci and Variations in Blood Pressure and Pulse Rate in Europeans and Africans from the UK Biobank
- Author
-
Musalula Sinkala, Samar S. M. Elsheikh, Mamana Mbiyavanga, Joshua Cullinan, and Nicola J. Mulder
- Abstract
Trans-ancestral variations exist for various anthropometric traits. These traits, including cardiovascular traits, are likely associated with different loci in individuals of different ancestry groups. We assessed the difference in systolic blood pressure, diastolic blood pressure, pulse rate and maximum heart rate among African and European ancestry individuals. Furthermore, we conducted a genome-wide association study of cardiovascular traits in 383,471 Europeans and 5,978 Africans represented in the UK Biobank. Here, we report 2 and 1,202 variants associated with cardiovascular traits in Africans and Europeans, respectively. We identify 2 novel variants in Africans, including rs9388010 located in the GJA1 gene previously associated with numerous cardiomyopathies. Remarkably, we find the associated variants are primarily unique to each ancestry group and map to largely different genes. Through integrative enrichment analyses we find that gene sets within each ancestral group are significantly enriched for pathways and phenotypes related to cardiovascular physiology and disease. Our discoveries provide a better understanding of variations in cardiovascular traits and the different associated variants in Africans and Europeans.
- Published
- 2022
- Full Text
- View/download PDF
16. Consent Codes: Maintaining Consent in an Ever-expanding Open Science Ecosystem
- Author
-
Stephanie O. M. Dyke, Kathleen Connor, Victoria Nembaware, Nchangwi S. Munung, Kathy Reinold, Giselle Kerry, Mamana Mbiyavanga, Lyndon Zass, Mauricio Moldes, Samir Das, John M. Davis, Jordi Rambla De Argila, J. Dylan Spalding, Alan C. Evans, Nicola Mulder, and Jason Karamchandani
- Subjects
Consent ,Ethics ,General Neuroscience ,Data sharing ,Open science ,Data access ,Data management ,Software ,Information Systems - Abstract
We previously proposed a structure for recording consent-based data use 'categories' and 'requirements' - Consent Codes - with a view to supporting maximum use and integration of genomic research datasets, and reducing uncertainty about permissible re-use of shared data. Here we discuss clarifications and subsequent updates to the Consent Codes (v4) based on new areas of application (e.g., the neurosciences, biobanking, H3Africa), policy developments (e.g., return of research results), and further practical considerations, including developments in automated approaches to consent management. SOMD, SD, ACE and JK were supported by The Neuro Tanenbaum Open Science Institute, the Canadian Open Neuroscience Platform (funded in part by Brain Canada), and McGill Healthy Brains for Healthy Lives. NM and LZ are funded by the NIH under grant number U24HG006941. MM is funded by EUH2020 CINECA grant number 825775. NM, VN and NSM are funded by the NHLBI award number U24HL135600. JDS and GK are funded by the Wellcome Trust grant 360G-Wellcome-201535_Z_16_Z and previously the EU H2020 Corbel grant number 645248.
- Published
- 2022
17. Genome-wide Association Study of Pulmonary Function in Europeans and Africans from the UK Biobank Identifies Distinct Variants
- Author
-
Musalula Sinkala, Samar S. M. Elsheikh, Mamana Mbiyavanga, Joshua Cullinan, and Nicola J. Mulder
- Abstract
Pulmonary function is an indicator of well-being, and pulmonary pathologies are the third major cause of death worldwide. FEV1, FVC, and PEF are quantitively used to assess pulmonary function. We conducted a genome-wide association analysis of pulmonary function in 383,471 individuals of European and 5,978 African descent represented in the UK Biobank. Here, we report 817 variants in Europeans and 3 in Africans associated (p-values < 5 × 10−8) with three pulmonary function parameters; FEV1, FVC and PEF. In addition to 377 variants in Europeans previously reported to be associated with phenotypes related to pulmonary function, we identified 330 novel loci, including an ISX intergenic variant rs369476290 on chromosome 22 in Africans and a KDM2A intron variant rs12790261 on chromosome 11 in Europeans. Remarkably, we find no shared variants among Africans and Europeans. Enrichment analyses of variants separately for each ancestry background revealed significant enrichment for terms related to pulmonary phenotypes in Europeans but not Africans. Further analysis of studies of pulmonary phenotypes revealed individuals of European background are disproportionally overrepresented in datasets compared to Africans, with the gap widening over the past five years. Our findings offer a better understanding of the different variants that modify pulmonary function in Africans and Europeans, a significant finding for future GWAS studies and medicine.
- Published
- 2022
- Full Text
- View/download PDF
18. The extent and impact of variation in ADME genes in sub-Saharan African populations
- Author
-
Houcemeddine Othman, Sandra Baldwin, Clement Adebamowo, Britt I. Drögemöller, Pamela Thomas, Charles D. Cox, Annette S. Gross, Jorge da Rocha, Samah Ahmed, Gustave Simo, Caroline T. Tiemessen, Sumir Panji, Martin Simuunza, Laura Cottino, Mamana Mbiyavanga, Gerrit Botha, Francisco-Javier Gamo, Michèle Ramsay, Mathias Chiano, Mogomotsi Matshaba, David Twesigomwe, Scott Hazelhurst, Philip Machanick, Faisal M. Fadlelmola, and Galen E B Wright
- Subjects
Genetic diversity ,Drug development ,Evolutionary biology ,Pharmacogenomics ,Genetic variation ,Copy-number variation ,Biology ,Genotyping ,Genome ,ADME - Abstract
Investigating variation in genes involved in theabsorption, distribution, metabolism, andexcretion(ADME) of drugs are key to characterising pharmacogenomic (PGx) relationships. ADME gene variation is relatively well characterised in European and Asian populations, but African populations are under-studied – which has implications for safe and effective drug use in Africa.The genetic diversity of ADME genes across sub-Saharan African populations is large. The Southern African population cluster is most distinct from that of far West Africa. PGx strategies based on European variants will be of limited use in African populations.Although established variants are important, PGx must take into account the full range of African variation. This work urges further characterisation of variants in African populations includingin vitroandin silicostudies, and to consider the unique African ADME landscape when developing precision medicine guidelines and tools for African populations.Author summaryThe ADME genes are a group of genes that play a key role in absorption, distribution, metabolism and excretion of drugs. Variations in these genes can have a significant impact on drug safety and efficacy. Africa has a high level of genetic variation and is under-studied in drug development, which makes study of variations in these genes in African populations very important. Using a new data set of 458 high-coverage genomes from across Africa, we characterise the extent and impact of variation in the ADME genes, looking at both single nucleotide and copy number variations. We identified 343,368 variants, including 40,692 novel variants, and 930 coding variants which are predicted to have high impact on function. Our discovery curves indicate that there will be considerable value in sequencing more African genomes. Moreover, relatively few of these novel variants are captured on common genotyping arrays. We show that there is considerable diversity within Africa in some important genes, and this will have significant consequences for the emerging field of precision medicine in Africa.
- Published
- 2020
- Full Text
- View/download PDF
19. Author Correction: High-depth African genomes inform human migration and health
- Author
-
Trust Odia, Judit Kumuthini, Shaun Aron, Yasmina Jaufeerally Fakim, Donna M. Muzny, Anisah W. Ghoorah, Charles N. Rotimi, Oscar A. Nyangiri, Gerrit Botha, Sally N. Adebamowo, Neil A. Hanchard, Alia Benkahla, Emile R. Chimusa, Laura R. Botigué, Oluwadamilare Falola, Ananyo Choudhury, Nicola Mulder, Samar K. Kassim, Eileen Dareng, Scott Hazelhurst, Mamana Mbiyavanga, Zané Lombard, Gaston K. Mazandu, Ezekiel Adebiyi, Richard A. Gibbs, Ginger A. Metcalf, Taoufik Bensellak, Daniel Shriner, Michèle Ramsay, Adebowale Adeyemo, Gordon Wells, and Dhriti Sengupta
- Subjects
Multidisciplinary ,Human evolutionary genetics ,Human migration ,business.industry ,Published Erratum ,MEDLINE ,Computational biology ,Biology ,Genome ,Evolutionary genetics ,DNA sequencing ,Genetics research ,Genetic variation ,Next-generation sequencing ,Author Correction ,business - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41586-021-03286-9.
- Published
- 2021
- Full Text
- View/download PDF
20. Proceedings of a Sickle Cell Disease Ontology workshop — Towards the first comprehensive ontology for Sickle Cell Disease
- Author
-
Catherine Chunda-Liyoka, Melissa A. Haendel, Sumir Panji, Kwaku Ohene-Frempong, Marsha Treadwell, Kenneth Opap, Philomene Lopez-Sall, Simon Jupp, Kofi A. Anie, Bamidele O. Tayo, Vimal K. Derebail, Kais Ghedira, Karen Kengne Kamga, Solomon F. Ofori-Acquah, Andrew D. Campbell, Raphael Z. Sangeda, Adekunle Adekile, Furahini Chinenere, Muntaser E. Ibrahim, Neil A. Hanchard, Damian Nirenberg, Deogratias Munube, Carol Hamilton, Nicola Mulder, Mamana Mbiyavanga, Biobele J. Brown, Jennifer Knight-Madden, Léon Tshilolo, Victoria Nembaware, Baba Inusa, Charmaine D.M. Royal, Miriam Park, Obiageli E Nnodu, Wayne Huggins, Ambroise Wonkam, Gift D. Pule, Amy Geard, University of Cape Town, Kuwait University, Imperial College London, Evelina London Children's Hospital, University of Ibadan, University of Michigan [Ann Arbor], University of Michigan System, University of Dar es Salaam (UDSM), University Teaching Hospital [Lusaka] (UTH), University of Zambia [Lusaka] (UNZA), University of North Carolina [Chapel Hill] (UNC), University of North Carolina System (UNC), Laboratoire de Parasitologie Médicale, Biotechnologies et Biomolécules (LR11IPT06), Institut Pasteur de Tunis, Réseau International des Instituts Pasteur (RIIP)-Réseau International des Instituts Pasteur (RIIP), Research Triangle Institute International (RTI International), Baylor College of Medicine (BCM), Baylor University, Oregon Health and Science University [Portland] (OHSU), University of Khartoum, European Bioinformatics Institute [Hinxton] (EMBL-EBI), EMBL Heidelberg, University of Yaoundé [Cameroun], The University of the West Indies, Université Cheikh Anta Diop [Dakar, Sénégal] (UCAD), Makerere University College of Health Science [Kampala] (CHS), Makerere University [Kampala, Ouganda] (MAK), University of Abuja, University of Pittsburgh (PITT), Pennsylvania Commonwealth System of Higher Education (PCSHE), Children’s Hospital of Philadelphia (CHOP ), University of São Paulo (USP), Duke University [Durham], Loyola University [Chicago], UCSF Benioff Children's Hospital Oakland, University of California [San Francisco] (UCSF), University of California-University of California, Centre de formation et d’appui sanitaire de Monkole (CEFA-MONKOLE), The workshop was funded by the National Heart, Lung, and Blood Institute (NHLBI), National Institutes of Health (NIH) as a supplement to the H3ABioNet grant. H3ABioNet is supported by the National Human Genome Research Institute (NHGRI), Office of the Director (OD), NIH under award number U41HG006941. The content of this report is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. Funding for two of the participants was provided by the National Human Genome Research Institute and the National Institute on Drug Abuse Genomic Resource Award: U41 HG007050 and the National Heart, Lung, and Blood Institute Supplement: U41HG007050-02S4., and We would like to thank Jade Hotchkiss who started the initial work on the ontology.
- Subjects
0301 basic medicine ,medicine.medical_specialty ,congenital, hereditary, and neonatal diseases and abnormalities ,lcsh:QH426-470 ,[SDV]Life Sciences [q-bio] ,Pharmaceutical Science ,Single gene ,Disease ,Ontology (information science) ,Bioinformatics ,Article ,03 medical and health sciences ,Quality of life (healthcare) ,Rare Diseases ,Disease Ontology ,hemic and lymphatic diseases ,medicine ,Intensive care medicine ,Molecular Biology ,Pediatric ,Sickle Cell Disease ,business.industry ,Pain Research ,Hematology ,3. Good health ,lcsh:Genetics ,030104 developmental biology ,Orphan Drug ,Good Health and Well Being ,business ,Biotechnology - Abstract
International audience; Sickle cell disease (SCD) is a debilitating single gene disorder caused by a single point mutation that results in physical deformation (i.e. sickling) of erythrocytes at reduced oxygen tensions. Up to 75% of SCD in newborns world-wide occurs in sub-Saharan Africa, where neonatal and childhood mortality from sickle cell related complications is high. While SCD research across the globe is tackling the disease on multiple fronts, advances have yet to significantly impact on the health and quality of life of SCD patients, due to lack of coordination of these disparate efforts. Ensuring data across studies is directly comparable through standardization is a necessary step towards realizing this goal. Such a standardization requires the development and implementation of a disease-specific ontology for SCD that is applicable globally. Ontology development is best achieved by bringing together experts in the domain to contribute their knowledge.The SCD community and H3ABioNet members joined forces at a recent SCD Ontology workshop to develop an ontology covering aspects of SCD under the classes: phenotype, diagnostics, therapeutics, quality of life, disease modifiers and disease stage. The aim of the workshop was for participants to contribute their expertise to development of the structure and contents of the SCD ontology. Here we describe the proceedings of the Sickle Cell Disease Ontology Workshop held in Cape Town South Africa in February 2016 and its outcomes. The objective of the workshop was to bring together experts in SCD from around the world to contribute their expertise to the development of various aspects of the SCD ontology.
- Published
- 2016
- Full Text
- View/download PDF
21. Minimum Information required for a DMET Experiment reporting
- Author
-
Ron H.N. van Schaik, Clint Mizzi, Milan Macek, Panu Somervuo, Vita Dolzan, Mamana Mbiyavanga, Judit Kumuthini, Raj Ramesar, Jyotishman Pathak, Alessio Squassina, George P. Patrinos, Emile R. Chimusa, Kusha Kalideen, Clinical Chemistry, and Pathology
- Subjects
0301 basic medicine ,Standardization ,computer.internet_protocol ,Computer science ,Interoperability ,Population ,Information Dissemination ,Bioinformatics ,03 medical and health sciences ,Genetics ,Humans ,XML schema ,Data reporting ,education ,computer.programming_language ,Pharmacology ,education.field_of_study ,Data science ,Data sharing ,030104 developmental biology ,Pharmacogenetics ,Research Design ,Data Interpretation, Statistical ,Molecular Medicine ,computer ,XML ,Research Article - Abstract
Aim: To provide pharmacogenomics reporting guidelines, the information and tools required for reporting to public omic databases. Material & methods: For effective DMET data interpretation, sharing, interoperability, reproducibility and reporting, we propose the Minimum Information required for a DMET Experiment (MIDE) reporting. Results: MIDE provides reporting guidelines and describes the information required for reporting, data storage and data sharing in the form of XML. Conclusion: The MIDE guidelines will benefit the scientific community with pharmacogenomics experiments, including reporting pharmacogenomics data from other technology platforms, with the tools that will ease and automate the generation of such reports using the standardized MIDE XML schema, facilitating the sharing, dissemination, reanalysis of datasets through accessible and transparent pharmacogenomics data reporting.
- Published
- 2016
22. Assessing computational genomics skills: Our experience in the H3ABioNet African bioinformatics network
- Author
-
Shaun Aron, Mamana Mbiyavanga, Lerato E Magosi, Efejiro Ashano, Christopher J. Fields, C. Victor Jongeneel, Danny Mugutso, Phelelani T. Mpangase, Sumir Panji, Venesa Pillay, Seun Adeyemi, Adaobi Okafor, Oluwadamila Falola, Hocine Bendou, Ananyo Choudhury, Olaleye Oladipo, Ezekiel Adebiyi, Radhika S. Khetani, Ovokeraye Achinike-Oduaran, Bola Akanle, Richard J. Munthali, Suresh Maslamoney, Ayton Meintjes, Gloria Rendon, Nicola Mulder, Trust Odia, Andrew Ndhlovu, Ravikiran Donthu, Itunuoluwa Isewon, Liesl M. Hendry, Emile R. Chimusa, Jenny Drnevich, Judit Kumuthini, Magambo Phillip Kimuda, Scott Hazelhurst, Liudmila Sergeevna Mainzer, Marion O. Adebiyi, Victoria Nembaware, Dhriti Sengupta, and Gerrit Botha
- Subjects
0301 basic medicine ,Service (systems architecture) ,Computer science ,Data management ,Social Sciences ,Bioinformatics ,Database and Informatics Methods ,South Africa ,Sociology ,Databases, Genetic ,Medicine and Health Sciences ,Public and Occupational Health ,Biology (General) ,Ecology ,Health services research ,Genomics ,Research Assessment ,Sports Science ,3. Good health ,Test (assessment) ,Professions ,Computational Theory and Mathematics ,Modeling and Simulation ,Workshops ,Health Services Research ,QH301-705.5 ,Process (engineering) ,Developing country ,Black People ,Nigeria ,Research and Analysis Methods ,Education ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,Genome-Wide Association Studies ,Genetics ,Humans ,Sports and Exercise Medicine ,Molecular Biology ,Exercise ,Developing Countries ,Ecology, Evolution, Behavior and Systematics ,business.industry ,Computational genomics ,Biology and Life Sciences ,Computational Biology ,Human Genetics ,Physical Activity ,Genome Analysis ,Data science ,Health Care ,030104 developmental biology ,Physical Fitness ,People and Places ,Scientists ,Database Management Systems ,Population Groupings ,business - Abstract
The H3ABioNet pan-African bioinformatics network, which is funded to support the Human Heredity and Health in Africa (H3Africa) program, has developed node-assessment exercises to gauge the ability of its participating research and service groups to analyze typical genome-wide datasets being generated by H3Africa research groups. We describe a framework for the assessment of computational genomics analysis skills, which includes standard operating procedures, training and test datasets, and a process for administering the exercise. We present the experiences of 3 research groups that have taken the exercise and the impact on their ability to manage complex projects. Finally, we discuss the reasons why many H3ABioNet nodes have declined so far to participate and potential strategies to encourage them to do so., Author summary Many programs have been developed to boost the technical and computational skills of scientists working in low to medium income countries (LMIC), who often struggle to remain competitive with their peers in more developed parts of the world. Typically, these programs rely on intensive workshops where students acquire and exercise these skills under the supervision of experienced trainers. However, when trainees return to their home institutions, even after extensive exposure to state of the art techniques, they often find it difficult to put the skills they have acquired into practice and to establish themselves as fully independent practitioners. We have attempted to build a framework through which teams of scientists in African research groups can demonstrate that they have acquired the necessary skills to analyze different types of genomic datasets. Three teams of scientists who have successfully submitted to this assessment exercise report their positive experiences. Many potential participants have so far declined the opportunity, and we discuss the reasons for their reluctance as well as possible ways to facilitate their engagement and provide them with incentives. We argue that assessments such as this could be part of any program aiming to develop technical skills in scientists wishing to support genomic research programs.
- Published
- 2017
23. Proposed minimum information guideline for kidney disease—research and clinical data reporting: a cross-sectional study
- Author
-
Chirag Patel, Zahra Mungloo-Dilmohamud, Michael Thompson, Lyndon Zass, Mamana Mbiyavanga, Melek Chaouch, Katherine Johnston, Shakuntala Baichoo, Christiaan van Woerden, Nicola Mulder, Judit Kumuthini, Andrew Mallett, and Academic Medical Center
- Subjects
Biomedical Research ,Knowledge management ,Electronic data capture ,kidney disease ,Big data ,MEDLINE ,Health Informatics ,Guidelines as Topic ,reporting guideline ,Ontology (information science) ,Translational Research, Biomedical ,03 medical and health sciences ,0302 clinical medicine ,Humans ,Medicine ,030212 general & internal medicine ,Data reporting ,data reporting ,Original Research ,FAIR ,Data collection ,business.industry ,Reproducibility of Results ,General Medicine ,Guideline ,data standardisation ,Nephrology ,Research Design ,13. Climate action ,Data quality ,Kidney Diseases ,H3ABioNet ,business ,030217 neurology & neurosurgery - Abstract
ObjectiveThis project aimed to develop and propose a standardised reporting guideline for kidney disease research and clinical data reporting, in order to improve kidney disease data quality and integrity, and combat challenges associated with the management and challenges of ‘Big Data’.MethodsA list of recommendations was proposed for the reporting guideline based on the systematic review and consolidation of previously published data collection and reporting standards, including PhenX measures and Minimal Information about a Proteomics Experiment (MIAPE). Thereafter, these recommendations were reviewed by domain-specialists using an online survey, developed in Research Electronic Data Capture (REDCap). Following interpretation and consolidation of the survey results, the recommendations were mapped to existing ontologies using Zooma, Ontology Lookup Service and the Bioportal search engine. Additionally, an associated eXtensible Markup Language schema was created for the REDCap implementation to increase user friendliness and adoption.ResultsThe online survey was completed by 53 respondents; the majority of respondents were dual clinician-researchers (57%), based in Australia (35%), Africa (33%) and North America (22%). Data elements within the reporting standard were identified as participant-level, study-level and experiment-level information, further subdivided into essential or optional information.ConclusionThe reporting guideline is readily employable for kidney disease research projects, and also adaptable for clinical utility. The adoption of the reporting guideline in kidney disease research can increase data quality and the value for long-term preservation, ensuring researchers gain the maximum benefit from their collected and generated data.
- Published
- 2019
- Full Text
- View/download PDF
24. Organizing and running bioinformatics hackathons within Africa: The H3ABioNet cloud computing experience
- Author
-
Michael R. Crusoe, Liudmila Sergeevna Mainzer, Eugene de Beste, Ayton Meintjes, Gerrit Botha, Azza Ahmed, Nicola Mulder, Fourie Joubert, Shaun Aron, Don Armstrong, Sumir Panji, Shakuntala Baichoo, Hocine Bendou, Scott Hazelhurst, C. Victor Jongeneel, Peter van Heusden, Faisal M. Fadlelmola, Mamana Mbiyavanga, Phelelani T. Mpangase, Oussema Souiai, Brian O'Connor, Yassine Souilmi, Long Yi, Mustafa Alghali, and Jennie Zermeno
- Subjects
0301 basic medicine ,Bioinformatics ,workflow ,business.industry ,capacity building ,Applied Mathematics ,pipeline ,Capacity building ,Symmetric multiprocessor system ,Cloud computing ,Articles ,Method Article ,Pipeline (software) ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Workflow ,Open research ,reproducible ,hackathon ,Container (abstract data type) ,Applied research ,business ,030217 neurology & neurosurgery - Abstract
The need for portable and reproducible genomics analysis pipelines is growing globally as well as in Africa, especially with the growth of collaborative projects like the Human Health and Heredity in Africa Consortium (H3Africa). The Pan-African H3Africa Bioinformatics Network (H3ABioNet) recognized the need for portable, reproducible pipelines adapted to heterogeneous computing environments, and for the nurturing of technical expertise in workflow languages and containerization technologies. Building on the network’s Standard Operating Procedures (SOPs) for common genomic analyses, H3ABioNet arranged its first Cloud Computing and Reproducible Workflows Hackathon in 2016, with the purpose of translating those SOPs into analysis pipelines able to run on heterogeneous computing environments and meeting the needs of H3Africa research projects. This paper describes the preparations for this hackathon and reflects upon the lessons learned about its impact on building the technical and scientific expertise of African researchers. The workflows developed were made publicly available in GitHub repositories and deposited as container images on Quay.io.
- Published
- 2019
- Full Text
- View/download PDF
25. Proposed Guideline for Minimum Information Stroke Research and Clinical Data Reporting
- Author
-
Mamana Mbiyavanga, Faniyan Moyinoluwalogo, Melek Chaouch, Gordon Wells, Michael Thompson, Paul Olowoyo, Victoria Nembaware, H ABioNet Consortium’s Data, Mayowa O. Owolabi, Nicola Mulder, Lyndon Zass, and Judit Kumuthini
- Subjects
Knowledge management ,010504 meteorology & atmospheric sciences ,Standardization ,Computer science ,reporting guideline ,01 natural sciences ,Computer Science (miscellaneous) ,Data reporting ,lcsh:Science (General) ,data reporting ,0105 earth and related environmental sciences ,standardization ,Data collection ,Stroke ,minimum information requirement guideline ,H3ABioNet ,business.industry ,05 social sciences ,Guideline ,Data dictionary ,Bioinformatics, Biomedical Science, Data Science ,Computer Science Applications ,XML Schema (W3C) ,Data quality ,0509 other social sciences ,050904 information & library sciences ,business ,Knowledge transfer ,lcsh:Q1-390 - Abstract
The management and analyses of large datasets is one of the grand challenges of modern biomedical research. Establishing methods to harmonise and standardise data collection, reporting, sharing and the employed data dictionaries, can support the resolution of these challenges whilst improving research quality, data quality and integrity, allowing sustainable knowledge transfer through re-usability, interoperability, reproducibility. The current project aimed to develop and propose a standardised reporting guideline for stroke research and clinical data reporting. Through systematic consolidation and harmonization of published data collection and reporting standards, several recommendations were drafted for the proposed guideline. These recommendations were reviewed by domain-researchers and clinicians using an online survey, developed in REDCap. The survey was completed by 20 international stroke-specialists, majority of respondents were based in Africa (10), followed by America, Europe and Australia (10). Of these respondents; the majority were working as dual clinician-researchers (57%) with more than 10 years’ experience in the field (78%). Data elements within the reporting standard were classified as participant-, study- and experiment-level information, further subdivided into essential or optional information, and defined using existing ontologies. The proposed reporting guideline can be employed for research utility and adapted for clinical utility as well. It is accompanied with an associated XML schema for REDCap implementation, to increase the user friendliness of data capturing, sharing, reporting and governance. Ultimately, the adoption of common reporting in stroke research has the potential to ensure that researchers gain the maximum benefit from their generated data and data collections.
- Published
- 2019
- Full Text
- View/download PDF
26. A-DaGO-Fun: an adaptable Gene Ontology semantic similarity-based functional analysis tool
- Author
-
Gaston K. Mazandu, Mamana Mbiyavanga, Emile R. Chimusa, and Nicola Mulder
- Subjects
0301 basic medicine ,Statistics and Probability ,Databases, Factual ,Computer science ,0206 medical engineering ,02 engineering and technology ,computer.software_genre ,Biochemistry ,Open Biomedical Ontologies ,03 medical and health sciences ,Text mining ,Semantic similarity ,Knowledge extraction ,Humans ,Molecular Biology ,Information retrieval ,Database ,business.industry ,Gene ontology ,Ontology-based data integration ,Computational Biology ,Proteins ,Molecular Sequence Annotation ,Applications Notes ,Semantics ,Computer Science Applications ,Computational Mathematics ,Gene Ontology ,030104 developmental biology ,Genes ,Computational Theory and Mathematics ,business ,computer ,Software ,020602 bioinformatics - Abstract
Summary: Gene Ontology (GO) semantic similarity measures are being used for biological knowledge discovery based on GO annotations by integrating biological information contained in the GO structure into data analyses. To empower users to quickly compute, manipulate and explore these measures, we introduce A-DaGO-Fun (ADaptable Gene Ontology semantic similarity-based Functional analysis). It is a portable software package integrating all known GO information content-based semantic similarity measures and relevant biological applications associated with these measures. A-DaGO-Fun has the advantage not only of handling datasets from the current high-throughput genome-wide applications, but also allowing users to choose the most relevant semantic similarity approach for their biological applications and to adapt a given module to their needs. Availability and implementation: A-DaGO-Fun is freely available to the research community at http://web.cbio.uct.ac.za/ITGOM/adagofun. It is implemented in Linux using Python under free software (GNU General Public Licence). Contact: gmazandu@cbio.uct.ac.za or Nicola.Mulder@uct.ac.za Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2015
- Full Text
- View/download PDF
27. 'Broadband' Bioinformatics Skills Transfer with the Knowledge Transfer Programme (KTP): Educational Model for Upliftment and Sustainable Development
- Author
-
Judit Kumuthini, Velaphi Masilela, Emile R. Chimusa, Mamana Mbiyavanga, Institute of Infectious Disease and Molecular Medicine, and Faculty of Health Sciences
- Subjects
Conservation of Natural Resources ,Models, Educational ,Bioinformatics ,Population genetics ,Computer science ,0206 medical engineering ,Population ,02 engineering and technology ,Education ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,Multidisciplinary approach ,Genomic medicine ,Genetics ,Humans ,education ,Molecular Biology ,lcsh:QH301-705.5 ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,Sustainable development ,Internet ,0303 health sciences ,education.field_of_study ,Ecology ,business.industry ,4. Education ,Computational Biology ,Genomics ,Natural resource ,Work experience ,Computational Theory and Mathematics ,Work (electrical) ,lcsh:Biology (General) ,Modeling and Simulation ,Africa ,Scientists ,The Internet ,business ,Knowledge transfer ,Genome complexity ,020602 bioinformatics ,Africans - Abstract
A shortage of practical skills and relevant expertise is possibly the primary obstacle to social upliftment and sustainable development in Africa. The “omics” fields, especially genomics, are increasingly dependent on the effective interpretation of large and complex sets of data. Despite abundant natural resources and population sizes comparable with many first-world countries from which talent could be drawn, countries in Africa still lag far behind the rest of the world in terms of specialized skills development. Moreover, there are serious concerns about disparities between countries within the continent. The multidisciplinary nature of the bioinformatics field, coupled with rare and depleting expertise, is a critical problem for the advancement of bioinformatics in Africa. We propose a formalized matchmaking system, which is aimed at reversing this trend, by introducing the Knowledge Transfer Programme (KTP). Instead of individual researchers travelling to other labs to learn, researchers with desirable skills are invited to join African research groups for six weeks to six months. Visiting researchers or trainers will pass on their expertise to multiple people simultaneously in their local environments, thus increasing the efficiency of knowledge transference. In return, visiting researchers have the opportunity to develop professional contacts, gain industry work experience, work with novel datasets, and strengthen and support their ongoing research. The KTP develops a network with a centralized hub through which groups and individuals are put into contact with one another and exchanges are facilitated by connecting both parties with potential funding sources. This is part of the PLOS Computational Biology Education collection.
- Published
- 2015
28. ancGWAS: a post genome-wide association study method for interaction, pathway and ancestry analysis in homogeneous and admixed populations
- Author
-
Nicola Mulder, Mamana Mbiyavanga, Gaston K. Mazandu, and Emile R. Chimusa
- Subjects
0301 basic medicine ,Statistics and Probability ,Linkage disequilibrium ,Population ,Gene regulatory network ,Single-nucleotide polymorphism ,Genome-wide association study ,Breast Neoplasms ,Biology ,Biochemistry ,Polymorphism, Single Nucleotide ,Linkage Disequilibrium ,03 medical and health sciences ,Human interactome ,Protein Interaction Mapping ,Humans ,Gene Regulatory Networks ,Genetic Predisposition to Disease ,education ,Molecular Biology ,Genetic association ,Genetics ,education.field_of_study ,nutritional and metabolic diseases ,Original Papers ,Computer Science Applications ,Computational Mathematics ,030104 developmental biology ,Genetics, Population ,Computational Theory and Mathematics ,Epistasis ,Female ,Software ,Genome-Wide Association Study ,Signal Transduction - Abstract
Motivation: Despite numerous successful Genome-wide Association Studies (GWAS), detecting variants that have low disease risk still poses a challenge. GWAS may miss disease genes with weak genetic effects or strong epistatic effects due to the single-marker testing approach commonly used. GWAS may thus generate false negative or inconclusive results, suggesting the need for novel methods to combine effects of single nucleotide polymorphisms within a gene to increase the likelihood of fully characterizing the susceptibility gene. Results: We developed ancGWAS, an algebraic graph-based centrality measure that accounts for linkage disequilibrium in identifying significant disease sub-networks by integrating the association signal from GWAS data sets into the human protein–protein interaction (PPI) network. We validated ancGWAS using an association study result from a breast cancer data set and the simulation of interactive disease loci in the simulation of a complex admixed population, as well as pathway-based GWAS simulation. This new approach holds promise for deconvoluting the interactions between genes underlying the pathogenesis of complex diseases. Results obtained yield a novel central breast cancer sub-network of the human interactome implicated in the proteoglycan syndecan-mediated signaling events pathway which is known to play a major role in mesenchymal tumor cell proliferation, thus providing further insights into breast cancer pathogenesis. Availability and implementation: The ancGWAS package and documents are available at http://www.cbio.uct.ac.za/~emile/software.html Contact: emile.chimusa@uct.ac.za, Nicola.Mulder@uct.ac.za Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2015
29. Development of Bioinformatics Infrastructure for Genomics Research
- Author
-
Nicola J. Mulder, Ezekiel Adebiyi, Marion Adebiyi, Seun Adeyemi, Azza Ahmed, Rehab Ahmed, Bola Akanle, Mohamed Alibi, Don L. Armstrong, Shaun Aron, Efejiro Ashano, Shakuntala Baichoo, Alia Benkahla, David K. Brown, Emile R. Chimusa, Faisal M. Fadlelmola, Dare Falola, Segun Fatumo, Kais Ghedira, Amel Ghouila, Scott Hazelhurst, Itunuoluwa Isewon, Segun Jung, Samar Kamal Kassim, Jonathan K. Kayondo, Mamana Mbiyavanga, Ayton Meintjes, Somia Mohammed, Abayomi Mosaku, Ahmed Moussa, Mustafa Muhammd, Zahra Mungloo-Dilmohamud, Oyekanmi Nashiru, Trust Odia, Adaobi Okafor, Olaleye Oladipo, Victor Osamor, Jellili Oyelade, Khalid Sadki, Samson Pandam Salifu, Jumoke Soyemi, Sumir Panji, Fouzia Radouani, Oussama Souiai, Özlem Tastan Bishop, The HABioNet Consortium, as Members of the HAfrica Consortium, University of Cape Town, Department of Computer and Information Sciences, Covenant University, Covenant University Bioinformatics Research (CUBRe), University of Khartoum, Laboratoire de Bioinformatique, biomathématiques, biostatistiques (BIMS) (LR11IPT09), Institut Pasteur de Tunis, Réseau International des Instituts Pasteur (RIIP)-Réseau International des Instituts Pasteur (RIIP)-Université de Tunis El Manar (UTM), University of Illinois at Urbana-Champaign [Urbana], University of Illinois System, University of the Witwatersrand [Johannesburg] (WITS), Federal Ministry of Science and Technology [Abuja] (FMST), University of Mauritius, Rhodes University, Grahamstown, Institute of Infectious Diseases and Molecular Medicine (IDM), Future University of Sudan, Laboratoire de Transmission, Contrôle et Immunobiologie des Infections - Laboratory of Transmission, Control and Immunobiology of Infection (LR11IPT02), Réseau International des Instituts Pasteur (RIIP)-Réseau International des Instituts Pasteur (RIIP), Computation Institute [Chicago], University of Chicago, Université Ain Shams, Uganda Virus Research Institute (UVRI), Laboratoire des Technologies de l'Information et de la Communication [Tanger] (Labtic), Ecole Nationale des Sciences Appliquées [Tanger] (ENSAT), Landmark University [Omu-Aran], Université Mohammed V, Kwame Nkrumah University of Science and Technology [GHANA] (KNUST), École polytechnique fédérale d'Ilaro, Institut Pasteur du Maroc, Réseau International des Instituts Pasteur (RIIP), and H3ABioNet is supported by the National Institutes of Health Common Fund (grant number U41HG006941)
- Subjects
0301 basic medicine ,MESH: Genomics/methods ,Epidemiology ,Computer science ,[SDV]Life Sciences [q-bio] ,media_common.quotation_subject ,Genomics ,MESH: Africa ,Bioinformatics ,Data type ,03 medical and health sciences ,0302 clinical medicine ,Excellence ,Controlled vocabulary ,media_common ,MESH: Computational Biology/trends ,Community and Home Care ,Spatial data infrastructure ,MESH: Humans ,Data collection ,MESH: Biomedical Research/methods ,Data science ,Metadata ,030104 developmental biology ,Workflow ,Cardiology and Cardiovascular Medicine ,030217 neurology & neurosurgery - Abstract
Background: Although pockets of bioinformatics excellence have developed in Africa, generally, large-scale genomic data analysis has been limited by the availability of expertise and infrastructure. H3ABioNet, a pan-African bioinformatics network, was established to build capacity specifically to enable H3Africa (Human Heredity and Health in Africa) researchers to analyze their data in Africa. Since the inception of the H3Africa initiative, H3ABioNet’s role has evolved in response to changing needs from the consortium and the African bioinformatics community.Objectives: H3ABioNet set out to develop core bioinformatics infrastructure and capacity for genomics research in various aspects of data collection, transfer, storage, and analysis.Methods and Results: Various resources have been developed to address genomic data management and analysis needs of H3Africa researchers and other scientific communities on the continent. NetMap was developed and used to build an accurate picture of network performance within Africa and between Africa and the rest of the world, and Globus Online has been rolled out to facilitate data transfer. A participant recruitment database was developed to monitor participant enrollment, and data is being harmonized through the use of ontologies and controlled vocabularies. The standardized metadata will be integrated to provide a search facility for H3Africa data and biospecimens. Because H3Africa projects are generating large-scale genomic data, facilities for analysis and interpretation are critical. H3ABioNet is implementing several data analysis platforms that provide a large range of bioinformatics tools or workflows, such as Galaxy, the Job Management System, and eBiokits. A set of reproducible, portable, and cloud-scalable pipelines to support the multiple H3Africa data types are also being developed and dockerized to enable execution on multiple computing infrastructures. In addition, new tools have been developed for analysis of the uniquely divergent African data and for downstream interpretation of prioritized variants. To provide support for these and other bioinformatics queries, an online bioinformatics helpdesk backed by broad consortium expertise has been established. Further support is provided by means of various modes of bioinformatics training.Conclusions: For the past 4 years, the development of infrastructure support and human capacity through H3ABioNet, have significantly contributed to the establishment of African scientific networks, data analysis facilities, and training programs. Here, we describe the infrastructure and how it has affected genomics and bioinformatics research in Africa.HighlightsH3ABioNet is building capacity to enable analysis of genomic data in Africa.Infrastructure has been built for clinical and genomic data storage, management, and analysis.New algorithms and pipelines for African genomic data analysis have been developed.Data are being harmonized using ontologies to enable easy search and retrieval.Genomics training is implemented using various online and face-to-face approaches.
- Published
- 2017
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.