28 results on '"Shaun Aron"'
Search Results
2. Functional characterisation of the transcriptome from leaf tissue of the fluoroacetate-producing plant, Dichapetalum cymosum, in response to mechanical wounding
- Author
-
Selisha A. Sooklal, Phelelani T. Mpangase, Mihai-Silviu Tomescu, Shaun Aron, Scott Hazelhurst, Robert H. Archer, and Karl Rumbold
- Subjects
Medicine ,Science - Abstract
Abstract Dichapetalum cymosum produces the toxic fluorinated metabolite, fluoroacetate, presumably as a defence mechanism. Given the rarity of fluorinated metabolites in nature, the biosynthetic origin and function of fluoroacetate have been of particular interest. However, the mechanism for fluorination in D. cymosum was never elucidated. More importantly, there is a severe lack in knowledge on a genetic level for fluorometabolite-producing plants, impeding research on the subject. Here, we report on the first transcriptome for D. cymosum and investigate the wound response for insights into fluorometabolite production. Mechanical wounding studies were performed and libraries of the unwounded (control) and wounded (30 and 60 min post wounding) plant were sequenced using the Illumina HiSeq platform. A combined reference assembly generated 77,845 transcripts. Using the SwissProt, TrEMBL, GO, eggNOG, KEGG, Pfam, EC and PlantTFDB databases, a 69% annotation rate was achieved. Differential expression analysis revealed the regulation of 364 genes in response to wounding. The wound responses in D. cymosum included key mechanisms relating to signalling cascades, phytohormone regulation, transcription factors and defence-related secondary metabolites. However, the role of fluoroacetate in inducible wound responses remains unclear. Bacterial fluorinases were searched against the D. cymosum transcriptome but transcripts with homology were not detected suggesting the presence of a potentially different fluorinating enzyme in plants. Nevertheless, the transcriptome produced in this study significantly increases genetic resources available for D. cymosum and will assist with future research into fluorometabolite-producing plants.
- Published
- 2020
- Full Text
- View/download PDF
3. Ten simple rules for developing bioinformatics capacity at an academic institution
- Author
-
Shaun Aron, C. Victor Jongeneel, Paballo Abel Chauke, Melek Chaouch, Judit Kumuthini, Lyndon Zass, Fouzia Radouani, Samar Kamal Kassim, Faisal M. Fadlelmola, and Nicola Mulder
- Subjects
Biology (General) ,QH301-705.5 - Published
- 2021
4. The Development of a Sustainable Bioinformatics Training Environment Within the H3Africa Bioinformatics Network (H3ABioNet)
- Author
-
Shaun Aron, Paballo Abel Chauke, Verena Ras, Sumir Panji, Katherine Johnston, and Nicola Mulder
- Subjects
LMIC ,training modalities ,Africa ,bioinformatics training ,professional development ,Education (General) ,L7-991 - Abstract
Bioinformatics training programs have been developed independently around the world based on the perceived needs of the local and global academic communities. The field of bioinformatics is complicated by the need to train audiences from diverse backgrounds in a variety of topics to various levels of competencies. While there have been several attempts to develop standardised approaches to provide bioinformatics training globally, the challenges encountered in resource limited settings hinder the adaptation of these global approaches. H3ABioNet, a Pan-African Bioinformatics Network with 27 nodes in 16 African countries, has realised that there is no single simple solution to this challenge and has rather, over the years, evolved and adapted training approaches to create a sustainable training environment, with several components that allow for the successful dissemination of bioinformatics knowledge to diverse audiences. This has been achieved through the implementation of a combination of training modalities and sharing of high quality training material and experiences. The results highlight the success of implementing this multi-pronged approach to training, to reach audiences from different backgrounds and provide training in a variety of different areas of expertise. While face-to-face training was initially required and successful, the mixed-model teaching approach allowed for an increased reach, providing training in advanced analysis topics to reach large audiences across the continent with minimal teaching resources. The transition to hackathons provided an environment to allow the progression of skills, once basic skills had been developed, together with the development of real-world solutions to bioinformatics problems. Ensuring our training materials are FAIR, and through synergistic collaborations with global training partners, the reach of our training materials extends beyond H3ABioNet. Coupled with the opportunity to develop additional career building soft skills, such as scientific communication, H3ABioNet has created a flexible, sustainable and high quality bioinformatics training environment that has successfully been implemented to train several highly skilled African bioinformaticians on the continent.
- Published
- 2021
- Full Text
- View/download PDF
5. Using a multiple-delivery-mode training approach to develop local capacity and infrastructure for advanced bioinformatics in Africa.
- Author
-
Verena Ras, Gerrit Botha, Shaun Aron, Katie Lennard, Imane Allali, Shantelle Claassen-Weitz, Kilaza Samson Mwaikono, Dane Kennedy, Jessica R Holmes, Gloria Rendon, Sumir Panji, Christopher J Fields, and Nicola Mulder
- Subjects
Biology (General) ,QH301-705.5 - Abstract
With more microbiome studies being conducted by African-based research groups, there is an increasing demand for knowledge and skills in the design and analysis of microbiome studies and data. However, high-quality bioinformatics courses are often impeded by differences in computational environments, complicated software stacks, numerous dependencies, and versions of bioinformatics tools along with a lack of local computational infrastructure and expertise. To address this, H3ABioNet developed a 16S rRNA Microbiome Intermediate Bioinformatics Training course, extending its remote classroom model. The course was developed alongside experienced microbiome researchers, bioinformaticians, and systems administrators, who identified key topics to address. Development of containerised workflows has previously been undertaken by H3ABioNet, and Singularity containers were used here to enable the deployment of a standard replicable software stack across different hosting sites. The pilot ran successfully in 2019 across 23 sites registered in 11 African countries, with more than 200 participants formally enrolled and 106 volunteer staff for onsite support. The pulling, running, and testing of the containers, software, and analyses on various clusters were performed prior to the start of the course by hosting classrooms. The containers allowed the replication of analyses and results across all participating classrooms running a cluster and remained available posttraining ensuring analyses could be repeated on real data. Participants thus received the opportunity to analyse their own data, while local staff were trained and supported by experienced experts, increasing local capacity for ongoing research support. This provides a model for delivering topic-specific bioinformatics courses across Africa and other remote/low-resourced regions which overcomes barriers such as inadequate infrastructures, geographical distance, and access to expertise and educational materials.
- Published
- 2021
- Full Text
- View/download PDF
6. Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics
- Author
-
Shakuntala Baichoo, Yassine Souilmi, Sumir Panji, Gerrit Botha, Ayton Meintjes, Scott Hazelhurst, Hocine Bendou, Eugene de Beste, Phelelani T. Mpangase, Oussema Souiai, Mustafa Alghali, Long Yi, Brian D. O’Connor, Michael Crusoe, Don Armstrong, Shaun Aron, Fourie Joubert, Azza E. Ahmed, Mamana Mbiyavanga, Peter van Heusden, Lerato E. Magosi, Jennie Zermeno, Liudmila Sergeevna Mainzer, Faisal M. Fadlelmola, C. Victor Jongeneel, and Nicola Mulder
- Subjects
Workflows ,Pipeline ,Bioinformatics ,Africa ,Genomics ,Docker ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background The Pan-African bioinformatics network, H3ABioNet, comprises 27 research institutions in 17 African countries. H3ABioNet is part of the Human Health and Heredity in Africa program (H3Africa), an African-led research consortium funded by the US National Institutes of Health and the UK Wellcome Trust, aimed at using genomics to study and improve the health of Africans. A key role of H3ABioNet is to support H3Africa projects by building bioinformatics infrastructure such as portable and reproducible bioinformatics workflows for use on heterogeneous African computing environments. Processing and analysis of genomic data is an example of a big data application requiring complex interdependent data analysis workflows. Such bioinformatics workflows take the primary and secondary input data through several computationally-intensive processing steps using different software packages, where some of the outputs form inputs for other steps. Implementing scalable, reproducible, portable and easy-to-use workflows is particularly challenging. Results H3ABioNet has built four workflows to support (1) the calling of variants from high-throughput sequencing data; (2) the analysis of microbial populations from 16S rDNA sequence data; (3) genotyping and genome-wide association studies; and (4) single nucleotide polymorphism imputation. A week-long hackathon was organized in August 2016 with participants from six African bioinformatics groups, and US and European collaborators. Two of the workflows are built using the Common Workflow Language framework (CWL) and two using Nextflow. All the workflows are containerized for improved portability and reproducibility using Docker, and are publicly available for use by members of the H3Africa consortium and the international research community. Conclusion The H3ABioNet workflows have been implemented in view of offering ease of use for the end user and high levels of reproducibility and portability, all while following modern state of the art bioinformatics data processing protocols. The H3ABioNet workflows will service the H3Africa consortium projects and are currently in use. All four workflows are also publicly available for research scientists worldwide to use and adapt for their respective needs. The H3ABioNet workflows will help develop bioinformatics capacity and assist genomics research within Africa and serve to increase the scientific output of H3Africa and its Pan-African Bioinformatics Network.
- Published
- 2018
- Full Text
- View/download PDF
7. Whole-genome sequencing for an enhanced understanding of genetic variation among South Africans
- Author
-
Ananyo Choudhury, Michèle Ramsay, Scott Hazelhurst, Shaun Aron, Soraya Bardien, Gerrit Botha, Emile R. Chimusa, Alan Christoffels, Junaid Gamieldien, Mahjoubeh J. Sefid-Dashti, Fourie Joubert, Ayton Meintjes, Nicola Mulder, Raj Ramesar, Jasper Rees, Kathrine Scholtz, Dhriti Sengupta, Himla Soodyall, Philip Venter, Louise Warnich, and Michael S. Pepper
- Subjects
Science - Abstract
African populations show a high level of genetic diversity and extensive regional admixture. Here, the authors sequence the whole genomes of 24 South African individuals of different ethnolinguistic origin and find substantive genomic divergence between two southeastern Bantu-speaking groups.
- Published
- 2017
- Full Text
- View/download PDF
8. SWATH-MS based proteomic profiling of pancreatic ductal adenocarcinoma tumours reveals the interplay between the extracellular matrix and related intracellular pathways.
- Author
-
Ekene Emmanuel Nweke, Previn Naicker, Shaun Aron, Stoyan Stoychev, John Devar, David L Tabb, Jones Omoshoro-Jones, Martin Smith, and Geoffrey Candy
- Subjects
Medicine ,Science - Abstract
Pancreatic cancer accounts for 2.8% of new cancer cases worldwide and is projected to become the second leading cause of cancer-related deaths by 2030. Patients of African ancestry appear to be at an increased risk for pancreatic ductal adenocarcinoma (PDAC), with more severe disease and outcomes. The purpose of this study was to map the proteomic and genomic landscape of a cohort of PDAC patients of African ancestry. Thirty tissues (15 tumours and 15 normal adjacent tissues) were obtained from consenting South African PDAC patients. Optimisation of the sample preparation method allowed for the simultaneous extraction of high-purity protein and DNA for SWATH-MS and OncoArray SNV analyses. We quantified 3402 proteins with 49 upregulated and 35 downregulated proteins at a minimum 2.1 fold change and FDR adjusted p-value (q-value) ≤ 0.01 when comparing tumour to normal adjacent tissue. Many of the upregulated proteins in the tumour samples are involved in extracellular matrix formation (ECM) and related intracellular pathways. In addition, proteins such as EMIL1, KBTB2, and ZCCHV involved in the regulation of ECM proteins were observed to be dysregulated in pancreatic tumours. Downregulation of pathways involved in oxygen and carbon dioxide transport were observed. Genotype data showed missense mutations in some upregulated proteins, such as MYPN, ESTY2 and SERPINB8. Approximately 11% of the dysregulated proteins, including ISLR, BP1, PTK7 and OLFL3, were predicted to be secretory proteins. These findings help in further elucidating the biology of PDAC and may aid in identifying future plausible markers for the disease.
- Published
- 2020
- Full Text
- View/download PDF
9. Organizing and running bioinformatics hackathons within Africa: The H3ABioNet cloud computing experience [version 2; peer review: 2 approved, 1 approved with reservations]
- Author
-
Azza E. Ahmed, Phelelani T. Mpangase, Sumir Panji, Shakuntala Baichoo, Yassine Souilmi, Faisal M. Fadlelmola, Mustafa Alghali, Shaun Aron, Hocine Bendou, Eugene De Beste, Mamana Mbiyavanga, Oussema Souiai, Long Yi, Jennie Zermeno, Don Armstrong, Brian D. O'Connor, Liudmila Sergeevna Mainzer, Michael R. Crusoe, Ayton Meintjes, Peter Van Heusden, Gerrit Botha, Fourie Joubert, C. Victor Jongeneel, Scott Hazelhurst, and Nicola Mulder
- Subjects
Medicine ,Science - Abstract
The need for portable and reproducible genomics analysis pipelines is growing globally as well as in Africa, especially with the growth of collaborative projects like the Human Health and Heredity in Africa Consortium (H3Africa). The Pan-African H3Africa Bioinformatics Network (H3ABioNet) recognized the need for portable, reproducible pipelines adapted to heterogeneous computing environments, and for the nurturing of technical expertise in workflow languages and containerization technologies. Building on the network’s Standard Operating Procedures (SOPs) for common genomic analyses, H3ABioNet arranged its first Cloud Computing and Reproducible Workflows Hackathon in 2016, with the purpose of translating those SOPs into analysis pipelines able to run on heterogeneous computing environments and meeting the needs of H3Africa research projects. This paper describes the preparations for this hackathon and reflects upon the lessons learned about its impact on building the technical and scientific expertise of African researchers. The workflows developed were made publicly available in GitHub repositories and deposited as container images on Quay.io.
- Published
- 2019
- Full Text
- View/download PDF
10. Whole genome sequence of Oscheius sp. TEL-2014 entomopathogenic nematodes isolated from South Africa
- Author
-
Tiisetso E. Lephoto, Phelelani T. Mpangase, Shaun Aron, and Vincent M. Gray
- Subjects
Genetics ,QH426-470 - Abstract
We present the annotation of the draft genome sequence of Oscheius sp. TEL-2014 (Genbank accession number KM492926). This entomopathogenic nematode was isolated from grassland in Suikerbosrand Nature Reserve near Johannesburg in South Africa. Oscheius sp. Strain TEL has a genome size of 110,599,558 bp and a GC content of 42.24%. The genome sequence can be accessed at DDBJ/EMBL/GenBank under the accession number LNBV00000000. Keywords: Oscheius, Entomopathogenic nematodes, Whole-genome sequencing, Genome assembly, Genome annotation
- Published
- 2016
- Full Text
- View/download PDF
11. Designing a course model for distance-based online bioinformatics training in Africa: The H3ABioNet experience.
- Author
-
Kim T Gurwitz, Shaun Aron, Sumir Panji, Suresh Maslamoney, Pedro L Fernandes, David P Judge, Amel Ghouila, Jean-Baka Domelevo Entfellner, Fatma Z Guerfali, Colleen Saunders, Ahmed Mansour Alzohairy, Samson P Salifu, Rehab Ahmed, Ruben Cloete, Jonathan Kayondo, Deogratius Ssemwanga, Nicola Mulder, and H3ABioNet Consortium's Education Training and Working Group as members of the H3Africa Consortium
- Subjects
Biology (General) ,QH301-705.5 - Abstract
Africa is not unique in its need for basic bioinformatics training for individuals from a diverse range of academic backgrounds. However, particular logistical challenges in Africa, most notably access to bioinformatics expertise and internet stability, must be addressed in order to meet this need on the continent. H3ABioNet (www.h3abionet.org), the Pan African Bioinformatics Network for H3Africa, has therefore developed an innovative, free-of-charge "Introduction to Bioinformatics" course, taking these challenges into account as part of its educational efforts to provide on-site training and develop local expertise inside its network. A multiple-delivery-mode learning model was selected for this 3-month course in order to increase access to (mostly) African, expert bioinformatics trainers. The content of the course was developed to include a range of fundamental bioinformatics topics at the introductory level. For the first iteration of the course (2016), classrooms with a total of 364 enrolled participants were hosted at 20 institutions across 10 African countries. To ensure that classroom success did not depend on stable internet, trainers pre-recorded their lectures, and classrooms downloaded and watched these locally during biweekly contact sessions. The trainers were available via video conferencing to take questions during contact sessions, as well as via online "question and discussion" forums outside of contact session time. This learning model, developed for a resource-limited setting, could easily be adapted to other settings.
- Published
- 2017
- Full Text
- View/download PDF
12. Assessing computational genomics skills: Our experience in the H3ABioNet African bioinformatics network.
- Author
-
C Victor Jongeneel, Ovokeraye Achinike-Oduaran, Ezekiel Adebiyi, Marion Adebiyi, Seun Adeyemi, Bola Akanle, Shaun Aron, Efejiro Ashano, Hocine Bendou, Gerrit Botha, Emile Chimusa, Ananyo Choudhury, Ravikiran Donthu, Jenny Drnevich, Oluwadamila Falola, Christopher J Fields, Scott Hazelhurst, Liesl Hendry, Itunuoluwa Isewon, Radhika S Khetani, Judit Kumuthini, Magambo Phillip Kimuda, Lerato Magosi, Liudmila Sergeevna Mainzer, Suresh Maslamoney, Mamana Mbiyavanga, Ayton Meintjes, Danny Mugutso, Phelelani Mpangase, Richard Munthali, Victoria Nembaware, Andrew Ndhlovu, Trust Odia, Adaobi Okafor, Olaleye Oladipo, Sumir Panji, Venesa Pillay, Gloria Rendon, Dhriti Sengupta, and Nicola Mulder
- Subjects
Biology (General) ,QH301-705.5 - Abstract
The H3ABioNet pan-African bioinformatics network, which is funded to support the Human Heredity and Health in Africa (H3Africa) program, has developed node-assessment exercises to gauge the ability of its participating research and service groups to analyze typical genome-wide datasets being generated by H3Africa research groups. We describe a framework for the assessment of computational genomics analysis skills, which includes standard operating procedures, training and test datasets, and a process for administering the exercise. We present the experiences of 3 research groups that have taken the exercise and the impact on their ability to manage complex projects. Finally, we discuss the reasons why many H3ABioNet nodes have declined so far to participate and potential strategies to encourage them to do so.
- Published
- 2017
- Full Text
- View/download PDF
13. Functional characterisation of the transcriptome from leaf tissue of the fluoroacetate-producing plant, Dichapetalum cymosum, in response to mechanical wounding
- Author
-
Robert H. Archer, Karl Rumbold, Shaun Aron, Mihai-Silviu Tomescu, Selisha A. Sooklal, Phelelani T. Mpangase, and Scott Hazelhurst
- Subjects
0106 biological sciences ,0301 basic medicine ,Bioinformatics ,Fluoroacetates ,Science ,Defence mechanisms ,Down-Regulation ,01 natural sciences ,Article ,Transcriptome ,Magnoliopsida ,03 medical and health sciences ,Gene Expression Regulation, Plant ,Transferases ,RNA, Messenger ,KEGG ,Transcriptomics ,Gene ,Plant Proteins ,Regulation of gene expression ,Multidisciplinary ,biology ,Dichapetalum cymosum ,Molecular Sequence Annotation ,biology.organism_classification ,Gene expression profiling ,Biosynthetic Pathways ,Up-Regulation ,Plant Leaves ,Gene Ontology ,030104 developmental biology ,Biochemistry ,Wounding ,Fluoroacetate ,Medicine ,Stress, Mechanical ,Function (biology) ,010606 plant biology & botany - Abstract
Dichapetalum cymosum produces the toxic fluorinated metabolite, fluoroacetate, presumably as a defence mechanism. Given the rarity of fluorinated metabolites in nature, the biosynthetic origin and function of fluoroacetate have been of particular interest. However, the mechanism for fluorination in D. cymosum was never elucidated. More importantly, there is a severe lack in knowledge on a genetic level for fluorometabolite-producing plants, impeding research on the subject. Here, we report on the first transcriptome for D. cymosum and investigate the wound response for insights into fluorometabolite production. Mechanical wounding studies were performed and libraries of the unwounded (control) and wounded (30 and 60 min post wounding) plant were sequenced using the Illumina HiSeq platform. A combined reference assembly generated 77,845 transcripts. Using the SwissProt, TrEMBL, GO, eggNOG, KEGG, Pfam, EC and PlantTFDB databases, a 69% annotation rate was achieved. Differential expression analysis revealed the regulation of 364 genes in response to wounding. The wound responses in D. cymosum included key mechanisms relating to signalling cascades, phytohormone regulation, transcription factors and defence-related secondary metabolites. However, the role of fluoroacetate in inducible wound responses remains unclear. Bacterial fluorinases were searched against the D. cymosum transcriptome but transcripts with homology were not detected suggesting the presence of a potentially different fluorinating enzyme in plants. Nevertheless, the transcriptome produced in this study significantly increases genetic resources available for D. cymosum and will assist with future research into fluorometabolite-producing plants.
- Published
- 2020
14. Guidelines for developing and updating short courses and course programs using the ISCB competency framework
- Author
-
Russell Schwartz, Catherine Brooksbank, Bruno Gaeta, Nicola Mulder, Sarah Morgan, Venkata Satagopam, Verena Ras, Samson Pandam Salifu, Sonika Tyagi, Gaston K. Mazandu, Shaun Aron, Marta Lloret-Llinares, Judit Kumuthini, MIchelle Brazas, Astrid Gall, Benjamin Moore, and Alexandra Holinski
- Subjects
FOS: Computer and information sciences ,Bioinformatics ,Course design ,Competency framework ,Competency profile ,Computational Biology ,Training ,Curriculum design ,Education ,Continuing Professional Development - Abstract
Competency frameworks have proved to be a powerful tool for curriculum development and assessment across many subject domains, and the field of computational biology is no exception. Efforts from the ISCB to develop and successively refine a set of competencies for bioinformatics education and various associated mapping tools have provided a framework for bringing competency-based design principles broadly to education and training of a wide range of professionals in need of some level of mastery of the principles and practice of computational biology. This document seeks to provide some basic guidance for education and training professionals in the field in how to use this framework effectively. It includes a basic background on competency-based education and the history of the ISCB competency framework specifically, leading up to the Version 3 framework considered here. It then follows with some basic principles of applying competency-based education and an illustration of how they apply to different tasks in curriculum development. Appendices and various linked documents provide further elaboration and helpful guidance on the ISCB competencies specifically and some ways in which versions of them have been used already to develop diverse forms of bioinformatics education and training experience. Our target readerships are trainers and educators working in computational biology or more broadly in the molecular life sciences, medicine, and other disciplines that use biomolecular data, including those working in academia, industry and the public sector., These guidelines are available as a google doc at https://docs.google.com/document/d/1hgBEq-C8HvmxxQ2grYiqNXt-agM1LWzW673j6uQHHmI/edit#heading=h.1x0y3vxuz6a7
- Published
- 2021
- Full Text
- View/download PDF
15. Genetic substructure and complex demographic history of South African Bantu speakers
- Author
-
Shane A. Norris, Felistas Mashinya, Gavin Whitelaw, Shaun Aron, F. Xavier Gómez-Olivé, Carina M. Schlebusch, Dhriti Sengupta, Hilde Gunnink, Peter Delius, Ananyo Choudhury, Cesar Fortes-Lima, Marianne Alberts, Natalia Chousou-Polydouri, AWI-Gen Study, Koen Bostoen, Michèle Ramsay, Scott Hazelhurst, and Stephen Tollman
- Subjects
0301 basic medicine ,Male ,Population genetics ,General Physics and Astronomy ,Bantu languages ,Genome-wide association studies ,Gene flow ,Evolutionsbiologi ,South Africa ,0302 clinical medicine ,Gene Frequency ,Ethnicity ,Phylogeny ,media_common ,Language ,education.field_of_study ,Multidisciplinary ,Geography ,Population size ,Genomics ,Science General ,Trait ,Female ,Gene Flow ,Demographic history ,media_common.quotation_subject ,Science ,Population ,Black People ,Genetics and Molecular Biology ,General Biochemistry, Genetics and Molecular Biology ,Article ,Evolutionary genetics ,03 medical and health sciences ,Genetics ,Humans ,Genetik ,education ,Genetic association ,Demography ,Evolutionary Biology ,Chromosomes, Human, Y ,Genetic Variation ,Linguistics ,General Chemistry ,Computational biology and bioinformatics ,030104 developmental biology ,Genetics, Population ,Haplotypes ,General Biochemistry ,030217 neurology & neurosurgery ,Diversity (politics) ,Genome-Wide Association Study - Abstract
South Eastern Bantu-speaking (SEB) groups constitute more than 80% of the population in South Africa. Despite clear linguistic and geographic diversity, the genetic differences between these groups have not been systematically investigated. Based on genome-wide data of over 5000 individuals, representing eight major SEB groups, we provide strong evidence for fine-scale population structure that broadly aligns with geographic distribution and is also congruent with linguistic phylogeny (separation of Nguni, Sotho-Tswana and Tsonga speakers). Although differential Khoe-San admixture plays a key role, the structure persists after Khoe-San ancestry-masking. The timing of admixture, levels of sex-biased gene flow and population size dynamics also highlight differences in the demographic histories of individual groups. The comparisons with five Iron Age farmer genomes further support genetic continuity over ~400 years in certain regions of the country. Simulated trait genome-wide association studies further show that the observed population structure could have major implications for biomedical genomics research in South Africa., Despite linguistic and geographic diversity in South Eastern Bantu-speaking (SEB) groups of South Africa, genetic variation in these groups has not been investigated in depth. Here, the authors analyse genome-wide data from 5056 individuals, providing insights into demographic history across SEB groups.
- Published
- 2021
- Full Text
- View/download PDF
16. Genetic-substructure and complex demographic history of South African Bantu speakers
- Author
-
Shane A. Norris, Gavin Whitelaw, Scott Hazelhurst, Ananyo Choudhury, Michèle Ramsay, Dhriti Sengupta, F Gomez-Olive Casas, Koen Bostoen, Marianne Alberts, Hilde Gunnink, Carina M. Schlebusch, Stephen Tollman, Peter Delius, Shaun Aron, Cesar Fortes-Lima, Natalia Chousou-Polydouri, as members, and Felistas Mashinya
- Subjects
education.field_of_study ,Demographic history ,media_common.quotation_subject ,Population size ,Population ,Bantu languages ,Gene flow ,Geography ,Trait ,education ,Demography ,Diversity (politics) ,media_common ,Genetic association - Abstract
South Eastern Bantu-speaking (SEB) groups constitute more than 80% of the population in South Africa. Despite clear linguistic and geographic diversity, the genetic differences between these groups have not been systematically investigated. Based on genome-wide data of over 5000 individuals, representing eight major SEB groups, we provide strong evidence for fine-scale population structure that broadly aligns with geographic distribution and is also congruent with linguistic phylogeny (separation of Nguni, Sotho-Tswana and Tsonga speakers). Although differential Khoe-San admixture plays a key role, the structure persists after Khoe-San ancestry-masking. The timing of admixture, levels of sex-biased gene flow and population size dynamics also highlight differences in the demographic histories of individual groups. The comparisons with five Iron Age farmer genomes further support genetic continuity over ∼400 years in certain regions of the country. Simulated trait genome-wide association studies further show that the observed population structure could have major implications for biomedical genomics research in South Africa.
- Published
- 2020
- Full Text
- View/download PDF
17. SWATH-MS based proteomic profiling of Pancreatic Ductal Adenocarcinoma tumours reveals the interplay between the extracellular matrix and related intracellular pathways
- Author
-
Martin Smith, David L. Tabb, J. Devar, Ekene Emmanuel Nweke, Stoyan Stoychev, Shaun Aron, Geoffrey P. Candy, Previn Naicker, and Jones Omoshoro-Jones
- Subjects
Male ,0301 basic medicine ,Protein Folding ,Proteome ,Biochemistry ,Mass Spectrometry ,Lung and Intrathoracic Tumors ,Cohort Studies ,Extracellular matrix ,0302 clinical medicine ,Breast Tumors ,Medicine and Health Sciences ,Macromolecular Structure Analysis ,Ethnicities ,Gene Regulatory Networks ,Aged, 80 and over ,Regulation of gene expression ,Extracellular Matrix Proteins ,Multidisciplinary ,Middle Aged ,Prognosis ,Extracellular Matrix ,Gene Expression Regulation, Neoplastic ,Oncology ,030220 oncology & carcinogenesis ,Medicine ,Female ,Cellular Structures and Organelles ,PTK7 ,Intracellular ,Carcinoma, Pancreatic Ductal ,Research Article ,Protein Structure ,Science ,Biology ,Polymorphism, Single Nucleotide ,Pancreatic Cancer ,03 medical and health sciences ,Pancreatic cancer ,Gastrointestinal Tumors ,Breast Cancer ,Biomarkers, Tumor ,medicine ,Humans ,Molecular Biology ,Aged ,African People ,Proteomic Profiling ,Gene Expression Profiling ,Cancers and Neoplasms ,Biology and Life Sciences ,Proteins ,Cancer ,Cell Biology ,medicine.disease ,Fold change ,Pancreatic Neoplasms ,Gene expression profiling ,030104 developmental biology ,Secretory protein ,Case-Control Studies ,People and Places ,Cancer research ,Population Groupings ,Follow-Up Studies - Abstract
Pancreatic cancer accounts for 2.8% of new cancer cases worldwide and is projected to become the second leading cause of cancer-related deaths by 2030. Patients of African ancestry appear to be at an increased risk for pancreatic ductal adenocarcinoma (PDAC), with more severe disease and outcomes. The purpose of this study was to map the proteomic and genomic landscape of a cohort of PDAC patients of African ancestry. Thirty tissues (15 tumours and 15 normal adjacent tissues) were obtained from consenting South African PDAC patients. Optimisation of the sample preparation method allowed for the simultaneous extraction of high-purity protein and DNA for SWATH-MS and OncoArray SNV analyses. We quantified 3402 proteins with 49 upregulated and 35 downregulated proteins at a minimum 2.1 fold change and FDR adjusted p-value (q-value) ≤ 0.01 when comparing tumour to normal adjacent tissue. Many of the upregulated proteins in the tumour samples are involved in extracellular matrix formation (ECM) and related intracellular pathways. In addition, proteins such as EMIL1, KBTB2, and ZCCHV involved in the regulation of ECM proteins were observed to be dysregulated in pancreatic tumours. Downregulation of pathways involved in oxygen and carbon dioxide transport were observed. Genotype data showed missense mutations in some upregulated proteins, such as MYPN, ESTY2 and SERPINB8. Approximately 11% of the dysregulated proteins, including ISLR, BP1, PTK7 and OLFL3, were predicted to be secretory proteins. These findings help in further elucidating the biology of PDAC and may aid in identifying future plausible markers for the disease.
- Published
- 2020
- Full Text
- View/download PDF
18. Whole-genome sequencing for an enhanced understanding of genetic variation among South Africans
- Author
-
Gerrit Botha, Michael S. Pepper, Nicola Mulder, Kathrine Elizabeth Scholtz, Alan Christoffels, Emile R. Chimusa, Raj Ramesar, Ananyo Choudhury, Ayton Meintjes, Michèle Ramsay, Fourie Joubert, Scott Hazelhurst, Louise Warnich, Soraya Bardien, Shaun Aron, Jasper Rees, Mahjoubeh J. Sefid-Dashti, Dhriti Sengupta, Philip Venter, Junaid Gamieldien, and Himla Soodyall
- Subjects
0301 basic medicine ,Male ,Science ,DNA Mutational Analysis ,General Physics and Astronomy ,Black People ,Pilot Projects ,Human genetic variation ,Computational biology ,030105 genetics & heredity ,Biology ,Genome ,General Biochemistry, Genetics and Molecular Biology ,DNA sequencing ,Article ,03 medical and health sciences ,South Africa ,Genetic variation ,Humans ,Genetic Predisposition to Disease ,lcsh:Science ,Whole genome sequencing ,Principal Component Analysis ,Multidisciplinary ,Data curation ,Genome, Human ,Genetic Variation ,General Chemistry ,Healthy Volunteers ,030104 developmental biology ,Functional annotation ,Mutation ,lcsh:Q ,Human genome - Abstract
The Southern African Human Genome Programme is a national initiative that aspires to unlock the unique genetic character of southern African populations for a better understanding of human genetic diversity. In this pilot study the Southern African Human Genome Programme characterizes the genomes of 24 individuals (8 Coloured and 16 black southeastern Bantu-speakers) using deep whole-genome sequencing. A total of ~16 million unique variants are identified. Despite the shallow time depth since divergence between the two main southeastern Bantu-speaking groups (Nguni and Sotho-Tswana), principal component analysis and structure analysis reveal significant (p, African populations show a high level of genetic diversity and extensive regional admixture. Here, the authors sequence the whole genomes of 24 South African individuals of different ethnolinguistic origin and find substantive genomic divergence between two southeastern Bantu-speaking groups.
- Published
- 2017
19. Author Correction: High-depth African genomes inform human migration and health
- Author
-
Trust Odia, Judit Kumuthini, Shaun Aron, Yasmina Jaufeerally Fakim, Donna M. Muzny, Anisah W. Ghoorah, Charles N. Rotimi, Oscar A. Nyangiri, Gerrit Botha, Sally N. Adebamowo, Neil A. Hanchard, Alia Benkahla, Emile R. Chimusa, Laura R. Botigué, Oluwadamilare Falola, Ananyo Choudhury, Nicola Mulder, Samar K. Kassim, Eileen Dareng, Scott Hazelhurst, Mamana Mbiyavanga, Zané Lombard, Gaston K. Mazandu, Ezekiel Adebiyi, Richard A. Gibbs, Ginger A. Metcalf, Taoufik Bensellak, Daniel Shriner, Michèle Ramsay, Adebowale Adeyemo, Gordon Wells, and Dhriti Sengupta
- Subjects
Multidisciplinary ,Human evolutionary genetics ,Human migration ,business.industry ,Published Erratum ,MEDLINE ,Computational biology ,Biology ,Genome ,Evolutionary genetics ,DNA sequencing ,Genetics research ,Genetic variation ,Next-generation sequencing ,Author Correction ,business - Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41586-021-03286-9.
- Published
- 2021
- Full Text
- View/download PDF
20. African genetic diversity provides novel insights into evolutionary history and local adaptations
- Author
-
Shaun Aron, Dhriti Sengupta, Scott Hazelhurst, Ananyo Choudhury, and Michèle Ramsay
- Subjects
0301 basic medicine ,Demographic history ,Population ,Adaptation, Biological ,Black People ,Present day ,Biology ,Genome ,Polymorphism, Single Nucleotide ,Evolution, Molecular ,03 medical and health sciences ,Genetic variation ,Ethnicity ,Genetics ,Humans ,Invited Reviews ,education ,Molecular Biology ,Genetics (clinical) ,Whole genome sequencing ,education.field_of_study ,Genetic diversity ,Whole Genome Sequencing ,Population size ,Genetic Variation ,General Medicine ,Genomics ,Biological Evolution ,030104 developmental biology ,Genetics, Population ,Genetic Techniques ,Haplotypes ,Evolutionary biology ,Africa - Abstract
Genetic variation and susceptibility to disease are shaped by human demographic history and adaptation. We can now study the genomes of extant Africans and uncover traces of population migration, admixture, assimilation and selection by applying sophisticated computational algorithms. There are four major ethnolinguistic divisions among present day Africans: Hunter-gatherer populations in southern and central Africa; Nilo-Saharan speakers from north and northeast Africa; Afro-Asiatic speakers from north and east Africa; and Niger-Congo speakers who are the predominant ethnolinguistic group spread across most of sub-Saharan Africa. The enormous ethnolinguistic diversity in sub-Saharan African populations is largely paralleled by extensive genetic diversity and until a decade ago, little was known about detailed origins and divergence of these groups. Results from large-scale population genetic studies, and more recently whole genome sequence data, are unravelling the critical role of events like migration and admixture and environmental factors including diet, infectious diseases and climatic conditions in shaping current population diversity. It is now possible to start providing quantitative estimates of divergence times, population size and dynamic processes that have affected populations and their genetic risk for disease. Finally, the availability of ancient genomes from Africa provides historical insights of unprecedented depth. In this review, we highlight some key interpretations that have emerged from recent African genome studies.
- Published
- 2018
21. Designing a course model for distance-based online bioinformatics training in Africa: The H3ABioNet experience
- Author
-
Shaun Aron, Rehab Ahmed, Colleen J. Saunders, Kim T. Gurwitz, Amel Ghouila, Jean-Baka Domelevo Entfellner, Jonathan K. Kayondo, Fatma Z. Guerfali, Ruben Cloete, Suresh Maslamoney, Deogratius Ssemwanga, H ABioNet Consortium's Education Training, Nicola Mulder, Samson Pandam Salifu, Sumir Panji, David P. Judge, Pedro Fernandes, Ahmed M. Alzohairy, University of Cape Town, University of the Witwatersrand [Johannesburg] (WITS), Instituto Gulbenkian de Ciência [Oeiras] (IGC), Fundação Calouste Gulbenkian, Independent bioinformatics training specialist, Cambridge, Laboratoire de Transmission, Contrôle et Immunobiologie des Infections - Laboratory of Transmission, Control and Immunobiology of Infection (LR11IPT02), Institut Pasteur de Tunis, Réseau International des Instituts Pasteur (RIIP)-Réseau International des Instituts Pasteur (RIIP), South African National Bioinformatics Institute (SANBI), University of the Western Cape, Zagazig University, Kumasi Centre for Collaborative Research in Tropical Medicine (KCCR), Future Univ Sudan, University of Khartoum, Uganda Virus Research Institute, Research reported in this publication was supported by National Human Genome Research Institute (NHGRI) and the Office of the Director (OD),National Institutes of Health under award number U41HG006941, and H3ABioNet Consortium’s Education Training and Working Group as members of the H3Africa Consortium
- Subjects
0301 basic medicine ,Science and Technology Workforce ,Databases, Factual ,Computer science ,[SDV]Life Sciences [q-bio] ,Social Sciences ,MESH: Africa ,Bioinformatics ,computer.software_genre ,Careers in Research ,Session (web analytics) ,Database and Informatics Methods ,User-Computer Interface ,0302 clinical medicine ,Videoconferencing ,Learning and Memory ,Open Science ,Sociology ,ComputingMilieux_COMPUTERSANDEDUCATION ,Psychology ,Computer Networks ,Biology (General) ,Ecology ,4. Education ,Software Engineering ,Professions ,MESH: Internet ,Computational Theory and Mathematics ,Order (business) ,Modeling and Simulation ,Lectures ,Engineering and Technology ,The Internet ,Workshops ,Open Source Software ,MESH: Computational Biology ,Computer and Information Sciences ,Science Policy ,QH301-705.5 ,Stability (learning theory) ,Computer-Assisted Instruction ,Research and Analysis Methods ,Training (civil) ,Course (navigation) ,Education ,World Wide Web ,Computer Software ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,Human Learning ,Genetics ,Learning ,Humans ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,MESH: User-Computer Interface ,MESH: Computer-Assisted Instruction ,Internet ,MESH: Humans ,business.industry ,Software Tools ,Cognitive Psychology ,Biology and Life Sciences ,Computational Biology ,MESH: Databases, Factual ,030104 developmental biology ,People and Places ,Africa ,Cognitive Science ,Scientists ,Population Groupings ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,business ,computer ,030217 neurology & neurosurgery ,Neuroscience - Abstract
This publication hasn't any creative commons license associated. This deposit is composed by the main article plus the supplementary materials of the publication. This deposit is composed by a publication in which the IGC's authors have had the role of collaboration (it's a collaboration publication). This type of deposit in ARCA is in restrictedAccess (it can't be in open access to the public), and can only be accessed by two ways: either by requesting a legal copy from the author (the email contact present in this deposit) or by visiting the following link: http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005715#sec022 Africa is not unique in its need for basic bioinformatics training for individuals from a diverse range of academic backgrounds. However, particular logistical challenges in Africa, most notably access to bioinformatics expertise and internet stability, must be addressed in order to meet this need on the continent. H3ABioNet (www.h3abionet.org), the Pan African Bioinformatics Network for H3Africa, has therefore developed an innovative, free-of-charge "Introduction to Bioinformatics" course, taking these challenges into account as part of its educational efforts to provide on-site training and develop local expertise inside its network. A multiple-delivery-mode learning model was selected for this 3-month course in order to increase access to (mostly) African, expert bioinformatics trainers. The content of the course was developed to include a range of fundamental bioinformatics topics at the introductory level. For the first iteration of the course (2016), classrooms with a total of 364 enrolled participants were hosted at 20 institutions across 10 African countries. To ensure that classroom success did not depend on stable internet, trainers pre-recorded their lectures, and classrooms downloaded and watched these locally during biweekly contact sessions. The trainers were available via video conferencing to take questions during contact sessions, as well as via online "question and discussion" forums outside of contact session time. This learning model, developed for a resource-limited setting, could easily be adapted to other settings. National Human Genome Research Institute; Office of the Director; National Institutes of Health grant: (U41HG006941). info:eu-repo/semantics/publishedVersion
- Published
- 2017
- Full Text
- View/download PDF
22. Assessing computational genomics skills: Our experience in the H3ABioNet African bioinformatics network
- Author
-
Shaun Aron, Mamana Mbiyavanga, Lerato E Magosi, Efejiro Ashano, Christopher J. Fields, C. Victor Jongeneel, Danny Mugutso, Phelelani T. Mpangase, Sumir Panji, Venesa Pillay, Seun Adeyemi, Adaobi Okafor, Oluwadamila Falola, Hocine Bendou, Ananyo Choudhury, Olaleye Oladipo, Ezekiel Adebiyi, Radhika S. Khetani, Ovokeraye Achinike-Oduaran, Bola Akanle, Richard J. Munthali, Suresh Maslamoney, Ayton Meintjes, Gloria Rendon, Nicola Mulder, Trust Odia, Andrew Ndhlovu, Ravikiran Donthu, Itunuoluwa Isewon, Liesl M. Hendry, Emile R. Chimusa, Jenny Drnevich, Judit Kumuthini, Magambo Phillip Kimuda, Scott Hazelhurst, Liudmila Sergeevna Mainzer, Marion O. Adebiyi, Victoria Nembaware, Dhriti Sengupta, and Gerrit Botha
- Subjects
0301 basic medicine ,Service (systems architecture) ,Computer science ,Data management ,Social Sciences ,Bioinformatics ,Database and Informatics Methods ,South Africa ,Sociology ,Databases, Genetic ,Medicine and Health Sciences ,Public and Occupational Health ,Biology (General) ,Ecology ,Health services research ,Genomics ,Research Assessment ,Sports Science ,3. Good health ,Test (assessment) ,Professions ,Computational Theory and Mathematics ,Modeling and Simulation ,Workshops ,Health Services Research ,QH301-705.5 ,Process (engineering) ,Developing country ,Black People ,Nigeria ,Research and Analysis Methods ,Education ,03 medical and health sciences ,Cellular and Molecular Neuroscience ,Genome-Wide Association Studies ,Genetics ,Humans ,Sports and Exercise Medicine ,Molecular Biology ,Exercise ,Developing Countries ,Ecology, Evolution, Behavior and Systematics ,business.industry ,Computational genomics ,Biology and Life Sciences ,Computational Biology ,Human Genetics ,Physical Activity ,Genome Analysis ,Data science ,Health Care ,030104 developmental biology ,Physical Fitness ,People and Places ,Scientists ,Database Management Systems ,Population Groupings ,business - Abstract
The H3ABioNet pan-African bioinformatics network, which is funded to support the Human Heredity and Health in Africa (H3Africa) program, has developed node-assessment exercises to gauge the ability of its participating research and service groups to analyze typical genome-wide datasets being generated by H3Africa research groups. We describe a framework for the assessment of computational genomics analysis skills, which includes standard operating procedures, training and test datasets, and a process for administering the exercise. We present the experiences of 3 research groups that have taken the exercise and the impact on their ability to manage complex projects. Finally, we discuss the reasons why many H3ABioNet nodes have declined so far to participate and potential strategies to encourage them to do so., Author summary Many programs have been developed to boost the technical and computational skills of scientists working in low to medium income countries (LMIC), who often struggle to remain competitive with their peers in more developed parts of the world. Typically, these programs rely on intensive workshops where students acquire and exercise these skills under the supervision of experienced trainers. However, when trainees return to their home institutions, even after extensive exposure to state of the art techniques, they often find it difficult to put the skills they have acquired into practice and to establish themselves as fully independent practitioners. We have attempted to build a framework through which teams of scientists in African research groups can demonstrate that they have acquired the necessary skills to analyze different types of genomic datasets. Three teams of scientists who have successfully submitted to this assessment exercise report their positive experiences. Many potential participants have so far declined the opportunity, and we discuss the reasons for their reluctance as well as possible ways to facilitate their engagement and provide them with incentives. We argue that assessments such as this could be part of any program aiming to develop technical skills in scientists wishing to support genomic research programs.
- Published
- 2017
23. Organizing and running bioinformatics hackathons within Africa: The H3ABioNet cloud computing experience
- Author
-
Michael R. Crusoe, Liudmila Sergeevna Mainzer, Eugene de Beste, Ayton Meintjes, Gerrit Botha, Azza Ahmed, Nicola Mulder, Fourie Joubert, Shaun Aron, Don Armstrong, Sumir Panji, Shakuntala Baichoo, Hocine Bendou, Scott Hazelhurst, C. Victor Jongeneel, Peter van Heusden, Faisal M. Fadlelmola, Mamana Mbiyavanga, Phelelani T. Mpangase, Oussema Souiai, Brian O'Connor, Yassine Souilmi, Long Yi, Mustafa Alghali, and Jennie Zermeno
- Subjects
0301 basic medicine ,Bioinformatics ,workflow ,business.industry ,capacity building ,Applied Mathematics ,pipeline ,Capacity building ,Symmetric multiprocessor system ,Cloud computing ,Articles ,Method Article ,Pipeline (software) ,03 medical and health sciences ,030104 developmental biology ,0302 clinical medicine ,Workflow ,Open research ,reproducible ,hackathon ,Container (abstract data type) ,Applied research ,business ,030217 neurology & neurosurgery - Abstract
The need for portable and reproducible genomics analysis pipelines is growing globally as well as in Africa, especially with the growth of collaborative projects like the Human Health and Heredity in Africa Consortium (H3Africa). The Pan-African H3Africa Bioinformatics Network (H3ABioNet) recognized the need for portable, reproducible pipelines adapted to heterogeneous computing environments, and for the nurturing of technical expertise in workflow languages and containerization technologies. Building on the network’s Standard Operating Procedures (SOPs) for common genomic analyses, H3ABioNet arranged its first Cloud Computing and Reproducible Workflows Hackathon in 2016, with the purpose of translating those SOPs into analysis pipelines able to run on heterogeneous computing environments and meeting the needs of H3Africa research projects. This paper describes the preparations for this hackathon and reflects upon the lessons learned about its impact on building the technical and scientific expertise of African researchers. The workflows developed were made publicly available in GitHub repositories and deposited as container images on Quay.io.
- Published
- 2019
- Full Text
- View/download PDF
24. H3ABioNet, a sustainable pan-African bioinformatics network for human heredity and health in Africa
- Author
-
Samson Pandam Salifu, Radhika Khetani, Jelili Oyelade, Anmol Kiran, Cornelis Victor Jongeneel, Raphael Zozimus Sangeda, Kais Ghedira, Faisal M. Fadlelmola, Ayton Pierre Meintjes, Jen Cornick, Daniel Masiga, Khalid SADKI, Shakuntala Baichoo, Samar Kamal Kassim, Scott Hazelhurst, Azeddine Ibrahimi, Ozlem Tastan Bishop, Judit Kumuthini, Arox Wadson Kamng'ona, Rehab Ahmed, Nicola J Mulder, Dean Everett, Ahmed Moussa, Julie Makani, Chimusa Emile Rugamika, Jean-Baka Domelevo Entfellner, Phelelani Mpangase, Marion Adebiyi, Mohamed Alibi, Peter Van Heusden, Winston Hide, Victor Osamor, Hugh-George Patterton, Christopher Fields, Benjamin Kumwenda, Itunuoluwa Isewon, Souiai Oussama, Niklas Blomberg, Bruno Mmbando, Benard Kulohoma, Nicki Tiffin, Zahra Mungloo-Dilmohamud, Shaun Aron, Patrick Musicha, Stochastic Studies and Statistics, University of Cape Town, Department of Computer and Information Sciences, Covenant University, Centre National de Transfusion Sanguine, Rabat, Institut Pasteur de Tunis, Réseau International des Instituts Pasteur (RIIP), Noguchi Memorial Institute for Medical Research [Accra, Ghana] (NMIMR), University of Ghana, University of Sciences, Techniques and Technology of Bamako, University of Liverpool, University of Khartoum, Institut National de Recherche Agronomique, Rabat, Botswana Harvard AIDS Institute Partnership, Université Mohammed Premier [Oujda], University of the Witwatersrand [Johannesburg] (WITS), University of Sheffield, Sheffield Institute for Translational Neuroscience, Department of Biotechnology Laboratory (Med-Biotech), Mohammed V University in Rabat, University of Mauritius, University of Illinois at Urbana-Champaign [Urbana], University of Illinois System, Université Grenoble Alpes - UFR Médecine (UGA UFRM), Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), Medical Biochemistry and Molecular Biology Department, Faculty of Medicine, Ain Shams University, Cairo, Egypt, Uganda Virus Research Institute, Entebbe, Uganda, Centre for Proteomic and Genomic Research, Cape Town, South Africa, University of Dar es Salaam, Dar es Salaam, Tanzania, Muhimbili University of Health and Allied Sciences, Zagazig University, International Centre of Insect Physiology and Ecology, Nairobi, Kenya, Institut de Chimie de Strasbourg, Université de Strasbourg (UNISTRA)-Institut de Chimie du CNRS (INC)-Centre National de la Recherche Scientifique (CNRS), National Biotechnology Development Agency, Abuja, Nigeria, Centre de Recherche Médicale et Sanitaire (Niamey, Niger) (CERMES), Kumasi Centre for Collaborative Research in Tropical Medicine (KCCR), Department of Biomedical Sciences, University of Cape Town, Faculty of Health Sciences, University of the Free State [South Africa], Institut Pasteur du Maroc, Faculty of Sciences of Rabat, University Mohammed V of Rabat, Rabat, Morocco, Institut National d'Hygiène, Rabat, Morocco, Rhodes University, Grahamstown, University of the Western Cape, Cape Town, Management and Development for Health, Dar es Salaam, Tanzania, and Musicha, P
- Subjects
Resource ,0301 basic medicine ,Genetics, Medical ,Genomic research ,[SDV]Life Sciences [q-bio] ,Black People ,Genomics ,Health Promotion ,Biology ,MESH: Africa ,SUSCEPTIBILITY ,ANCESTRY ,Bioinformatics ,TUBERCULOSIS ,DISEASE ,03 medical and health sciences ,Human health ,Computer Systems ,Genetics ,Humans ,MESH: Genetics, Medical ,MESH: Genetic Variation ,GENOME-WIDE ASSOCIATION ,Human heredity ,Genetics (clinical) ,2. Zero hunger ,MESH: Humans ,Pan african ,MESH: Genomics ,1. No poverty ,MESH: Computer Systems ,Computational Biology ,Genetic Variation ,Popularity ,Human genetics ,3. Good health ,030104 developmental biology ,Health promotion ,Africa ,MESH: Health Promotion ,MESH: African Continental Ancestry Group ,MESH: Computational Biology - Abstract
International audience; The application of genomics technologies to medicine and biomedical research is increasing in popularity, made possible by new high-throughput genotyping and sequencing technologies and improved data analysis capabilities. Some of the greatest genetic diversity among humans, animals, plants, and microbiota occurs in Africa, yet genomic research outputs from the continent are limited. The Human Heredity and Health in Africa (H3Africa) initiative was established to drive the development of genomic research for human health in Africa, and through recognition of the critical role of bioinformatics in this process, spurred the establishment of H3ABioNet, a pan-African bioinformatics network for H3Africa. The limitations in bioinformatics capacity on the continent have been a major contributory factor to the lack of notable outputs in high-throughput biology research. Although pockets of high-quality bioinformatics teams have existed previously, the majority of research institutions lack experienced faculty who can train and supervise bioinformatics students. H3ABioNet aims to address this dire need, specifically in the area of human genetics and genomics, but knock-on effects are ensuring this extends to other areas of bioinformatics. Here, we describe the emergence of genomics research and the development of bioinformatics in Africa through H3ABioNet.
- Published
- 2016
- Full Text
- View/download PDF
25. Development of Bioinformatics Infrastructure for Genomics Research
- Author
-
Nicola J. Mulder, Ezekiel Adebiyi, Marion Adebiyi, Seun Adeyemi, Azza Ahmed, Rehab Ahmed, Bola Akanle, Mohamed Alibi, Don L. Armstrong, Shaun Aron, Efejiro Ashano, Shakuntala Baichoo, Alia Benkahla, David K. Brown, Emile R. Chimusa, Faisal M. Fadlelmola, Dare Falola, Segun Fatumo, Kais Ghedira, Amel Ghouila, Scott Hazelhurst, Itunuoluwa Isewon, Segun Jung, Samar Kamal Kassim, Jonathan K. Kayondo, Mamana Mbiyavanga, Ayton Meintjes, Somia Mohammed, Abayomi Mosaku, Ahmed Moussa, Mustafa Muhammd, Zahra Mungloo-Dilmohamud, Oyekanmi Nashiru, Trust Odia, Adaobi Okafor, Olaleye Oladipo, Victor Osamor, Jellili Oyelade, Khalid Sadki, Samson Pandam Salifu, Jumoke Soyemi, Sumir Panji, Fouzia Radouani, Oussama Souiai, Özlem Tastan Bishop, The HABioNet Consortium, as Members of the HAfrica Consortium, University of Cape Town, Department of Computer and Information Sciences, Covenant University, Covenant University Bioinformatics Research (CUBRe), University of Khartoum, Laboratoire de Bioinformatique, biomathématiques, biostatistiques (BIMS) (LR11IPT09), Institut Pasteur de Tunis, Réseau International des Instituts Pasteur (RIIP)-Réseau International des Instituts Pasteur (RIIP)-Université de Tunis El Manar (UTM), University of Illinois at Urbana-Champaign [Urbana], University of Illinois System, University of the Witwatersrand [Johannesburg] (WITS), Federal Ministry of Science and Technology [Abuja] (FMST), University of Mauritius, Rhodes University, Grahamstown, Institute of Infectious Diseases and Molecular Medicine (IDM), Future University of Sudan, Laboratoire de Transmission, Contrôle et Immunobiologie des Infections - Laboratory of Transmission, Control and Immunobiology of Infection (LR11IPT02), Réseau International des Instituts Pasteur (RIIP)-Réseau International des Instituts Pasteur (RIIP), Computation Institute [Chicago], University of Chicago, Université Ain Shams, Uganda Virus Research Institute (UVRI), Laboratoire des Technologies de l'Information et de la Communication [Tanger] (Labtic), Ecole Nationale des Sciences Appliquées [Tanger] (ENSAT), Landmark University [Omu-Aran], Université Mohammed V, Kwame Nkrumah University of Science and Technology [GHANA] (KNUST), École polytechnique fédérale d'Ilaro, Institut Pasteur du Maroc, Réseau International des Instituts Pasteur (RIIP), and H3ABioNet is supported by the National Institutes of Health Common Fund (grant number U41HG006941)
- Subjects
0301 basic medicine ,MESH: Genomics/methods ,Epidemiology ,Computer science ,[SDV]Life Sciences [q-bio] ,media_common.quotation_subject ,Genomics ,MESH: Africa ,Bioinformatics ,Data type ,03 medical and health sciences ,0302 clinical medicine ,Excellence ,Controlled vocabulary ,media_common ,MESH: Computational Biology/trends ,Community and Home Care ,Spatial data infrastructure ,MESH: Humans ,Data collection ,MESH: Biomedical Research/methods ,Data science ,Metadata ,030104 developmental biology ,Workflow ,Cardiology and Cardiovascular Medicine ,030217 neurology & neurosurgery - Abstract
Background: Although pockets of bioinformatics excellence have developed in Africa, generally, large-scale genomic data analysis has been limited by the availability of expertise and infrastructure. H3ABioNet, a pan-African bioinformatics network, was established to build capacity specifically to enable H3Africa (Human Heredity and Health in Africa) researchers to analyze their data in Africa. Since the inception of the H3Africa initiative, H3ABioNet’s role has evolved in response to changing needs from the consortium and the African bioinformatics community.Objectives: H3ABioNet set out to develop core bioinformatics infrastructure and capacity for genomics research in various aspects of data collection, transfer, storage, and analysis.Methods and Results: Various resources have been developed to address genomic data management and analysis needs of H3Africa researchers and other scientific communities on the continent. NetMap was developed and used to build an accurate picture of network performance within Africa and between Africa and the rest of the world, and Globus Online has been rolled out to facilitate data transfer. A participant recruitment database was developed to monitor participant enrollment, and data is being harmonized through the use of ontologies and controlled vocabularies. The standardized metadata will be integrated to provide a search facility for H3Africa data and biospecimens. Because H3Africa projects are generating large-scale genomic data, facilities for analysis and interpretation are critical. H3ABioNet is implementing several data analysis platforms that provide a large range of bioinformatics tools or workflows, such as Galaxy, the Job Management System, and eBiokits. A set of reproducible, portable, and cloud-scalable pipelines to support the multiple H3Africa data types are also being developed and dockerized to enable execution on multiple computing infrastructures. In addition, new tools have been developed for analysis of the uniquely divergent African data and for downstream interpretation of prioritized variants. To provide support for these and other bioinformatics queries, an online bioinformatics helpdesk backed by broad consortium expertise has been established. Further support is provided by means of various modes of bioinformatics training.Conclusions: For the past 4 years, the development of infrastructure support and human capacity through H3ABioNet, have significantly contributed to the establishment of African scientific networks, data analysis facilities, and training programs. Here, we describe the infrastructure and how it has affected genomics and bioinformatics research in Africa.HighlightsH3ABioNet is building capacity to enable analysis of genomic data in Africa.Infrastructure has been built for clinical and genomic data storage, management, and analysis.New algorithms and pipelines for African genomic data analysis have been developed.Data are being harmonized using ontologies to enable easy search and retrieval.Genomics training is implemented using various online and face-to-face approaches.
- Published
- 2017
- Full Text
- View/download PDF
26. H3ABioNet: Developing Sustainable Bioinformatics Capacity in Africa
- Author
-
Kim T. Gurwitz, Nicola Mulder, H ABioNet Educ., Shaun Aron, and Sumir Panji
- Subjects
0301 basic medicine ,Engineering ,business.industry ,media_common.quotation_subject ,Bioinformatics ,Training (civil) ,03 medical and health sciences ,030104 developmental biology ,Component (UML) ,Quality (business) ,Human heredity ,business ,Training program ,media_common - Abstract
The diverse, rapidly developing discipline of bioinformatics has become an integral component of all areas of biomedical research in recent years. The ability to manage, analyse and interpret large complex datasets hinges on having access to skilled researchers who have received adequate training in specialised technical areas. Currently in Africa, individuals with these specialised skills are sparsely scattered across the continent and there is an urgent need to develop sustainable training programs to improve bioinformatics capacity. As part of the Human Heredity and Health in Africa (H3Africa) initiative, a pan-African bioinformatics network for H3Africa (H3ABioNet) was established to develop bioinformatics infrastructure together with a sustainable high quality training program. Here, we discuss the various training events and approaches developed by H3ABioNet to address the lack of bioinformatics capacity in Africa, together with the valuable lessons learnt.
- Published
- 2017
- Full Text
- View/download PDF
27. Population-specific common SNPs reflect demographic histories and highlight regions of genomic plasticity with functional relevance
- Author
-
Michèle Ramsay, Nicki Tiffin, Ananyo Choudhury, Ayton Meintjes, Shaun Aron, Scott Hazelhurst, Ovokeraye Achinike-Oduaran, Mahjoubeh Jalali Sefid Dashti, Junaid Gamieldien, Nicola Mulder, Department of Clinical Laboratory Sciences, and Faculty of Health Sciences
- Subjects
Population ,Single-nucleotide polymorphism ,Biology ,Polymorphism, Single Nucleotide ,Genome ,Evolution, Molecular ,Genetic variation ,Genetics ,Humans ,Selection, Genetic ,Allele ,1000 Genomes Project ,education ,Gene ,Alleles ,Recombination, Genetic ,education.field_of_study ,Genome, Human ,Racial Groups ,Computational Biology ,Genomics ,Genetics, Population ,Human genome ,Databases, Nucleic Acid ,Research Article ,Biotechnology - Abstract
Background Population differentiation is the result of demographic and evolutionary forces. Whole genome datasets from the 1000 Genomes Project (October 2012) provide an unbiased view of genetic variation across populations from Europe, Asia, Africa and the Americas. Common population-specific SNPs (MAF > 0.05) reflect a deep history and may have important consequences for health and wellbeing. Their interpretation is contextualised by currently available genome data. Results The identification of common population-specific (CPS) variants (SNPs and SSV) is influenced by admixture and the sample size under investigation. Nine of the populations in the 1000 Genomes Project (2 African, 2 Asian (including a merged Chinese group) and 5 European) revealed that the African populations (LWK and YRI), followed by the Japanese (JPT) have the highest number of CPS SNPs, in concordance with their histories and given the populations studied. Using two methods, sliding 50-SNP and 5-kb windows, the CPS SNPs showed distinct clustering across large genome segments and little overlap of clusters between populations. iHS enrichment score and the population branch statistic (PBS) analyses suggest that selective sweeps are unlikely to account for the clustering and population specificity. Of interest is the association of clusters close to recombination hotspots. Functional analysis of genes associated with the CPS SNPs revealed over-representation of genes in pathways associated with neuronal development, including axonal guidance signalling and CREB signalling in neurones. Conclusions Common population-specific SNPs are non-randomly distributed throughout the genome and are significantly associated with recombination hotspots. Since the variant alleles of most CPS SNPs are the derived allele, they likely arose in the specific population after a split from a common ancestor. Their proximity to genes involved in specific pathways, including neuronal development, suggests evolutionary plasticity of selected genomic regions. Contrary to expectation, selective sweeps did not play a large role in the persistence of population-specific variation. This suggests a stochastic process towards population-specific variation which reflects demographic histories and may have some interesting implications for health and susceptibility to disease.
- Published
- 2014
- Full Text
- View/download PDF
28. The elusive gene for keratolytic winter erythema
- Author
-
Shaun Aron, Angela Hobbs, Peter R. Hull, and Michèle Ramsay
- Subjects
Genetics ,Candidate gene ,DNA Copy Number Variations ,business.industry ,Computational Biology ,Skin Diseases, Genetic ,General Medicine ,Keratosis ,medicine.disease ,Phenotype ,Parakeratosis ,South Africa ,Genetic linkage ,Erythema ,Medicine ,Microsatellite ,Humans ,Copy-number variation ,Keratolytic winter erythema ,business ,Gene ,Genetic Association Studies ,Founder effect - Abstract
Keratolytic winter erythema (KWE), also known as Oudtshoorn skin disease, is characterised by a cyclical disruption of normal epidermal keratinisation affecting primarily the palmoplantar skin with peeling of the palms and soles, which is worse in the winter. It is a rare monogenic, autosomal dominant condition of unknown cause. However, due to a founder effect, it occurs at a prevalence of 1/7 200 among South African Afrikaans-speakers. In the mid-1980s, samples were collected from affected families for a linkage study to pinpoint the location of the KWE gene. A genome-wide linkage analysis, using microsatellite markers, identified the KWE critical region on chromosome 8p23.1-p22. Subsequent genetic studies focused on screening candidate genes in this critical region; however, no pathogenic mutations that segregated exclusively with KWE were identified. The cathepsin B (CTSB) and farnesyl-diphosphate farnesyltransferase 1 (FDFT1) genes revealed no potentially pathogenic variants, nor did they show differential gene expression in affected skin. Mutation detection in additional candidate genes also failed to identify the KWE-associated variant, suggesting that the causal variant may be in an uncharacterised functional region. Bioinformatic analysis revealed highly conserved regions within the KWE critical region and a custom tiling array was designed to cover this region and to search for copy number variation. Although the study did not identify a variant that segregates exclusively with KWE, it provided valuable insight into the complex KWE-linked region. Next-generation sequencing approaches are being used to comb the region, but the causal variant for this interesting hyperkeratotic palmoplantar phenotype still remains elusive.
- Published
- 2013
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.