46 results on '"Bordin N"'
Search Results
2. Detection and enumeration of Lak megaphages in microbiome samples by endpoint and quantitative PCR.
- Author
-
Crisci, MA, Corsini, PM, Bordin, N, Chen, L-X, Banfield, JF, Santini, JM, Crisci, MA, Corsini, PM, Bordin, N, Chen, L-X, Banfield, JF, and Santini, JM
- Abstract
Lak megaphages are prevalent across diverse gut microbiomes and may potentially impact animal and human health through lysis of Prevotella. Given their large genome size (up to 660 kbp), Lak megaphages are difficult to culture, and their identification relies on molecular techniques. Here, we present optimized protocols for identifying Lak phages in various microbiome samples, including procedures for DNA extraction, followed by detection and quantification of genes encoding Lak structural proteins using diagnostic endpoint and SYBR green-based quantitative PCR, respectively. For complete details on the use and execution of this protocol, please refer to Crisci et al., (2021).
- Published
- 2022
3. Empirical first-line treatment use and effectiveness trends in Europe in the period 2013- 2020: results from the European registry on H. pylori management (HP-EUREG)
- Author
-
O. P. Nyssen , Á. Pérez-Aísa , D. Vaira , L. Jonaitis , B. Tepes , A. Keco-Huerga , M. Castro- Fernández , A. Lucendo , D. Bordin , N. Brglez Jurecic , L. Vologzhanina, M. Caldas , G. Fadieienko, R. Abdulkhakov, L. Bujanda, M. Leja, M. Romano, S. Georgopoulos, Ante Tonkić, H. Simsek, A. Gasbarrini, G. M Buzas, P. Phull, M. Venerito, P. Malfertheiner, J. Kupčinskas , G. Babayeva, O. Shvets, F. Lerang, R. Marcos Pinto, T. Rokkas, I. Simsek, S. Smith, Y. Niv, D. Lamarque, F. Heluwaert, A. Goldis, W. Marlicz, V. Milivojevic, L. Boyanova, L. Kunovský, V. Lamy, C. Beglinger, P. Bytzer, L. Capelle, I. Puig, F. Mégraud, C. O’Morain, J. P. Gisbert
- Subjects
Helicobacter pylori - Abstract
Background: The impact of consensus, prescription choices and efficacy trends on clinical practice over time has not been studied in depth. Methods: International multicenter prospective non-interventional registry aimed to evaluate the decisions and outcomes of H. pylori management by European gastroenterologists. All infected adult patients were registered at AEG-REDCap e-CRF up to February 2021. Modified intention-to-treat (mITT) and time trend analyses were performed. Results: So far 29, 634 first-line empirical prescriptions from 31 European countries have been included. Overall, the most common prescribed treatments in the 2013-20 were triple therapies ; however, a shift in antibiotic regimens was identified. Triple therapies decreased from over 50% of prescription in 2013/15 to less than 20% in 2018/20. Non- bismuth concomitant therapy use decreased from 21% in 2013/14 to 13% in 2019/20, while Pylera® increased from 0-1% in 2014/2015 to 19% in 2019/20. An increase in the average duration of treatments from 11 to 13 days in 2013-2020, and of the daily dose of PPI, was identified (No trend was identified (data now shown) ; however, there was an 8% overall improvement in first-line mITT overall effectiveness from 2013 to 2020 (Table 1). Conclusions: European gastroenterological practice is constantly adapting to the newest published evidence and recommendations (reducing the use of triple therapies and increasing the duration of treatment and the dose of PPIs), with a subsequent progressive improvement in overall effectiveness.
- Published
- 2021
4. First-line empirical H. pylori eradication therapy in Europe: results from 30,000 cases of the European registry on H. pylori management (HP- EUREG)
- Author
-
O. P. Nyssen, Á. Pérez-Aísa, D. Vaira, L. Jonaitis, B. Tepeš, A. Keco-Huerga, M. Castro Fernández, A. Lucendo, D. Bordin, N. Brglez Jurecic, L. Vologzhanina, M. Caldas, G. Fadieienko, R. Abdulkhakov, L. Bujanda, M. Leja, M. Romano, S. Georgopoulos, V. Ntouli, Ante Tonkić, H. Simsek, A. Gasbarrini, G. M Buzas, P. Phull, M. Venerito, P. Malfertheiner, J. Kupčinskas, G. Babayeva, O. Shvets, F. Lerang, R. Marcos Pinto, T. Rokkas, I. Ilkay Simsek, S. Smith, Y. Niv, D. Lamarque, F. Heluwaert, A. Goldis, W. Marlicz, V. Milivojevic, L. Boyanova, L. Kunovský, V. Lamy, C. Beglinger, P. Bytzer, L. Capelle, I. Puig, F. Mégraud, C. O’Morain, J. P. Gisbert
- Subjects
Helicobacter pylori ,eradication therapy - Abstract
Introduction: The best approach for Helicobacter pylori management re-mains unclear. An audit process is essential to ensure clinical practice is aligned with best standards of care.Aims & Methods: International multicentre prospective non-intervention-al registry starting in 2013 aimed to evaluate the decisions and outcomes in H. pylori management by European gastroenterologists. Patients were registered in an e-CRF by AEG-REDCap up to February 2021. Variables in-cluded: demographics, previous eradication attempts, prescribed treat-ment, adverse events, and outcomes. Modified intention-to-treat (mITT) and per-protocol (PP) analyses were performed and data were subject to quality review to ensure information reliability.Results: In total 41, 562 patients from 31 European countries were evaluat-ed and 29, 634 (70%) first-line empirical H. pylori treatments were included for analysis. Triple therapy with amoxicillin and clarithromycin was most commonly prescribed (39%), followed by non-bismuth concomitant treat-ment (18%) and bismuth quadruple (three-in-one single capsule) (12%), achieving 84%, 90% and 94% mITT eradication rate, respectively. Over 90% effectiveness was obtained only with 10 and 14-day bismuth quadru- ple or with 14-day concomitant treatment (Table). Longer treatment dura-tion, higher acid inhibition and compliance were associated with higher eradication rates. Conclusion: Management of H. pylori infection by European gastroenter-ologists is heterogeneous. Only quadruple therapies lasting at least ten days are able to achieve over 90% eradication rates.
- Published
- 2021
5. Closely related Lak megaphages replicate in the microbiomes of diverse animals
- Author
-
Crisci, MA, Chen, L-X, Devoto, AE, Borges, AL, Bordin, N, Sachdeva, R, Tett, A, Sharrar, AM, Segata, N, Debenedetti, F, Bailey, M, Burt, R, Wood, RM, Rowden, LJ, Corsini, PM, van Winden, S, Holmes, MA, Lei, S, Banfield, JF, Santini, JM, Crisci, MA, Chen, L-X, Devoto, AE, Borges, AL, Bordin, N, Sachdeva, R, Tett, A, Sharrar, AM, Segata, N, Debenedetti, F, Bailey, M, Burt, R, Wood, RM, Rowden, LJ, Corsini, PM, van Winden, S, Holmes, MA, Lei, S, Banfield, JF, and Santini, JM
- Abstract
Lak phages with alternatively coded ∼540 kbp genomes were recently reported to replicate in Prevotella in microbiomes of humans that consume a non-Western diet, baboons, and pigs. Here, we explore Lak phage diversity and broader distribution using diagnostic polymerase chain reaction and genome-resolved metagenomics. Lak phages were detected in 13 animal types, including reptiles, and are particularly prevalent in pigs. Tracking Lak through the pig gastrointestinal tract revealed significant enrichment in the hindgut compared to the foregut. We reconstructed 34 new Lak genomes, including six curated complete genomes, all of which are alternatively coded. An anomalously large (∼660 kbp) complete genome reconstructed for the most deeply branched Lak from a horse microbiome is also alternatively coded. From the Lak genomes, we identified proteins associated with specific animal species; notably, most have no functional predictions. The presence of closely related Lak phages in diverse animals indicates facile distribution coupled to host-specific adaptation.
- Published
- 2021
6. SARS-CoV-2 spike protein predicted to form complexes with host receptor protein orthologues from a broad range of mammals
- Author
-
Lam, SD, primary, Bordin, N, additional, Waman, VP, additional, Scholes, HM, additional, Ashford, P, additional, Sen, N, additional, van Dorp, L, additional, Rauer, C, additional, Dawson, NL, additional, Pang, CSM, additional, Abbasian, M, additional, Sillitoe, I, additional, Edwards, SJL, additional, Fraternali, F, additional, Lees, JG, additional, Santini, JM, additional, and Orengo, CA, additional
- Published
- 2020
- Full Text
- View/download PDF
7. p-Si Based Bifacial Solar Cell with Improved PERT Structure
- Author
-
Eisenberg, Y., Arumughan, J., Kreinin, L., Bordin, N., and Eisenberg, N.
- Subjects
Manufacturing & Production ,Silicon Photovoltaics - Abstract
33rd European Photovoltaic Solar Energy Conference and Exhibition; 943-946, Photovoltaic and recombination characteristics of new generation bifacial solar cells are demonstrated. The cells were fabricated using 6" pseudo square wafers of 5 to 6 .cm single crystalline Cz p-Si. Their front efficiency is exceeding 20 % with back to front short circuit current ratio 89 - 92%. The design is characterized by high bulk minority carrier lifetime above 0.5 ms, not degraded during the fabrication process. Rear p+ layer was prepared by controllable doping process using preliminary deposited thin B contained solid layer. Effective back surface recombination is below 10 cm/s. Measured implied Voc values of the PERT structure exceeding 700 mV are evidencing the intrinsic potential of the structure to provide the cell efficiency far exceeding 22 %. Textured front surface in combination with smoothly etched back provides effective light trapping. The equivalent efficiency of a bifacial solar cell, which characterizes its energy generation capability, will achieve values in the range 23 – 27 % as a function of use conditions.
- Published
- 2017
- Full Text
- View/download PDF
8. Effective Surface Recombination of p+-Layer in p-Type Silicon PERT Bifacial Cell
- Author
-
Eisenberg, Y., Kreinin, L., Bordin, N., Eisenberg, N., Grigorieva, G., Kagan, M., and Hava, S.
- Subjects
Silicon Solar Cells Improvements and Innovation ,Wafer-Based Silicon Solar Cells and Materials Technology - Abstract
32nd European Photovoltaic Solar Energy Conference and Exhibition; 711-714, Boron doped BSF is a critically important factor in high efficiency p-Si bifacial solar cell design. Recombination losses represented as effective surface recombination, Seff, are evaluated by simulation and experimentally. For thick heavily doped p+ layers Seff is determined mainly by recombination inside the layer with positive effect of top over doped region. Spectral response analysis of n+-p-p+ solar cells with highly doped and deep BSF (~1.6 m) demonstrates a possibility of providing Seff of 75±20 cm/s without surface passivation. For thinner p+ layers surface recombination is a critical factor. n-Si wafers symmetrically doped by boron ion-implantation or by boron thermal diffusion were used for experimental validation of the simulated surface recombination contribution to Seff and its dependence on the charge trapped in the passivation layer. The recombination parameters of the samples were measured using the photoconductivity decay method.
- Published
- 2016
- Full Text
- View/download PDF
9. Design of a Bifacial Si Solar Cell with Uniformly Doped B Implanted and P Thermal Diffused Layers
- Author
-
Eisenberg, Y., Grigorieva, G., Kagan, M., Zviagina, K., Kreinin, L., Bordin, N., Eisenberg, N., and Hava, S.
- Subjects
WAFER-BASED SILICON SOLAR CELLS AND MATERIALS TECHNOLOGY ,Silicon Solar Cell Improvements - Abstract
29th European Photovoltaic Solar Energy Conference and Exhibition; 950-953, Combined thermal diffusion – ion implantation technology, which was extensively used for n+-p-p+ space Si cell fabrication, can be applied also for terrestrial cell production. Starting Si of both p- and n- conductivity types can be used. The drawback of B ion implantation as a doping process for p+ layer formation is the introduction of a defect layer behind the implanted layer that can result in additional recombination losses. The effect of 30 keV B ion implantation and subsequent annealing was studied. Back IQE analysis of n+-p-p+ cells reveals the main factors affecting the formation of the defect layer and its influence on solar cell base recombination. Among these factors are: implantation dose, starting Si parameters and injection level during the measurements. Decrease of implantation dose and increase of annealing temperature suppress the formation of the defect layer. Experiments with annealing temperature in the range 910 - 1015 oC were carried out. SiN cap layer as a tool for decrease of out- diffusion can be used for rminimizing the implantation dose. The depth profiles of electro active B atoms after annealing at high temperatures are quite deep, and due to the back junction n+-n-p+ cell design was found as preferable. This is demonstrated by simulation of front and back IQE of the cell confirmed by measurement results. According to simulation the effective surface recombination at n+-n high-low barrier can be improved by using high resistivity Si (above ~ 5 .cm). Thermal oxidation was tested as a relative simple and controllable process for the p+ layer passivation. The influence of the positive charge which is typical for a silicon oxide film was estimated. According to simulation, the surface doping concentration is the dominating parameter influencing the effective surface recombination. The charge in the passivation layer is not influencing significantly when surface doping concentration is relatively high (above ~1019 cm-3)
- Published
- 2014
- Full Text
- View/download PDF
10. Industrially Fabricated Bifacial Si Solar Cells with n+-p-p+ Structure
- Author
-
Kreinin, L., Bordin, N., Eisenberg, N., Grabitz, P., and Wahl, G.
- Subjects
WAFER-BASED SILICON SOLAR CELLS AND MATERIALS TECHNOLOGY ,Silicon Solar Cell Improvements - Abstract
28th European Photovoltaic Solar Energy Conference and Exhibition; 1835-1838, Photovoltaic and recombination properties of bifacial solar cells are analyzed. The cells were fabricated in a pilot production line using 6" pseudosquare wafers of 3 to 6 .cm single crystalline Cz Si. Their front efficiency is in the range 18.1 – 18.8 % with back to front short circuit current ratio 74 - 79 %. The design is characterized by high bulk minority carrier lifetime 0.2 – 1 ms, not degraded during the fabrication process, when good starting wafers were used. Effective back surface recombination is in the range 55 - 95 cm/s. Light trapping due to high back internal reflection contributes to current improvement compared to a regular mono facial cell with Al alloyed back. Equivalent efficiency as a quality criteria of a bifacial solar cell is proposed. Routine adjustment of fabrication process promise the increase of average front efficiency above 19 % and equivalent efficiency above 24 %.
- Published
- 2013
- Full Text
- View/download PDF
11. Experimental Analysis of the Increases in Energy Generation of Bifacial Over Mono-Facial PV Modules
- Author
-
Kreinin, L., Bordin, N., Karsenty, A., Drori, A., and Eisenberg, N.
- Subjects
PV Modules ,Components for PV Systems - Abstract
26th European Photovoltaic Solar Energy Conference and Exhibition; 3140-3143, I-V characteristics of bifacial and mono-facial modules in the test roof-top PV field in Jerusalem were monitored simultaneously. Analysis of factors affecting the energy gains due to contribution of bifacial module back is based on time-of-day, daily and monthly energy generation data. Among these factors: diffuse to global radiation ratio, season and time-of-day Sun position. Daily energy gain of an in-field bifacial vs. a mono-facial module at underlying surface albedo ~0.50 varies in the range 5 – 38 %. An evaluated equivalent efficiency of 21 % is based on the measured yearly gain for bifacial solar cells with a front efficiency 18 %. Optimization of the PV field design should increase these values.
- Published
- 2011
- Full Text
- View/download PDF
12. Efficiency of Bifacial Si Solar Cells at Low Irradiance. Effect of Design and Fabrication Technology Factors
- Author
-
Grigorieva, G., Kagan, M., Nekrasov, V., Zviagina, K., Kreinin, L., Bordin, N., Broder, J., Eisenberg, Y., and Eisenberg, N.
- Subjects
Mono- and Multicrystalline Silicon Materials and Cells ,Wafer-Based Silicon Solar Cells and Materials Technology - Abstract
25th European Photovoltaic Solar Energy Conference and Exhibition / 5th World Conference on Photovoltaic Energy Conversion, 6-10 September 2010, Valencia, Spain; 1805-1809, There are applications of solar cells under illumination conditions for lower than "one sun". Solar cell efficiency decreases with decreasing irradiance. Several factors affecting conversion efficiency at low irradiances (down to ~0.01 sun) are studied. Experiments were performed with bifacial n+-p-p+ Si solar cells, fabricated using an ion implantation procedure for BSF formation. Optimal resistivity of the starting Si is evaluated taking into consideration recombination in the ion induced defect layer. Use of high resistivity Si eliminates the influence of this defect layer on recombination in the base region. Lifetime dependence on injection level can cause current non linearity and therefore an additional efficiency drop at low irradiance. Light induced defects can be responsible for this effect. The quality of the p-n junction as expressed by the ideality factors of the two-diode model is one of the most important factors affecting solar cell conversion efficiency at low light intensities.
- Published
- 2010
- Full Text
- View/download PDF
13. Future of Bifacial Si Solar Cells for Space Applications
- Author
-
Grigorieva, G., Kagan, M., Zviagina, K.N., Kalikauskas, V.S., Kreinin, L., Bordin, N., and Eisenberg, N.
- Subjects
Solar Cells, Modules and PV Systems for Space Applications ,Advanced Photovoltaics - Abstract
23rd European Photovoltaic Solar Energy Conference and Exhibition, 1-5 September 2008, Valencia, Spain; 756-761, The ability to generate photocurrent under front as well as back illumination makes the bifacial solar cells very promising for low earth orbits (LEO) where the earth’s albedo is significant. Calculations show a 11–41% increase in light power received by cells due to illumination of the solar cell backs. Besides back illumination, an additional contribution in output increase results from the cells transparency in the IR range which decreases equilibrium working temperature. Therefore bifacial Si solar cells are able to supply 18-37 % more energy than regular Si cells. Bifacial cells with an n+-p-p+ structure, are experimentally tested and used for several space vehicle power supplies. Proven energy generation increases of 10-20 % are observed when using bifacial Si solar cells instead of regular Si solar cells. A further increase of energy generation can be expected if bifacial Si cells with improved parameters are used. The advantage of the bifacial solar cell may be characterized by its “equivalent efficiency” which is equal to the efficiency of a regular Si solar cell able to generate the same energy. The “equivalent efficiency” of bifacial Si solar cell in a typical LEO space craft feathered solar array tracking mode, can achieve 25 %.
- Published
- 2008
- Full Text
- View/download PDF
14. Recombination Parameters of Si Solar Cells with Back Surface Field Formed by ION Implantation
- Author
-
Bordin, N., primary, Kreinin, L., additional, Broder, J., additional, Eisenberg, N., additional, Grigorieva, G., additional, Zvyagina, K., additional, and Kagan, M., additional
- Published
- 2006
- Full Text
- View/download PDF
15. PV module power gain due to bifacial design. Preliminary experimental and simulation data.
- Author
-
Kreinin, L., Bordin, N., Karsenty, A., Drori, A., Grobgeld, D., and Eisenberg, N.
- Published
- 2010
- Full Text
- View/download PDF
16. A novel method for determining bulk diffusion length in bifacial silicon solar cells.
- Author
-
Kreinin, L., Bordin, N., and Eisenberg, N.
- Published
- 2000
- Full Text
- View/download PDF
17. Fast determination of voriconazole in oral fluid using microextraction by packed sorbent and HPLC with fluorescence detection
- Author
-
Bordin, N. A., Antunes, M. V., Spaniol, B., Andreolla, H. F., Pasqualotto, A. C., and Rafael Linden
18. A novel method for determining bulk diffusion length in bifacial silicon solar cells
- Author
-
Kreinin, L., primary, Bordin, N., additional, and Eisenberg, N., additional
- Full Text
- View/download PDF
19. The JCT buried BSF silicon solar cell a model of simplicity and high efficiency.
- Author
-
Mandelkorn, J., Broder, J., Kreinin, L., Bordin, N., and Eisenberg, N.P.
- Published
- 1994
- Full Text
- View/download PDF
20. Exploring structural diversity across the protein universe with The Encyclopedia of Domains.
- Author
-
Lau AM, Bordin N, Kandathil SM, Sillitoe I, Waman VP, Wells J, Orengo CA, and Jones DT
- Subjects
- Databases, Protein, Protein Domains, Protein Folding, Proteins chemistry, Deep Learning
- Abstract
The AlphaFold Protein Structure Database (AFDB) contains more than 214 million predicted protein structures composed of domains, which are independently folding units found in multiple structural and functional contexts. Identifying domains can enable many functional and evolutionary analyses but has remained challenging because of the sheer scale of the data. Using deep learning methods, we have detected and classified every domain in the AFDB, producing The Encyclopedia of Domains. We detected nearly 365 million domains, over 100 million more than can be found by sequence methods, covering more than 1 million taxa. Reassuringly, 77% of the nonredundant domains are similar to known superfamilies, greatly expanding representation of their domain space. We uncovered more than 10,000 new structural interactions between superfamilies and thousands of new folds across the fold space continuum.
- Published
- 2024
- Full Text
- View/download PDF
21. Quest for Orthologs in the Era of Biodiversity Genomics.
- Author
-
Langschied F, Bordin N, Cosentino S, Fuentes-Palacios D, Glover N, Hiller M, Hu Y, Huerta-Cepas J, Coelho LP, Iwasaki W, Majidian S, Manzano-Morales S, Persson E, Richards TA, Gabaldón T, Sonnhammer E, Thomas PD, Dessimoz C, and Ebersberger I
- Subjects
- Animals, Evolution, Molecular, Molecular Sequence Annotation, Computational Biology methods, Genomics methods, Biodiversity
- Abstract
The era of biodiversity genomics is characterized by large-scale genome sequencing efforts that aim to represent each living taxon with an assembled genome. Generating knowledge from this wealth of data has not kept up with this pace. We here discuss major challenges to integrating these novel genomes into a comprehensive functional and evolutionary network spanning the tree of life. In summary, the expanding datasets create a need for scalable gene annotation methods. To trace gene function across species, new methods must seek to increase the resolution of ortholog analyses, e.g. by extending analyses to the protein domain level and by accounting for alternative splicing. Additionally, the scope of orthology prediction should be pushed beyond well-investigated proteomes. This demands the development of specialized methods for the identification of orthologs to short proteins and noncoding RNAs and for the functional characterization of novel gene families. Furthermore, protein structures predicted by machine learning are now readily available, but this new information is yet to be integrated with orthology-based analyses. Finally, an increasing focus should be placed on making orthology assignments adhere to the findable, accessible, interoperable, and reusable (FAIR) principles. This fosters green bioinformatics by avoiding redundant computations and helps integrating diverse scientific communities sharing the need for comparative genetics and genomics information. It should also help with communicating orthology-related concepts in a format that is accessible to the public, to counteract existing misinformation about evolution., (© The Author(s) 2024. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.)
- Published
- 2024
- Full Text
- View/download PDF
22. CATH 2024: CATH-AlphaFlow Doubles the Number of Structures in CATH and Reveals Nearly 200 New Folds.
- Author
-
Waman VP, Bordin N, Alcraft R, Vickerstaff R, Rauer C, Chan Q, Sillitoe I, Yamamori H, and Orengo C
- Subjects
- Proteins chemistry, Proteins metabolism, Protein Conformation, Models, Molecular, Computational Biology methods, Protein Domains, Animals, Software, Humans, Databases, Protein, Protein Folding
- Abstract
CATH (https://www.cathdb.info) classifies domain structures from experimental protein structures in the PDB and predicted structures in the AlphaFold Database (AFDB). To cope with the scale of the predicted data a new NextFlow workflow (CATH-AlphaFlow), has been developed to classify high-quality domains into CATH superfamilies and identify novel fold groups and superfamilies. CATH-AlphaFlow uses a novel state-of-the-art structure-based domain boundary prediction method (ChainSaw) for identifying domains in multi-domain proteins. We applied CATH-AlphaFlow to process PDB structures not classified in CATH and AFDB structures from 21 model organisms, expanding CATH by over 100%. Domains not classified in existing CATH superfamilies or fold groups were used to seed novel folds, giving 253 new folds from PDB structures (September 2023 release) and 96 from AFDB structures of proteomes of 21 model organisms. Where possible, functional annotations were obtained using (i) predictions from publicly available methods (ii) annotations from structural relatives in AFDB/UniProt50. We also predicted functional sites and highly conserved residues. Some folds are associated with important functions such as photosynthetic acclimation (in flowering plants), iron permease activity (in fungi) and post-natal spermatogenesis (in mice). CATH-AlphaFlow will allow us to identify many more CATH relatives in the AFDB, further characterising the protein structure landscape., Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2024 The Author(s). Published by Elsevier Ltd.. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
23. Clustering protein functional families at large scale with hierarchical approaches.
- Author
-
Bordin N, Scholes H, Rauer C, Roca-Martínez J, Sillitoe I, and Orengo C
- Subjects
- Cluster Analysis, Computational Biology methods, Protein Domains, Proteins chemistry, Proteins metabolism, Algorithms, Databases, Protein
- Abstract
Proteins, fundamental to cellular activities, reveal their function and evolution through their structure and sequence. CATH functional families (FunFams) are coherent clusters of protein domain sequences in which the function is conserved across their members. The increasing volume and complexity of protein data enabled by large-scale repositories like MGnify or AlphaFold Database requires more powerful approaches that can scale to the size of these new resources. In this work, we introduce MARC and FRAN, two algorithms developed to build upon and address limitations of GeMMA/FunFHMMER, our original methods developed to classify proteins with related functions using a hierarchical approach. We also present CATH-eMMA, which uses embeddings or Foldseek distances to form relationship trees from distance matrices, reducing computational demands and handling various data types effectively. CATH-eMMA offers a highly robust and much faster tool for clustering protein functions on a large scale, providing a new tool for future studies in protein function and evolution., (© 2024 The Author(s). Protein Science published by Wiley Periodicals LLC on behalf of The Protein Society.)
- Published
- 2024
- Full Text
- View/download PDF
24. Predicting human and viral protein variants affecting COVID-19 susceptibility and repurposing therapeutics.
- Author
-
Waman VP, Ashford P, Lam SD, Sen N, Abbasian M, Woodridge L, Goldtzvik Y, Bordin N, Wu J, Sillitoe I, and Orengo CA
- Subjects
- Humans, Drug Repositioning, Viral Proteins genetics, Viral Proteins metabolism, Protein Binding, Genetic Predisposition to Disease, Disease Susceptibility, COVID-19 Drug Treatment, SARS-CoV-2 genetics, SARS-CoV-2 immunology, COVID-19 genetics, COVID-19 virology, COVID-19 immunology, Mutation, Missense
- Abstract
The COVID-19 disease is an ongoing global health concern. Although vaccination provides some protection, people are still susceptible to re-infection. Ostensibly, certain populations or clinical groups may be more vulnerable. Factors causing these differences are unclear and whilst socioeconomic and cultural differences are likely to be important, human genetic factors could influence susceptibility. Experimental studies indicate SARS-CoV-2 uses innate immune suppression as a strategy to speed-up entry and replication into the host cell. Therefore, it is necessary to understand the impact of variants in immunity-associated human proteins on susceptibility to COVID-19. In this work, we analysed missense coding variants in several SARS-CoV-2 proteins and their human protein interactors that could enhance binding affinity to SARS-CoV-2. We curated a dataset of 19 SARS-CoV-2: human protein 3D-complexes, from the experimentally determined structures in the Protein Data Bank and models built using AlphaFold2-multimer, and analysed the impact of missense variants occurring in the protein-protein interface region. We analysed 468 missense variants from human proteins and 212 variants from SARS-CoV-2 proteins and computationally predicted their impacts on binding affinities for the human viral protein complexes. We predicted a total of 26 affinity-enhancing variants from 13 human proteins implicated in increased binding affinity to SARS-CoV-2. These include key-immunity associated genes (TOMM70, ISG15, IFIH1, IFIT2, RPS3, PALS1, NUP98, AXL, ARF6, TRIMM, TRIM25) as well as important spike receptors (KREMEN1, AXL and ACE2). We report both common (e.g., Y13N in IFIH1) and rare variants in these proteins and discuss their likely structural and functional impact, using information on known and predicted functional sites. Potential mechanisms associated with immune suppression implicated by these variants are discussed. Occurrence of certain predicted affinity-enhancing variants should be monitored as they could lead to increased susceptibility and reduced immune response to SARS-CoV-2 infection in individuals/populations carrying them. Our analyses aid in understanding the potential impact of genetic variation in immunity-associated proteins on COVID-19 susceptibility and help guide drug-repurposing strategies., (© 2024. Crown.)
- Published
- 2024
- Full Text
- View/download PDF
25. Chainsaw: protein domain segmentation with fully convolutional neural networks.
- Author
-
Wells J, Hawkins-Hooker A, Bordin N, Sillitoe I, Paige B, and Orengo C
- Subjects
- Databases, Protein, Computational Biology methods, Software, Humans, Neural Networks, Computer, Proteins chemistry, Algorithms, Protein Domains
- Abstract
Motivation: Protein domains are fundamental units of protein structure and play a pivotal role in understanding folding, function, evolution, and design. The advent of accurate structure prediction techniques has resulted in an influx of new structural data, making the partitioning of these structures into domains essential for inferring evolutionary relationships and functional classification., Results: This article presents Chainsaw, a supervised learning approach to domain parsing that achieves accuracy that surpasses current state-of-the-art methods. Chainsaw uses a fully convolutional neural network which is trained to predict the probability that each pair of residues is in the same domain. Domain predictions are then derived from these pairwise predictions using an algorithm that searches for the most likely assignment of residues to domains given the set of pairwise co-membership probabilities. Chainsaw matches CATH domain annotations in 78% of protein domains versus 72% for the next closest method. When predicting on AlphaFold models, expert human evaluators were twice as likely to prefer Chainsaw's predictions versus the next best method., Availability and Implementation: github.com/JudeWells/Chainsaw., (© The Author(s) 2024. Published by Oxford University Press.)
- Published
- 2024
- Full Text
- View/download PDF
26. Large-scale clustering of AlphaFold2 3D models shines light on the structure and function of proteins.
- Author
-
Bordin N, Lau AM, and Orengo C
- Subjects
- Databases, Factual, Cluster Analysis
- Abstract
Two recent studies exploited ultra-fast structural aligners and deep-learning approaches to cluster the protein structure space in the AlphaFold Database. Barrio-Hernandez et al.
1 and Durairaj et al.2 uncovered fascinating new protein functions and structural features previously unknown., Competing Interests: Declaration of interests The authors declare no competing interests., (Copyright © 2023 Elsevier Inc. All rights reserved.)- Published
- 2023
- Full Text
- View/download PDF
27. Broad functional profiling of fission yeast proteins using phenomics and machine learning.
- Author
-
Rodríguez-López M, Bordin N, Lees J, Scholes H, Hassan S, Saintain Q, Kamrad S, Orengo C, and Bähler J
- Subjects
- Humans, Phenomics, Phenotype, Machine Learning, Schizosaccharomyces pombe Proteins genetics, Schizosaccharomyces genetics
- Abstract
Many proteins remain poorly characterized even in well-studied organisms, presenting a bottleneck for research. We applied phenomics and machine-learning approaches with Schizosaccharomyces pombe for broad cues on protein functions. We assayed colony-growth phenotypes to measure the fitness of deletion mutants for 3509 non-essential genes in 131 conditions with different nutrients, drugs, and stresses. These analyses exposed phenotypes for 3492 mutants, including 124 mutants of 'priority unstudied' proteins conserved in humans, providing varied functional clues. For example, over 900 proteins were newly implicated in the resistance to oxidative stress. Phenotype-correlation networks suggested roles for poorly characterized proteins through 'guilt by association' with known proteins. For complementary functional insights, we predicted Gene Ontology (GO) terms using machine learning methods exploiting protein-network and protein-homology data (NET-FF). We obtained 56,594 high-scoring GO predictions, of which 22,060 also featured high information content. Our phenotype-correlation data and NET-FF predictions showed a strong concordance with existing PomBase GO annotations and protein networks, with integrated analyses revealing 1675 novel GO predictions for 783 genes, including 47 predictions for 23 priority unstudied proteins. Experimental validation identified new proteins involved in cellular aging, showing that these predictions and phenomics data provide a rich resource to uncover new protein functions., Competing Interests: MR, NB, JL, HS, SH, QS, SK, CO, JB No competing interests declared, (© 2023, Rodríguez-López, Bordin et al.)
- Published
- 2023
- Full Text
- View/download PDF
28. The opportunities and challenges posed by the new generation of deep learning-based protein structure predictors.
- Author
-
Varadi M, Bordin N, Orengo C, and Velankar S
- Subjects
- Computational Biology methods, Proteins chemistry, Amino Acid Sequence, Deep Learning
- Abstract
The function of proteins can often be inferred from their three-dimensional structures. Experimental structural biologists spent decades studying these structures, but the accelerated pace of protein sequencing continuously increases the gaps between sequences and structures. The early 2020s saw the advent of a new generation of deep learning-based protein structure prediction tools that offer the potential to predict structures based on any number of protein sequences. In this review, we give an overview of the impact of this new generation of structure prediction tools, with examples of the impacted field in the life sciences. We discuss the novel opportunities and new scientific and technical challenges these tools present to the broader scientific community. Finally, we highlight some potential directions for the future of computational protein structure prediction., Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 The Author(s). Published by Elsevier Ltd.. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
29. Novel machine learning approaches revolutionize protein knowledge.
- Author
-
Bordin N, Dallago C, Heinzinger M, Kim S, Littmann M, Rauer C, Steinegger M, Rost B, and Orengo C
- Subjects
- Computational Biology methods, Protein Conformation, Proteins chemistry, Machine Learning
- Abstract
Breakthrough methods in machine learning (ML), protein structure prediction, and novel ultrafast structural aligners are revolutionizing structural biology. Obtaining accurate models of proteins and annotating their functions on a large scale is no longer limited by time and resources. The most recent method to be top ranked by the Critical Assessment of Structure Prediction (CASP) assessment, AlphaFold 2 (AF2), is capable of building structural models with an accuracy comparable to that of experimental structures. Annotations of 3D models are keeping pace with the deposition of the structures due to advancements in protein language models (pLMs) and structural aligners that help validate these transferred annotations. In this review we describe how recent developments in ML for protein science are making large-scale structural bioinformatics available to the general scientific community., Competing Interests: Declaration of interests No interests are declared., (Copyright © 2022 The Author(s). Published by Elsevier Ltd.. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
30. AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms.
- Author
-
Bordin N, Sillitoe I, Nallapareddy V, Rauer C, Lam SD, Waman VP, Sen N, Heinzinger M, Littmann M, Kim S, Velankar S, Steinegger M, Rost B, and Orengo C
- Subjects
- Humans, Databases, Protein, Furylfuramide, Proteins chemistry
- Abstract
Deep-learning (DL) methods like DeepMind's AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique 'global' folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
31. KinFams: De-Novo Classification of Protein Kinases Using CATH Functional Units.
- Author
-
Adeyelu T, Bordin N, Waman VP, Sadlej M, Sillitoe I, Moya-Garcia AA, and Orengo CA
- Subjects
- Humans, Databases, Protein, Sequence Homology, Amino Acid, Protein Kinases metabolism, Proteins chemistry
- Abstract
Protein kinases are important targets for treating human disorders, and they are the second most targeted families after G-protein coupled receptors. Several resources provide classification of kinases into evolutionary families (based on sequence homology); however, very few systematically classify functional families (FunFams) comprising evolutionary relatives that share similar functional properties. We have developed the FunFam-MARC (Multidomain ARchitecture-based Clustering) protocol, which uses multi-domain architectures of protein kinases and specificity-determining residues for functional family classification. FunFam-MARC predicts 2210 kinase functional families (KinFams), which have increased functional coherence, in terms of EC annotations, compared to the widely used KinBase classification. Our protocol provides a comprehensive classification for kinase sequences from >10,000 organisms. We associate human KinFams with diseases and drugs and identify 28 druggable human KinFams, i.e., enriched in clinically approved drugs. Since relatives in the same druggable KinFam tend to be structurally conserved, including the drug-binding site, these KinFams may be valuable for shortlisting therapeutic targets. Information on the human KinFams and associated 3D structures from AlphaFold2 are provided via our CATH FTP website and Zenodo. This gives the domain structure representative of each KinFam together with information on any drug compounds available. For 32% of the KinFams, we provide information on highly conserved residue sites that may be associated with specificity.
- Published
- 2023
- Full Text
- View/download PDF
32. CATHe: detection of remote homologues for CATH superfamilies using embeddings from protein language models.
- Author
-
Nallapareddy V, Bordin N, Sillitoe I, Heinzinger M, Littmann M, Waman VP, Sen N, Rost B, and Orengo C
- Subjects
- Humans, Sequence Homology, Amino Acid, Databases, Protein, Proteins chemistry, Algorithms
- Abstract
Motivation: CATH is a protein domain classification resource that exploits an automated workflow of structure and sequence comparison alongside expert manual curation to construct a hierarchical classification of evolutionary and structural relationships. The aim of this study was to develop algorithms for detecting remote homologues missed by state-of-the-art hidden Markov model (HMM)-based approaches. The method developed (CATHe) combines a neural network with sequence representations obtained from protein language models. It was assessed using a dataset of remote homologues having less than 20% sequence identity to any domain in the training set., Results: The CATHe models trained on 1773 largest and 50 largest CATH superfamilies had an accuracy of 85.6 ± 0.4% and 98.2 ± 0.3%, respectively. As a further test of the power of CATHe to detect more remote homologues missed by HMMs derived from CATH domains, we used a dataset consisting of protein domains that had annotations in Pfam, but not in CATH. By using highly reliable CATHe predictions (expected error rate <0.5%), we were able to provide CATH annotations for 4.62 million Pfam domains. For a subset of these domains from Homo sapiens, we structurally validated 90.86% of the predictions by comparing their corresponding AlphaFold2 structures with structures from the CATH superfamilies to which they were assigned., Availability and Implementation: The code for the developed models is available on https://github.com/vam-sin/CATHe, and the datasets developed in this study can be accessed on https://zenodo.org/record/6327572., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2023. Published by Oxford University Press.)
- Published
- 2023
- Full Text
- View/download PDF
33. Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs.
- Author
-
Sen N, Anishchenko I, Bordin N, Sillitoe I, Velankar S, Baker D, and Orengo C
- Subjects
- Databases, Protein, Humans, Models, Molecular, Mutation, Mutation, Missense, Proteins chemistry, Proteins genetics
- Abstract
Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Databank. We noticed that the model quality was higher and the Root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein-protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Usage of models from the two state-of-the-art techniques provide better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2022
- Full Text
- View/download PDF
34. Contrastive learning on protein embeddings enlightens midnight zone.
- Author
-
Heinzinger M, Littmann M, Sillitoe I, Bordin N, Orengo C, and Rost B
- Abstract
Experimental structures are leveraged through multiple sequence alignments, or more generally through homology-based inference (HBI), facilitating the transfer of information from a protein with known annotation to a query without any annotation. A recent alternative expands the concept of HBI from sequence-distance lookup to embedding-based annotation transfer (EAT). These embeddings are derived from protein Language Models (pLMs). Here, we introduce using single protein representations from pLMs for contrastive learning. This learning procedure creates a new set of embeddings that optimizes constraints captured by hierarchical classifications of protein 3D structures defined by the CATH resource. The approach, dubbed ProtTucker , has an improved ability to recognize distant homologous relationships than more traditional techniques such as threading or fold recognition. Thus, these embeddings have allowed sequence comparison to step into the 'midnight zone' of protein similarity, i.e. the region in which distantly related sequences have a seemingly random pairwise sequence similarity. The novelty of this work is in the particular combination of tools and sampling techniques that ascertained good performance comparable or better to existing state-of-the-art sequence comparison methods. Additionally, since this method does not need to generate alignments it is also orders of magnitudes faster. The code is available at https://github.com/Rostlab/EAT., (© The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)
- Published
- 2022
- Full Text
- View/download PDF
35. Detection and enumeration of Lak megaphages in microbiome samples by endpoint and quantitative PCR.
- Author
-
Crisci MA, Corsini PM, Bordin N, Chen LX, Banfield JF, and Santini JM
- Subjects
- Animals, Prevotella genetics, Real-Time Polymerase Chain Reaction methods, Bacteriophages genetics, Gastrointestinal Microbiome, Microbiota genetics
- Abstract
Lak megaphages are prevalent across diverse gut microbiomes and may potentially impact animal and human health through lysis of Prevotella . Given their large genome size (up to 660 kbp), Lak megaphages are difficult to culture, and their identification relies on molecular techniques. Here, we present optimized protocols for identifying Lak phages in various microbiome samples, including procedures for DNA extraction, followed by detection and quantification of genes encoding Lak structural proteins using diagnostic endpoint and SYBR green-based quantitative PCR, respectively. For complete details on the use and execution of this protocol, please refer to Crisci et al., (2021)., Competing Interests: The authors of this paper declare no competing interests., (© 2021 The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
36. Clustering FunFams using sequence embeddings improves EC purity.
- Author
-
Littmann M, Bordin N, Heinzinger M, Schütze K, Dallago C, Orengo C, and Rost B
- Abstract
Motivation: Classifying proteins into functional families can improve our understanding of protein function and can allow transferring annotations within one family. For this, functional families need to be 'pure', i.e., contain only proteins with identical function. Functional Families (FunFams) cluster proteins within CATH superfamilies into such groups of proteins sharing function. 11% of all FunFams (22 830 of 203 639) contain EC annotations and of those, 7% (1526 of 22 830) have inconsistent functional annotations., Results: We propose an approach to further cluster FunFams into functionally more consistent sub-families by encoding their sequences through embeddings. These embeddings originate from language models transferring knowledge gained from predicting missing amino acids in a sequence (ProtBERT) and have been further optimized to distinguish between proteins belonging to the same or a different CATH superfamily (PB-Tucker). Using distances between embeddings and DBSCAN to cluster FunFams and identify outliers, doubled the number of pure clusters per FunFam compared to random clustering. Our approach was not limited to FunFams but also succeeded on families created using sequence similarity alone. Complementing EC annotations, we observed similar results for binding annotations. Thus, we expect an increased purity also for other aspects of function. Our results can help generating FunFams; the resulting clusters with improved functional consistency allow more reliable inference of annotations. We expect this approach to succeed equally for any other grouping of proteins by their phenotypes., Availability and Implementation: Code and embeddings are available via GitHub: https://github.com/Rostlab/FunFamsClustering., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2021. Published by Oxford University Press.)
- Published
- 2021
- Full Text
- View/download PDF
37. SARS-CoV-2 structural coverage map reveals viral protein assembly, mimicry, and hijacking mechanisms.
- Author
-
O'Donoghue SI, Schafferhans A, Sikta N, Stolte C, Kaur S, Ho BK, Anderson S, Procter JB, Dallago C, Bordin N, Adcock M, and Rost B
- Subjects
- Amino Acid Transport Systems, Neutral chemistry, Amino Acid Transport Systems, Neutral genetics, Amino Acid Transport Systems, Neutral metabolism, Angiotensin-Converting Enzyme 2 chemistry, Angiotensin-Converting Enzyme 2 genetics, Binding Sites, COVID-19 genetics, COVID-19 metabolism, COVID-19 virology, Computational Biology methods, Coronavirus Envelope Proteins chemistry, Coronavirus Envelope Proteins genetics, Coronavirus Envelope Proteins metabolism, Coronavirus Nucleocapsid Proteins chemistry, Coronavirus Nucleocapsid Proteins genetics, Coronavirus Nucleocapsid Proteins metabolism, Humans, Mitochondrial Membrane Transport Proteins chemistry, Mitochondrial Membrane Transport Proteins genetics, Mitochondrial Membrane Transport Proteins metabolism, Mitochondrial Precursor Protein Import Complex Proteins, Models, Molecular, Molecular Mimicry, Neuropilin-1 chemistry, Neuropilin-1 genetics, Neuropilin-1 metabolism, Phosphoproteins chemistry, Phosphoproteins genetics, Phosphoproteins metabolism, Protein Binding, Protein Conformation, alpha-Helical, Protein Conformation, beta-Strand, Protein Interaction Domains and Motifs, Protein Interaction Mapping methods, Protein Multimerization, SARS-CoV-2 chemistry, SARS-CoV-2 genetics, Spike Glycoprotein, Coronavirus chemistry, Spike Glycoprotein, Coronavirus genetics, Viral Matrix Proteins chemistry, Viral Matrix Proteins genetics, Viral Matrix Proteins metabolism, Viroporin Proteins chemistry, Viroporin Proteins genetics, Viroporin Proteins metabolism, Virus Replication, Angiotensin-Converting Enzyme 2 metabolism, Host-Pathogen Interactions genetics, Protein Processing, Post-Translational, SARS-CoV-2 metabolism, Spike Glycoprotein, Coronavirus metabolism
- Abstract
We modeled 3D structures of all SARS-CoV-2 proteins, generating 2,060 models that span 69% of the viral proteome and provide details not available elsewhere. We found that ˜6% of the proteome mimicked human proteins, while ˜7% was implicated in hijacking mechanisms that reverse post-translational modifications, block host translation, and disable host defenses; a further ˜29% self-assembled into heteromeric states that provided insight into how the viral replication and translation complex forms. To make these 3D models more accessible, we devised a structural coverage map, a novel visualization method to show what is-and is not-known about the 3D structure of the viral proteome. We integrated the coverage map into an accompanying online resource (https://aquaria.ws/covid) that can be used to find and explore models corresponding to the 79 structural states identified in this work. The resulting Aquaria-COVID resource helps scientists use emerging structural data to understand the mechanisms underlying coronavirus infection and draws attention to the 31% of the viral proteome that remains structurally unknown or dark., (© 2021 The Authors. Published under the terms of the CC BY 4.0 license.)
- Published
- 2021
- Full Text
- View/download PDF
38. Closely related Lak megaphages replicate in the microbiomes of diverse animals.
- Author
-
Crisci MA, Chen LX, Devoto AE, Borges AL, Bordin N, Sachdeva R, Tett A, Sharrar AM, Segata N, Debenedetti F, Bailey M, Burt R, Wood RM, Rowden LJ, Corsini PM, van Winden S, Holmes MA, Lei S, Banfield JF, and Santini JM
- Abstract
Lak phages with alternatively coded ∼540 kbp genomes were recently reported to replicate in Prevotella in microbiomes of humans that consume a non-Western diet, baboons, and pigs. Here, we explore Lak phage diversity and broader distribution using diagnostic polymerase chain reaction and genome-resolved metagenomics. Lak phages were detected in 13 animal types, including reptiles, and are particularly prevalent in pigs. Tracking Lak through the pig gastrointestinal tract revealed significant enrichment in the hindgut compared to the foregut. We reconstructed 34 new Lak genomes, including six curated complete genomes, all of which are alternatively coded. An anomalously large (∼660 kbp) complete genome reconstructed for the most deeply branched Lak from a horse microbiome is also alternatively coded. From the Lak genomes, we identified proteins associated with specific animal species; notably, most have no functional predictions. The presence of closely related Lak phages in diverse animals indicates facile distribution coupled to host-specific adaptation., Competing Interests: The authors declare no competing interests., (© 2021 The Authors.)
- Published
- 2021
- Full Text
- View/download PDF
39. Tracing Evolution Through Protein Structures: Nature Captured in a Few Thousand Folds.
- Author
-
Bordin N, Sillitoe I, Lees JG, and Orengo C
- Abstract
This article is dedicated to the memory of Cyrus Chothia, who was a leading light in the world of protein structure evolution. His elegant analyses of protein families and their mechanisms of structural and functional evolution provided important evolutionary and biological insights and firmly established the value of structural perspectives. He was a mentor and supervisor to many other leading scientists who continued his quest to characterise structure and function space. He was also a generous and supportive colleague to those applying different approaches. In this article we review some of his accomplishments and the history of protein structure classifications, particularly SCOP and CATH. We also highlight some of the evolutionary insights these two classifications have brought. Finally, we discuss how the expansion and integration of protein sequence data into these structural families helps reveal the dark matter of function space and can inform the emergence of novel functions in Metazoa. Since we cover 25 years of structural classification, it has not been feasible to review all structure based evolutionary studies and hence we focus mainly on those undertaken by the SCOP and CATH groups and their collaborators., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Bordin, Sillitoe, Lees and Orengo.)
- Published
- 2021
- Full Text
- View/download PDF
40. CATH: increased structural coverage of functional space.
- Author
-
Sillitoe I, Bordin N, Dawson N, Waman VP, Ashford P, Scholes HM, Pang CSM, Woodridge L, Rauer C, Sen N, Abbasian M, Le Cornu S, Lam SD, Berka K, Varekova IH, Svobodova R, Lees J, and Orengo CA
- Subjects
- Amino Acid Sequence, COVID-19 epidemiology, COVID-19 prevention & control, COVID-19 virology, Computational Biology methods, Epidemics, Humans, Internet, Molecular Sequence Annotation, Proteins genetics, Proteins metabolism, SARS-CoV-2 genetics, SARS-CoV-2 metabolism, SARS-CoV-2 physiology, Sequence Analysis, Protein methods, Sequence Homology, Amino Acid, Viral Proteins chemistry, Viral Proteins genetics, Viral Proteins metabolism, Computational Biology statistics & numerical data, Databases, Protein statistics & numerical data, Protein Domains, Proteins chemistry
- Abstract
CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt., (© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2021
- Full Text
- View/download PDF
41. Pex24 and Pex32 are required to tether peroxisomes to the ER for organelle biogenesis, positioning and segregation in yeast.
- Author
-
Wu F, de Boer R, Krikken AM, Akşit A, Bordin N, Devos DP, and van der Klei IJ
- Subjects
- Endoplasmic Reticulum genetics, Membrane Proteins genetics, Organelle Biogenesis, Peroxins genetics, Saccharomyces cerevisiae genetics, Saccharomycetales, Peroxisomes, Saccharomyces cerevisiae Proteins genetics
- Abstract
The yeast Hansenula polymorpha contains four members of the Pex23 family of peroxins, which characteristically contain a DysF domain. Here we show that all four H. polymorpha Pex23 family proteins localize to the endoplasmic reticulum (ER). Pex24 and Pex32, but not Pex23 and Pex29, predominantly accumulate at peroxisome-ER contacts. Upon deletion of PEX24 or PEX32 - and to a much lesser extent, of PEX23 or PEX29 - peroxisome-ER contacts are lost, concomitant with defects in peroxisomal matrix protein import, membrane growth, and organelle proliferation, positioning and segregation. These defects are suppressed by the introduction of an artificial peroxisome-ER tether, indicating that Pex24 and Pex32 contribute to tethering of peroxisomes to the ER. Accumulation of Pex32 at these contact sites is lost in cells lacking the peroxisomal membrane protein Pex11, in conjunction with disruption of the contacts. This indicates that Pex11 contributes to Pex32-dependent peroxisome-ER contact formation. The absence of Pex32 has no major effect on pre-peroxisomal vesicles that occur in pex3 atg1 deletion cells., Competing Interests: Competing interestsThe authors declare no competing or financial interests., (© 2020. Published by The Company of Biologists Ltd.)
- Published
- 2020
- Full Text
- View/download PDF
42. Hansenula polymorpha Pex37 is a peroxisomal membrane protein required for organelle fission and segregation.
- Author
-
Singh R, Manivannan S, Krikken AM, de Boer R, Bordin N, Devos DP, and van der Klei IJ
- Subjects
- Fungal Proteins chemistry, Membrane Proteins chemistry, Organelles chemistry, Peroxisomes chemistry, Saccharomycetales cytology, Saccharomycetales metabolism, Fungal Proteins metabolism, Membrane Proteins metabolism, Organelles metabolism, Peroxisomes metabolism, Saccharomycetales chemistry
- Abstract
Here, we describe a novel peroxin, Pex37, in the yeast Hansenula polymorpha. H. polymorpha Pex37 is a peroxisomal membrane protein, which belongs to a protein family that includes, among others, the Neurospora crassa Woronin body protein Wsc, the human peroxisomal membrane protein PXMP2, the Saccharomyces cerevisiae mitochondrial inner membrane protein Sym1, and its mammalian homologue MPV17. We show that deletion of H. polymorpha PEX37 does not appear to have a significant effect on peroxisome biogenesis or proliferation in cells grown at peroxisome-inducing growth conditions (methanol). However, the absence of Pex37 results in a reduction in peroxisome numbers and a defect in peroxisome segregation in cells grown at peroxisome-repressing conditions (glucose). Conversely, overproduction of Pex37 in glucose-grown cells results in an increase in peroxisome numbers in conjunction with a decrease in their size. The increase in numbers in PEX37-overexpressing cells depends on the dynamin-related protein Dnm1. Together our data suggest that Pex37 is involved in peroxisome fission in glucose-grown cells. Introduction of human PXMP2 in H. polymorpha pex37 cells partially restored the peroxisomal phenotype, indicating that PXMP2 represents a functional homologue of Pex37. H.polymorpha pex37 cells did not show aberrant growth on any of the tested carbon and nitrogen sources that are metabolized by peroxisomal enzymes, suggesting that Pex37 may not fulfill an essential function in transport of these substrates or compounds required for their metabolism across the peroxisomal membrane., (© 2019 The Authors. The FEBS Journal published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.)
- Published
- 2020
- Full Text
- View/download PDF
43. ICBdocker: a Docker image for proteome annotation and visualization.
- Author
-
Bordin N and Devos DP
- Subjects
- Computational Biology, Proteomics, Proteome, Software
- Abstract
Summary: We introduce ICBdocker, a Docker environment that allows the annotation of functional and structural features of proteomes through a Python/Perl pipeline. DataTables pages make it easy to set up a web-resource for research groups with a focus on the same organisms or datasets. The results are available as tab-separated values files and HTML, allowing data analysis and browsing. The pipeline focuses on modularity and scalability, with capability of integrating with multi-processing and high-performance computing clusters., Availability and Implementation: ICBdocker is freely available on DockerHub at https://hub.docker.com/r/bordin89/icb/ Source code and documentation are available on GitHub at: https://github.com/bordin89/ICB_docker.
- Published
- 2018
- Full Text
- View/download PDF
44. Biocontrol traits of Bacillus licheniformis GL174, a culturable endophyte of Vitis vinifera cv. Glera.
- Author
-
Nigris S, Baldan E, Tondello A, Zanella F, Vitulo N, Favaro G, Guidolin V, Bordin N, Telatin A, Barizza E, Marcato S, Zottini M, Squartini A, Valle G, and Baldan B
- Subjects
- Bacillus licheniformis genetics, Biodiversity, Endophytes genetics, Endophytes physiology, Genome, Bacterial, Phylogeny, Plant Diseases microbiology, Plant Leaves microbiology, Plant Roots microbiology, Sequence Analysis, DNA, Whole Genome Sequencing, Bacillus licheniformis physiology, Biological Control Agents, Vitis microbiology
- Abstract
Background: Bacillus licheniformis GL174 is a culturable endophytic strain isolated from Vitis vinifera cultivar Glera, the grapevine mainly cultivated for the Prosecco wine production. This strain was previously demonstrated to possess some specific plant growth promoting traits but its endophytic attitude and its role in biocontrol was only partially explored. In this study, the potential biocontrol action of the strain was investigated in vitro and in vivo and, by genome sequence analyses, putative functions involved in biocontrol and plant-bacteria interaction were assessed., Results: Firstly, to confirm the endophytic behavior of the strain, its ability to colonize grapevine tissues was demonstrated and its biocontrol properties were analyzed. Antagonism test results showed that the strain could reduce and inhibit the mycelium growth of diverse plant pathogens in vitro and in vivo. The strain was demonstrated to produce different molecules of the lipopeptide class; moreover, its genome was sequenced, and analysis of the sequences revealed the presence of many protein-coding genes involved in the biocontrol process, such as transporters, plant-cell lytic enzymes, siderophores and other secondary metabolites., Conclusions: This step-by-step analysis shows that Bacillus licheniformis GL174 may be a good biocontrol agent candidate, and describes some distinguished traits and possible key elements involved in this process. The use of this strain could potentially help grapevine plants to cope with pathogen attacks and reduce the amount of chemicals used in the vineyard.
- Published
- 2018
- Full Text
- View/download PDF
45. Planctomycetes attached to algal surfaces: Insight into their genomes.
- Author
-
Faria M, Bordin N, Kizina J, Harder J, Devos D, and Lage OM
- Subjects
- Bacterial Outer Membrane Proteins genetics, Biofilms, Chlorophyta microbiology, Lipopolysaccharides biosynthesis, Lipopolysaccharides genetics, Phaeophyceae microbiology, Planctomycetales pathogenicity, Planctomycetales physiology, Proteoglycans genetics, Genome, Bacterial, Planctomycetales genetics
- Abstract
Planctomycetes are bacteria with complex molecular and cellular biology. They have large genomes, some over 7Mb, and complex life cycles that include motile cells and sessile cells. Some live on the complex biofilm of macroalgae. Factors governing their life in this environment were investigated at the genomic level. We analyzed the genomes of three planctomycetes isolated from algal surfaces. The genomes were 6.6Mbp to 8.1Mbp large. Genes for outer-membrane proteins, peptidoglycan and lipopolysaccharide biosynthesis were present. Rubripirellula obstinata LF1
T , Roseimaritima ulvae UC8T and Mariniblastus fucicola FC18T shared with Rhodopirellula baltica and R. rubra SWK7 unique proteins related to metal binding systems, phosphate metabolism, chemotaxis, and stress response. These functions may contribute to their ecological success in such a complex environment. Exceptionally huge proteins (6000 to 10,000 amino-acids) with extracellular, periplasmic or membrane-associated locations were found which may be involved in biofilm formation or cell adhesion., (Copyright © 2017 Elsevier Inc. All rights reserved.)- Published
- 2018
- Full Text
- View/download PDF
46. PVCbase: an integrated web resource for the PVC bacterial proteomes.
- Author
-
Bordin N, González-Sánchez JC, and Devos DP
- Subjects
- Bacterial Proteins metabolism, Chlamydiaceae metabolism, Planctomycetales metabolism, Proteome metabolism, Verrucomicrobia metabolism, Web Browser, Bacterial Proteins genetics, Chlamydiaceae genetics, Databases, Protein, Internet, Planctomycetales genetics, Proteome genetics, Verrucomicrobia genetics
- Abstract
Interest in the Planctomycetes-Verrucomicrobia-Chlamydiae (PVC) bacterial superphylum is growing within the microbiology community. These organisms do not have a specialized web resource that gathers in silico predictions in an integrated fashion. Hence, we are providing the PVC community with PVCbase, a specialized web resource that gathers in silico predictions in an integrated fashion. PVCbase integrates protein function annotations obtained through sequence analysis and tertiary structure prediction for 39 representative PVC proteomes (PVCdb), a protein feature visualizer (Foundation) and a custom BLAST webserver (PVCBlast) that allows to retrieve the annotation of a hit directly from the DataTables. We display results from various predictors, encompassing most functional aspects, allowing users to have a more comprehensive overview of protein identities. Additionally, we illustrate how the application of PVCdb can be used to address biological questions from raw data. PVCbase is freely accessible at: www.pvcbacteria.org/pvcbase.
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.