Maria Luisa Chiusano, Hamed Bostan, Marco Miralto, Nicola Vitulo, Giorgio Valle, Mario Pezzotti, Mohamed Zouine, Amalia Barone, Mondher Bouzayen, Luca Ambrosino, Luigi Frusciante, Valentino Ruggieri, Ambrosino, Luca, Ruggieri, Valentino, Bostan, Hamed, Miralto, Marco, Vitulo, Nicola, Zouine, Mohamed, Barone, Amalia, Bouzayen, Mondher, Frusciante, Luigi, Pezzotti, Mario, Valle, Giorgio, Chiusano, Maria Luisa, Department of Agriculture, Università degli studi di Napoli Federico II, Stazione Zoologica Anton Dohrn (SZN), Centre de Recerca en Agrigenòmica - Center for Research in Agricultural Genomics, Partenaires INRAE, North Carolina State University, Center for High Performance Simulation and Department of Chemical and Biomolecular Engineering, University of Verona (UNIVR), Génomique et Biotechnologie des Fruits (GBF), Institut National de la Recherche Agronomique (INRA)-École nationale supérieure agronomique de Toulouse [ENSAT]-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées, Universita di Padova, Genopom Pro (PON02_00395_3082360) and HORT (PON02_00395_3215002) Projects (Ministero dell’Istruzione, dell’Università e della Ricerca (MIUR), Italy), The Cost Action FA1106, European Project: 289220,EC:FP7:PEOPLE,FP7-PEOPLE-2011-ITN,SPOT-ITN(2012), Producció Vegetal, Genòmica i Biotecnologia, Ministero dell'Istruzione, dell'Università e della Ricerca, and European Commission
[Background]: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework., [Results]: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at http://biosrv.cab.unina.it/comparalogs/gene/search., [Conclusions]: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species., Publication costs for this manuscript were sponsored by the Genopom Pro (PON02_00395_3082360) and HORT (PON02_00395_3215002) Projects (Ministero dell’Istruzione, dell’Università e della Ricerca (MIUR), Italy) and from the Solanaceae Pollen Thermotolerance - Marie Curie Initial Training Network project (Grant Agreement No. 289220). This work is the frame of the Cost Action FA1106.