Back to Search Start Over

Developing scalable software infrastructure for data storage and processing for computational biology problems

Developing scalable software infrastructure for data storage and processing for computational biology problems

Authors :
O. Borisenko
A. Laguta
D. Turdakov
S. Kuznetsov
Source :
Труды Института системного программирования РАН, Vol 26, Iss 4, Pp 45-54 (2018)
Publication Year :
2018
Publisher :
Ivannikov Institute for System Programming of the Russian Academy of Sciences, 2018.

Abstract

This article is an overview of scalable infrastructure for storage and processing of genome data in genetics problems. The overview covers used technologies descriptions, the organization of unified access to genome processing API of different underlying services. The article also covers methods for scalable and cloud computing technologies support. The first service in virtual genome processing laboratory is provided and presented. The service solves transcription factors bindning sites prediction problem. The main principles of service construction are provided. Basic requirements for underlying comptutaion software in virtual laboratory environments are provided. Overview describes the implemented web-service (https://api.ispras.ru/demo/gen) for transcription factors binding site prediction. Provided solution is based on ISPRAS API project as an API gateway and load-balancer; the middle-ware task-manager software for pool of workers support and for communications with Openstack infrastructure; OpenZFS as an intermediate storage with transparent compression support. The described solution is easy to extend with new services fitting the basic requirements.

Details

Language :
English, Russian
ISSN :
20798156 and 22206426
Volume :
26
Issue :
4
Database :
Directory of Open Access Journals
Journal :
Труды Института системного программирования РАН
Publication Type :
Academic Journal
Accession number :
edsdoj.577454c07f694759920b6831f428d3b2
Document Type :
article
Full Text :
https://doi.org/10.15514/ISPRAS-2014-26(4)-4