Back to Search
Start Over
Data science strategies leading to the development of data scientists’ skills in organizations
- Source :
- Repositório Científico de Acesso Aberto de Portugal, Repositório Científico de Acesso Aberto de Portugal (RCAAP), instacron:RCAAP
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- The purpose of this paper is to compare the strategies of companies with data science practices and methodologies and the data specificities/variables that can influence the definition of a data science strategy in pharma companies. The current paper is an empirical study, and the research approach consists of verifying against a set of statistical tests the differences between companies with a data science strategy and companies without a data science strategy. We have designed a specific questionnaire and applied it to a sample of 280 pharma companies. The main findings are based on the analysis of these variables: overwhelming volume, managing unstructured data, data quality, availability of data, access rights to data, data ownership issues, cost of data, lack of pre-processing facilities, lack of technology, shortage of talent/skills, privacy concerns and regulatory risks, security, and difficulties of data portability regarding companies with a data science strategy and companies without a data science strategy. The paper offers an in-depth comparative analysis between companies with or without a data science strategy, and the key limitation is regarding the literature review as a consequence of the novelty of the theme; there is a lack of scientific studies regarding this specific aspect of data science. In terms of the practical business implications, an organization with a data science strategy will have better direction and management practices as the decision-making process is based on accurate and valuable data, but it needs data scientists skills to fulfil those goals. info:eu-repo/semantics/acceptedVersion
- Subjects :
- 0209 industrial biotechnology
Pharma
Process (engineering)
Skills
Novelty
Ciências Naturais::Ciências da Computação e da Informação [Domínio/Área Científica]
Sample (statistics)
Unstructured data
02 engineering and technology
Data science
Health sector
Set (abstract data type)
Big data
020901 industrial engineering & automation
Empirical research
Artificial Intelligence
Data quality
Data technostructure
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Business
Data management structure
Software
Statistical hypothesis testing
Subjects
Details
- ISSN :
- 14333058 and 09410643
- Volume :
- 33
- Database :
- OpenAIRE
- Journal :
- Neural Computing and Applications
- Accession number :
- edsair.doi.dedup.....8feebc996391d2dfd7273f5785f48c1c
- Full Text :
- https://doi.org/10.1007/s00521-021-06095-3