This work package (WP) will provide the expertise and computing infrastructure required to support all aspects of the ENSAT-HT project. This includes creating the ENSAT-HT virtual research environment and linking it with the ENS@T-CANCER patient data registries. The work package will also deliver multi-dimensional bioinformatics modelling of omics data to identify omics-derived biomarkers. The specific objectives of this WP are:
- Define the ENSAT-HT schema using structures and information from the ENS@T-CANCER systems.
- Establish and populate the ENSAT-HT registry, including patients recruited from the ENS@T systems, and maintain the ongoing linkage between the two.
- Develop the governance and data-sharing protocols and the security processes needed to support the other WPs.
- Load, link, and transform the omics data from the ENSAT-HT study cohort from WP2 and WP3 into the HIC system and provide security-oriented linkages with the ENSAT-HT phenotypic database hosted at MEG.
- Support the clinical trials (eCRFs) and databases in WP4 and their linkages with the ENS@T and ENSAT-HT registries.
- Create user interfaces to allow data annotation from both local and standard ‘omics’ reference sources.
- Configure the infrastructure and hardware to host the ENSAT-HT study ‘omics’ data sets within a Safe Haven environment.
- Conduct a systematic review of the literature to establish the current state of knowledge about machine learning algorithms applied to ‘omics’ datasets.
- Develop bioinformatics tools and models for data analysis, including support for advanced machine learning-driven data analytics.
- Apply data-driven bioinformatics modelling approaches to biomarker identification and discovery.
- Supply molecular signatures for validation by WP4.