Combining data from different sources for personalised treatment
The Bioinformatics Center at the University of Eastern Finland (UEF), led by Virpi Ahola, is developing new applications for analysing biomedical and multimodal data. These can be used to study cancers as well as metabolic, cardiovascular and neurodegenerative diseases.
Ahola has had a long career in bioinformatics. She was part of Professor Ilkka Hanski’s metapopulation biology research group, which sequenced the whole genome of the Glanville fritillary butterfly, the first reference genome assembled in Finland. At the Karolinska Institute in Hong Kong, Ahola analysed gene function in different diseases at the single-cell level, studying how stem cells can be used to develop new drugs and treatments. She now heads the UEF Bioinformatics Center.
The Bioinformatics Center integrates different types of omics data (genomics, proteomics, transcriptomics) with clinical data and, in the future, possibly also imaging data.
“In addition to the usual omics analyses, we carry out multimodal data analysis for different research groups. This entails combining the analysis of different types of data in order to provide more information than if they are analysed separately.”
How the multimodal data is analysed depends, among other things, on whether the different data types originate from the same patients or from different ones.
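That distinction matters in practice. The following is a minimal sketch of the paired case, assuming hypothetical input files and a shared patient identifier; data measured on the same individuals can simply be joined sample by sample, whereas data from different individuals must be aligned statistically instead:

```python
import pandas as pd

# Hypothetical example: omics tables measured on the same patients.
# File names, column names and identifiers are illustrative only.
rna = pd.read_csv("transcriptomics_counts.csv", index_col="patient_id")
prot = pd.read_csv("proteomics_intensities.csv", index_col="patient_id")
clinical = pd.read_csv("clinical_data.csv", index_col="patient_id")

# Paired integration: the modalities share patient IDs, so they can be
# joined sample by sample before downstream modelling.
paired = (
    rna.join(prot, how="inner", lsuffix="_rna", rsuffix="_prot")
       .join(clinical, how="inner")
)
print(f"{paired.shape[0]} patients with all three data types")

# If the modalities come from *different* patients, a direct join is not
# possible; the datasets must instead be aligned statistically, for example
# in a shared latent space learned by an integration method.
```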
Omics refers to research methods that aim to analyse all the genetically determined variables of a research subject simultaneously. Genomics analyses genetic variation and the function of genes, proteomics focuses on proteins, and epigenetics on the regulation of gene function and the storage of heritable information without changes in the DNA sequence. Metabolomics, for its part, analyses changes in metabolism caused by disease, diet or medication.
“We are developing bioinformatics services in collaboration with biomedical experts. One focus at the University of Eastern Finland is on understanding the molecular basis of key chronic diseases and improving their prevention and treatment,” Ahola says.
Translational medicine applies the results of basic research in clinical trials and also uses patient samples and disease models to identify disease mechanisms and drug targets. The approach is interdisciplinary, which not only provides a good starting point for research but also helps to improve treatments for patients.
“What is delaying the era of translational medicine is that we simply don’t know enough. The idea behind combining several different data sources is to obtain more information. The integration is very much computational, and requires the resources of CSC – IT Center for Science and infrastructures such as ELIXIR.”
One example Ahola gives is single-cell technologies.
In transcription, the genetic code in DNA is copied into RNA. This is the first phase of protein synthesis. Transcriptomics provides precise information about the gene expression in an individual cell at a given moment.
“The use of single-cell transcriptomics is still expensive. The principles of open science, however, require that the data is shared when the results are published. This allows data to be reused, and different data sources to be combined.”
However, the challenge is that data is produced using different technologies.
“Different data sources may have different numbers of cells or different cell types. Which methods should be used to combine the different data? If this could be solved, we could analyse cell development and specialisation more effectively.”
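As an illustration of what such a combination can look like in practice, here is a minimal sketch using the scanpy and anndata toolkits (with harmonypy and leidenalg installed). The file names, batch labels and parameter values are assumptions for the example, and Harmony is only one of several possible batch-correction methods, not a description of the approach used at UEF:

```python
import scanpy as sc
import anndata as ad

# Hypothetical input: two single-cell RNA-seq datasets produced with
# different technologies. File names are illustrative only.
a = sc.read_h5ad("dataset_smartseq2.h5ad")
b = sc.read_h5ad("dataset_10x.h5ad")

# Concatenate on the genes shared by both datasets and record each
# cell's origin in a "batch" column.
adata = ad.concat([a, b], join="inner", label="batch", keys=["smartseq2", "10x"])

# Standard preprocessing: normalise library sizes, log-transform, and
# select highly variable genes while accounting for the batch.
sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)
sc.pp.highly_variable_genes(adata, n_top_genes=2000, batch_key="batch")
sc.pp.pca(adata, n_comps=30, use_highly_variable=True)

# Reduce technology-driven differences in the PCA space with Harmony,
# then build a neighbour graph, cluster and embed the integrated data.
sc.external.pp.harmony_integrate(adata, key="batch")
sc.pp.neighbors(adata, use_rep="X_pca_harmony")
sc.tl.leiden(adata)
sc.tl.umap(adata)
```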
Ahola’s aim is to provide more assistance in the use of computational methods.
The University of Eastern Finland’s Bioinformatics Center provides researchers with computing capacity, helps them with data pre-processing and analysis, and assists in the use and installation of different computational methods and software.
“If there are no bioinformaticians in their own team or in collaborating teams, researchers themselves are expected to be proficient in computational methods and in processing big data.”
Ahola admits that the requirements are tough, for example for postgraduate students. Fortunately, the University of Eastern Finland has taken up this challenge by offering Computational Biomedicine as an orientation option.
“One example of the reuse of data is Finnish biobanks, which contain the genomes of over half a million Finns. It’s not a simple matter to analyse the biobank data, because the amount of data is insane.”
Ahola is referring to the FinnGen research project, which was launched in autumn 2017. Its main goal is to increase understanding of the causes of diseases and to promote their diagnosis, prevention and the development of treatment methods. FinnGen uses samples collected by all Finnish biobanks. By June 2023, more than 553,000 samples had been collected for the FinnGen study. The first phase of the research project lasted six years. There are only a few research projects of this scale in the world.
The research projects can combine genomic data with data from national health registries. Indeed, Finland has exceptionally good resources to carry out genetic research covering the entire population.
Clinical data from longitudinal studies combined with genetic data offers many opportunities, but large amounts of data are needed.
“Data collections are needed because no single researcher can collect data from 10,000 or 100,000 individuals. If the dataset is smaller, it may not provide reliable information for studying genetically complex diseases.”
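A rough power calculation illustrates why. The sketch below uses statsmodels with assumed numbers (a small standardised effect size and the conventional genome-wide significance threshold); the figures are illustrative and not taken from FinnGen or UEF:

```python
from statsmodels.stats.power import NormalIndPower

# Assumed numbers for illustration: how many carriers vs. non-carriers are
# needed to detect a variant with a small effect (Cohen's d = 0.05) at
# genome-wide significance (alpha = 5e-8) with 80% power?
analysis = NormalIndPower()
n_per_group = analysis.solve_power(effect_size=0.05, alpha=5e-8, power=0.8, ratio=1.0)
print(f"~{n_per_group:,.0f} individuals per group")  # roughly tens of thousands

# With only a few thousand participants per group, the power to detect the
# same effect collapses, which is why small cohorts give unreliable results
# for genetically complex diseases.
small_power = analysis.power(effect_size=0.05, nobs1=5000, alpha=5e-8, ratio=1.0)
print(f"power with 5,000 per group: {small_power:.3f}")
```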
There are many research projects using different data sources underway at the University of Eastern Finland. A project on Alzheimer’s disease at the University of Eastern Finland and Kuopio University Hospital will combine clinical data collected from patient visits with FinnGen data. In this way, researchers are aiming to understand the biological mechanisms leading to the onset of Alzheimer’s disease.
“FinnGen’s biobank is a unique resource that could be used much more in research,” Ahola says.
“Another example of research on Alzheimer’s disease is a project with Rappta Therapeutics and UEF professors Mikko Hiltunen and Annakaisa Haapasalo. This project uses transgenic cell lines to study the effect of different Alzheimer’s treatments on protein function.”
One interesting collaboration project is underway with Academy of Finland researcher Kirsi Ketola.
“The study investigates carboplatin treatment resistance mechanisms in prostate cancer. Carboplatin produces DNA cross-links, which lead to activation of a mechanism that repairs DNA and causes resistance, allowing cancer cells to divide again. The research uses single-cell techniques to measure both gene expression and chromatin changes at the single-cell level.”
Chromosomes are located in the nucleus, in the form of long chromatin strands.
According to Ahola, careful data integration and analysis could promote development of personalised treatment.
Ahola is a tireless advocate for the openness and reuse of data, and for the development of methods and infrastructures that facilitate and encourage this. She cites the European Genome-phenome Archive (EGA) as an example. This is a data archive that makes it possible to share and, with permission, access biomedical data that has already been published.
“The archive contains human genomic data, combined with clinical and other metadata. Since in principle it may be possible to identify a person by genome and phenotype, data sharing is strictly regulated.”
According to Ahola, the EGA allows for data sharing in the appropriate way. This makes it possible to reuse valuable biomedical research data, for instance for creating or testing new research hypotheses.
“Existing data can be approached from a different perspective – for example, patients can be selected using different criteria than in an already published study, or data can be used as part of a larger data set.”
Referring to Biocenter Finland, Ahola says that more should be done together. The centre brings together seven biocentres from different Finnish universities. In her view, it should be possible to increase collaboration between the biocentres and internationally, for example through CSC, the Finnish ELIXIR node.
“ELIXIR is an avenue for us to network and learn from the experiences of other bioinformatics core facilities, and to be part of discussions where research infrastructure issues are brought up and new initiatives are taken.”
Because new technologies produce large and complex data sets, research infrastructures should also include data science experts, not research equipment alone.
“To make effective use of data, the computing capacity offered by CSC, for example, is not enough. Data processing and reuse also requires staff with expertise in the field. As I see it, better resourcing and systematic collaboration between biocentres could substantially facilitate and improve the processing, integration and reuse of large omics data sets.”
Ari Turunen
1.9.2023