Biodiversity Informatics Group, PI Claus Weiland, Senckenberg Society for Nature Research

I am currently a scientific software engineer in the Biodiversity Informatics group of Claus Weiland working mainly on projects related to research data infrastructure and semantic modeling. In the context of to the NFDI4Earth, a consortium with the aim to improve FAIR access to research data in the earth system sciences (ESS), I am a member of the working group for the NFDI4Earth Community Label. The Label group works on developing and evaluating guidelines for ESS data repositories. Furthermore I am involved in the technical development of the NFDI4Earth Knowledge Hub, the backend for the NFDI4Earth Web Portal based on a knowledge graph managed in an Apache Jena database.



Fries Lab, Ernst Strüngmann Institute (ESI) for Neuroscience

I was a scientific software developer in the lab of Prof. Dr. Pascal Fries at ESI, working on the Syncopy Python package for electrophysiology data analysis (MEG, EEG, ECoG). Syncopy is easy-to-use, with an interface similar to Fieldtrip, the standard Matlab package for the analysis of electrophysiology data. Syncopy is designed for large-scale data analyses and parallel computations, and can utilize multi-core machines and high-performance computing (HPC) systems like clusters running the Slurm job scheduler.

Electrophysiology data in Syncopy

Figure 1: Visualization of electrophysiological data, created with Syncopy.



Ecker Lab, University Hospital Frankfurt

I was a postdoc in Computational Neuroimaging at University Hospital Frankfurt in the Department of Child and Adolescent Psychiatry, Psychosomatics and Psychotherapy in the group of Prof. Dr. Christine Ecker from 03/2018 - 04/2022. We used structural magnetic resonance imaging (sMRI) in combination with genetic data to better understand the mechanisms of autism spectrum disorders, and the complex interplay of neuroanatomy and brain function. We applied statistical and machine learning methods implemented in R, Python and Matlab to surface-based cortical reconstructions generated with FreeSurfer.

sMRI descriptor visualization

Figure 2: Visualization of sMRI neuroimaging data, created with my fsbrain package for R. A Visualization of raw morphometry data (cortical thickness) from native space on the white surface of a subject. The view shows the data in tiles from 8 different angles. B Arbitrary data (p-values in this case) visualized on the regions of the Desikan atlas, using the surface of the fsaverage (standard space template) subject from FreeSurfer. The view shows the data in tiles from 4 different angles. C The regions of the Desikan atlas on the white surface of a subject. The colors were loaded from the respective annotation file.



Molecular Health GmbH, Heidelberg

I worked as a scientific software developer at Molecular Health, a bioinformatics company that produces software solutions for precision medicine, with the goal to make the latest research available to clinicians based on AI and curated data from publications.

The main project was an internal web application for biomedical data curation based on Python, PostgreSQL and SQLAlchemy for the backend and React.js for the frontend. Another project was a high-throughput natural language processing system, which used Perl to orchestrate interdependent jobs in compute pipelines running on an HPC cluster.



Koch Lab, Goethe University Frankfurt am Main

I finished my doctorate in the lab of Prof. Dr. Ina Koch in 2016 and mainly worked on the following two projects in the fields of digital pathology and structural biology:


Digital pathology: The spatial distribution of immune cells in Hodgkin lymphoma

In this project, I worked on the analysis of Hodgkin lymphoma, a cancer of the lymphatic system, based on high-resolution images. We were interested in better understanding the way tumour cells interact with their environment, communicate and spread through the lymphatic system. We implemented a digital image analysis pipeline to perform cell detection, description and classification with the goal to aid pathologists during the diagnosis of the various sub types of lymphoma.

We used graphs to model and compare spatial cell distributions in different Hodgkin lymphoma subtypes as well as lymphadenitis. The project is a collaboration with Prof. Dr. Dr. h.c. Martin-Leo Hansmann at the Senckenberg Institute of Pathology at University Hospital Frankfurt.

Cellgraph

Figure 3: Part of a whole slide image from a Hodgkin lymphoma case. Cell nuclei are stained in blue, and CD30+ cells in red. A cell graph is displayed as an overlay. Each vertex represents a cell detected by our imaging pipeline. Edges are added between cells which are close to each other. The graphs can be used to quantify clustering and to compare cell distributions.


Structural biology: The new Protein Topology Graph Library (PTGL) webserver

My diploma thesis dealt with modeling protein structure topologies by graph-theoretical methods. A part of the thesis was the development of the Visualization of Protein Ligand Graphs (VPLG) software. VPLG computes and visualizes protein ligand graphs. It works on the super-secondary structure level and uses the atom coordinates from PDB files and the SSE assignments of the DSSP algorithm. The graphs can be saved to a database or exported in standard graph formats for further analysis, including protein structure comparison and evolutionary studies.

VPLG is free software and available from the PTGL project website at GitHub. It powers the PTGL protein topology database, a web server which also supports motif detection and other advanced queries based on the graphs computed for all proteins of the RCSB Protein Data Bank.

PTGL Protein graph creation

Figure 4: From 3D atom data to protein graph. The 3D atom coordinates and the secondary structure assignments are used to compute contacts between secondary structure elements (SSEs). In the final cell graph, each vertex represents an SSE, and edges model spatial contacts and relative spatial orientations between SSEs.



Education