Position: Data Manager / Data Steward
Department: Computational Genomics and System Genetics
Code number: 2020-0170
The German Cancer Research Center is the largest biomedical research institution in Germany. With approximately 3,000 employees, we operate an extensive scientific program in the field of cancer research.
A position as data manager / data steward for the recently funded infrastructure initiative German Human Genome Phenome Archive (GHGA, http://www.ghga.de) is available at the German Cancer Research Center (DKFZ).
Briefly, GHGA has been funded as one of currently nine national infrastructure consortia that will be operated within the framework of the national research data infrastructure (NFDI, https://www.nfdi.de/). The goal of GHGA is to establish a national infrastructure for the archival, analysis and dissemination of human omics data. GHGA is implemented as a federated infrastructure, which currently connects six data centers across Germany to the federated network of the European Genome Phenome Archive (EGA, https://www.ebi.ac.uk/ega). To enable long-term scalability and respect national data security regulations, the EGA is undergoing a transition to a federated model, with GHGA operating as an EGA node for Germany. GHGA has an ambitious long-term concept with an initial focus on providing technological infrastructure and services for rare disease genetics and oncology. With its 16 partners, the GHGA network spans all major omics centers in Germany. At DKFZ, GHGA brings together research and service activities across key teams that develop and apply genomics technologies:
- Ivo Buchhalter (http://www.dkfz.de/en/forschung/zentrale_einrichtungen/cf-omics): Data stewardship & first-level user support, Data hub implementation & operation
- Benedikt Brors (http://www.dkfz.de/en/applied-bioinformatics): Training & user experience, Community data curation
- Stefan Fröhling (http://www.dkfz.de/en/translationale-medizinische-onkologie): Translation & cancer community
- Daniel Hübschmann (http://www.hi-stem.de/staff/74-daniel-huebschmann): Front end development & maintenance
- Martin Lablans (http://www.dkfz.de/en/verbis): FAIR Data & Interoperability
- Peter Lichter (http://www.dkfz.de/en/genetics): Translation & cancer community
- Oliver Stegle (http://www.dkfz.de/en/bioinformatik-genomik-systemgenetik): GHGA consortium spokesperson & back end development
We are looking for a system administrator, physicist, bioinformatician or candidate with a similar background who has experience in data management, data privacy and data security. The role is to manage the flow of GHGA data from ingest to download and to implement and oversee standard operating procedures (SOPs). The successful candidate will be part of an interdisciplinary research and data management team developing and applying a diverse range of state-of-the-art methodology to securely manage human omics data. This will be done in close cooperation with the participating GHGA institutions and other (inter)national partners. We are looking for a range of profiles, including data and NGS specialists.
Your responsibilities (will be distributed in a small team):
- Development and implementation of novel concepts for automated processing of different omics data
- Development and implementation of data security concepts, including risk management
- Development and implementation of SOPs for data management of human omics data as part of the GHGA infrastructure
- Strategy and processes for data ingest, metadata validation and quality control of data
- Development of novel metadata concepts for emerging omics data types
- Research and software development of staging mechanisms for data download and cloud-based analysis
- Presenting and disseminating data management concepts within the consortium and at (inter)national conferences
- Establishment of data-driven quality control strategies
- Closely following the field of secure data handling and sharing, and interacting with international initiatives such as the Global Alliance for Genomics and Health (GA4GH)
The successful applicant will hold a master's degree or equivalent qualification in computer science, physics, mathematics, engineering, bioinformatics, computational biology, biological science or other fields, ideally with a PhD degree. Demonstrated experience and expertise in the development of data management / data curation concepts is expected.
Previous experience in working with (big) data or in a service-related field is expected. Expertise in NGS, (human) omics data, data security and data privacy is beneficial, as is experience in communicating results and ideas to colleagues and (inter)national collaboration partners.
Proficiency with UNIX-based systems and relevant programming languages such as R or Python is beneficial.
The ideal applicant should have demonstrated the ability to work independently and creatively. The candidate should have excellent communication skills and be able to articulate requirements clearly, set clear goals and work within an interdisciplinary setting, communicating with other partners.
We offer:
- Interesting, versatile workplace
- International, attractive working environment
- Campus with modern state-of-the-art infrastructure
- Salary according to TV-L including social benefits
- Possibility to work part-time
- Flexible working hours
- Comprehensive further training program
Earliest Possible Start Date: 01.10.2020
Duration: The position is limited to 2 years with the possibility of prolongation.
The position can in principle be part-time.
Application Deadline: 18.08.2020
Eva Sabine Blum
Phone +49 6221/42-3601
Please note that we do not accept applications submitted via email.