Data Engineer

EMBL (European Molecular Biology Lab)

Cambridge, UK
Hybrid (2 days onsite, 3 days remote)
The European Molecular Biology Laboratory (EMBL) is seeking a Data Engineer to optimize and enhance data pipelines for its macromolecular structure databases. The role requires strong technical expertise in data modeling, SQL, and data processing tools, and offers a collaborative and innovative work environment with generous benefits.

Job Summary

  • The role involves optimizing data pipelines for essential resources like the Protein Data Bank (PDB) and AlphaFold Protein Structure Database (AFDB).
  • Candidates must have proven experience migrating databases from Oracle to PostgreSQL, including handling compatibility layers and stored procedure conversion.
  • EMBL-EBI offers a hybrid working model, generous benefits including 30 days annual leave, and a relocation package for international applicants.

Salary

Grade 5 salary starting at £3,303 per month after tax; pension and insurance contributions excluded; generous benefits including allowances and private medical insurance

Skills & Requirements

Must-have

  • Expert in Data Modelling and Advanced SQL
  • Proficiency in Python programming
  • Strong understanding of relational databases
  • Hands-on experience with ETL processes
  • Oracle to PostgreSQL migration expertise

Nice-to-have

  • Experience with big data technologies like Apache Spark
  • Familiarity with CI/CD tools such as GitLab CI
  • Knowledge of Google Cloud Platform or AWS
  • Familiarity with Neo4j or other graph databases
  • Affinity with structural biology and bioinformatics

Key Requirements

  • MSc in computer science, IT, or related field
  • Demonstrated IT expertise in bioinformatics
  • Deep knowledge of PostgreSQL architecture and tuning
  • Extensive experience with Oracle databases and PL/SQL
  • Proven track record in database migration projects

Work Rights

Not specified

Sponsorship: available
