Senior Data Engineer with Data Science/MLOps Background

1 Month ago • 5 Years + • Data Analyst

Job Summary

Job Description

This Senior Data Engineer role requires 5+ years of experience in data engineering, preferably in pharmaceuticals or life sciences. Responsibilities include designing, developing, and maintaining data pipelines in Palantir Foundry, collaborating with data scientists on model deployment, optimizing data workflows, implementing ML Ops practices (model versioning, monitoring), and troubleshooting pipeline issues. The ideal candidate will have strong Python, PySpark, and cloud technologies (AWS, Azure, GCP) experience, expertise in data modeling and warehousing, and familiarity with containerization and ML frameworks. The role involves working with big data technologies (Hadoop, Spark, Kafka, BigQuery), database systems (PostgreSQL, MySQL, NoSQL), and implementing data governance and security best practices.
Must have:
  • 5+ years data engineering experience
  • Proficiency in Python & PySpark
  • Cloud technologies expertise
  • Data modeling & warehousing
  • ETL/ELT processes
  • ML Ops familiarity
  • Data governance & security
Good to have:
  • Cloud certifications
  • Apache Lucene
  • Veeva CRM, Reltio, SAP experience
  • Palantir Foundry experience
  • Pharmaceutical regulations knowledge
  • JavaScript and TypeScript
Perks:
  • Flexible working format
  • Competitive salary
  • Personalized career growth
  • Professional development tools
  • Education reimbursement
  • Corporate events

Job Details

Senior Data Engineer

We are seeking a proactive Senior Data Engineer to join our vibrant team. As a Senior Data Engineer, you will play a critical role in designing, developing, and maintaining sophisticated data pipelines, Ontology Objects, and Foundry Functions within Palantir Foundry. Your background in machine learning and data science will be valuable in optimizing data workflows, enabling efficient model deployment, and supporting AI-driven initiatives.  The ideal candidate will possess a robust background in cloud technologies, data architecture, and a passion for solving complex data challenges.

 

Key Responsibilities:

  • Collaborate with cross-functional teams to understand data requirements, and design, implement and maintain scalable data pipelines in Palantir Foundry, ensuring end-to-end data integrity and optimizing workflows.
  • Gather and translate data requirements into robust and efficient solutions, leveraging your expertise in cloud-based data engineering. Create data models, schemas, and flow diagrams to guide development.
  • Develop, implement, optimize and maintain efficient and reliable data pipelines and ETL/ELT processes to collect, process, and integrate data to ensure timely and accurate data delivery to various business applications, while implementing data governance and security best practices to safeguard sensitive information.
  • Monitor data pipeline performance, identify bottlenecks, and implement improvements to optimize data processing speed and reduce latency. 
  • Collaborate with Data Scientists to facilitate model deployment and integration into production environments.
  • Support the implementation of basic ML Ops practices, such as model versioning and monitoring.
  • Assist in optimizing data pipelines to improve machine learning workflows.
  • Troubleshoot and resolve issues related to data pipelines, ensuring continuous data availability and reliability to support data-driven decision-making processes.
  • Stay current with emerging technologies and industry trends, incorporating innovative solutions into data engineering practices, and effectively document and communicate technical solutions and processes.

Tools and skills you will use in this role:

  • Palantir Foundry
  • Python
  • PySpark
  • SQL
  • TypeScript

Required:

  • 5+ years of experience in data engineering, preferably within the pharmaceutical or life sciences industry;
  • Strong proficiency in Python and PySpark;
  • Proficiency with big data technologies (e.g., Apache Hadoop, Spark, Kafka, BigQuery, etc.);
  • Hands-on experience with cloud services (e.g., AWS Glue, Azure Data Factory, Google Cloud Dataflow);
  • Expertise in data modeling, data warehousing, and ETL/ELT concepts;
  • Hands-on experience with database systems (e.g., PostgreSQL, MySQL, NoSQL, etc.);
  • Proficiency in containerization technologies (e.g., Docker, Kubernetes);
  • Familiarity with ML Ops concepts, including model deployment and monitoring.
  • Basic understanding of machine learning frameworks such as TensorFlow or PyTorch.
  • Exposure to cloud-based AI/ML services (e.g., AWS SageMaker, Azure ML, Google Vertex AI).
  • Experience working with feature engineering and data preparation for machine learning models.
  • Effective problem-solving and analytical skills, coupled with excellent communication and collaboration abilities;
  • Strong communication and teamwork abilities;
  • Understanding of data security and privacy best practices;
  • Strong mathematical, statistical, and algorithmic skills.

Nice to have:

  • Certification in Cloud platforms, or related areas;
  • Experience with search engine Apache Lucene, Webservice Rest API;
  • Familiarity with Veeva CRM, Reltio, SAP, and/or Palantir Foundry;
  • Knowledge of pharmaceutical industry regulations, such as data privacy laws, is advantageous;
  • Previous experience working with JavaScript and TypeScript.





We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

Similar Jobs

Synechron - Lead AI/ML Engineer

Synechron

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Synechron - .Net Full Stack Lead – Banking Domain

Synechron

Kansas City, Missouri, United States (On-Site)
3 Weeks ago
Tamatem - Retail Banking Analytics Product Director

Tamatem

Raleigh, North Carolina, United States (Remote)
3 Weeks ago
Sony Pictures Entertainment - Manager, Insights, Strategy & Analytics

Sony Pictures Entertainment

Culver City, California, United States (Hybrid)
4 Days ago
Reddit - Senior Staff Machine Learning Engineer, Ads Marketplace Quality

Reddit

United States (Remote)
2 Weeks ago
PwC - AWS Data Engineer|Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
8 Months ago
PwC - Senior Associate_ GCP Data Visualization_ Data and  Analytics_Advisory_Bengaluru

PwC

Bengaluru, Karnataka, India (On-Site)
8 Months ago
PwC - IN_Manager_ Data Governance, D&A_Advisory_Manager

PwC

Mumbai, Maharashtra, India (On-Site)
8 Months ago
PwC - D&A - GDC

PwC

Kolkata, West Bengal, India (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Beyond Sports - Sports Visualization Specialist

Beyond Sports

Alkmaar, North Holland, Netherlands (On-Site)
1 Month ago
Canonical - Software Engineer - OpenStack

Canonical

(Remote)
2 Weeks ago
Social Point - Principal Data Analyst

Social Point

Barcelona, Catalonia, Spain (On-Site)
2 Weeks ago
C3 IoT - AI Engagement Manager / Director

C3 IoT

Redwood City, California, United States (On-Site)
4 Days ago
Wildlife Studios - Data Scientist

Wildlife Studios

São Paulo, Brazil (On-Site)
1 Week ago
fairmatic - Senior Data Scientist

fairmatic

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
7 Months ago
Canonical - Senior Jira Software Engineer

Canonical

(Remote)
2 Weeks ago
sound cloud - Engineering Manager, Anti-Abuse

sound cloud

Berlin, Berlin, Germany (On-Site)
1 Month ago
PwC - Manager - Strategy& e Inteligencia Artificial

PwC

Buenos Aires, Buenos Aires, Argentina (On-Site)
4 Months ago
Henkel - Data Scientist-Intern

Henkel

Pune, Maharashtra, India (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Poland

Intel  - Infrastructure and DevOps Engineer

Intel

Gdańsk, Pomeranian Voivodeship, Poland (Hybrid)
3 Weeks ago
Meta - Production Engineer

Meta

Warsaw, Masovian Voivodeship, Poland (On-Site)
6 Months ago
Keywords Studios - Player Support Agent - German/English

Keywords Studios

Silesian Voivodeship, Poland (Hybrid)
2 Months ago
Opendoor - Staff Software Engineer - Application Security (SAST, DAST, IAST)

Opendoor

Kraków, Lesser Poland Voivodeship, Poland (Hybrid)
2 Weeks ago
Keywords Studios - Content Moderator - French (Video Games) - Remote

Keywords Studios

Silesian Voivodeship, Poland (Remote)
1 Month ago
CD PROJEKT RED - Senior Weapons Artist

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Months ago
CD PROJEKT RED - Senior Engine Programmer

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (Hybrid)
2 Weeks ago
luxsoft - Senior Java Developer

luxsoft

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
3 Weeks ago
dun bradstreet - Senior Solutions Sales Advisor (R-16673)

dun bradstreet

Warsaw, Masovian Voivodeship, Poland (Hybrid)
7 Months ago
PwC - Senior Workday Talent and Learning Consultant

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
7 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Voodoo - Senior Data Analyst (Growth) - Freelance

Voodoo

Paris, Île-de-France, France (Hybrid)
2 Months ago
Zen consultancy - Data science interns

Zen consultancy

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Trendyol - Senior Data Analyst ( Data Science - Site Management)

Trendyol

Istanbul, İstanbul, Türkiye (Hybrid)
7 Months ago
Trendyol - Pricing Data Analyst

Trendyol

İstanbul, Türkiye (Hybrid)
7 Months ago
Electronic Arts - Advanced Data Analyst, UGX

Electronic Arts

Vancouver, British Columbia, Canada (Hybrid)
2 Months ago
Token Metrics - Senior Crypto Data Engineer (Remote)

Token Metrics

Budapest, Hungary (Remote)
7 Months ago
HoYoverse - Data Analyst - Honkai: Star Rail - Fresh Grad

HoYoverse

Singapore (On-Site)
6 Months ago
playrix  - Senior Data Analyst (Game)

playrix

Cyprus (Remote)
7 Months ago
Luxoft - BI Developer (SSIS and SSAS)

Luxoft

Gurugram, Haryana, India (On-Site)
6 Months ago
Dream Games - Product Specialist

Dream Games

İstanbul, Türkiye (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded