Senior Data Engineer with Data Science/MLOps Background

2 Months ago • 5 Years + • Data Analyst

Job Summary

Job Description

This Senior Data Engineer role requires 5+ years of experience, focusing on designing, developing, and maintaining data pipelines within Palantir Foundry. The ideal candidate will have a strong background in cloud technologies, data architecture, and machine learning. Key responsibilities include collaborating with cross-functional teams, creating data models, implementing ETL/ELT processes, monitoring pipeline performance, facilitating model deployment, and supporting ML Ops practices. Proficiency in Python, PySpark, SQL, and experience with big data technologies (Hadoop, Spark, Kafka, etc.) are essential. The role involves troubleshooting data pipelines, ensuring data integrity and security, and staying current with emerging technologies. Experience in the pharmaceutical or life sciences industry is preferred.
Must have:
  • 5+ years data engineering experience
  • Proficiency in Python and PySpark
  • Big data technologies expertise (Hadoop, Spark)
  • Cloud services experience (AWS, Azure, GCP)
  • Data modeling & ETL/ELT
  • Database systems knowledge (PostgreSQL, MySQL)
  • Containerization (Docker, Kubernetes)
  • ML Ops familiarity (model deployment)
  • Pharmaceutical/life sciences experience preferred
Good to have:
  • Cloud certifications
  • Apache Lucene experience
  • REST API experience
  • Veeva CRM, Reltio, SAP, Palantir Foundry familiarity
  • Knowledge of pharmaceutical regulations
  • JavaScript and TypeScript experience
Perks:
  • Flexible working format
  • Competitive salary & compensation
  • Personalized career growth
  • Professional development tools
  • Education reimbursement
  • Corporate events & team buildings

Job Details

Senior Data Engineer

We are seeking a proactive Senior Data Engineer to join our vibrant team. As a Senior Data Engineer, you will play a critical role in designing, developing, and maintaining sophisticated data pipelines, Ontology Objects, and Foundry Functions within Palantir Foundry. Your background in machine learning and data science will be valuable in optimizing data workflows, enabling efficient model deployment, and supporting AI-driven initiatives.  The ideal candidate will possess a robust background in cloud technologies, data architecture, and a passion for solving complex data challenges.

 

Key Responsibilities:

  • Collaborate with cross-functional teams to understand data requirements, and design, implement and maintain scalable data pipelines in Palantir Foundry, ensuring end-to-end data integrity and optimizing workflows.
  • Gather and translate data requirements into robust and efficient solutions, leveraging your expertise in cloud-based data engineering. Create data models, schemas, and flow diagrams to guide development.
  • Develop, implement, optimize and maintain efficient and reliable data pipelines and ETL/ELT processes to collect, process, and integrate data to ensure timely and accurate data delivery to various business applications, while implementing data governance and security best practices to safeguard sensitive information.
  • Monitor data pipeline performance, identify bottlenecks, and implement improvements to optimize data processing speed and reduce latency. 
  • Collaborate with Data Scientists to facilitate model deployment and integration into production environments.
  • Support the implementation of basic ML Ops practices, such as model versioning and monitoring.
  • Assist in optimizing data pipelines to improve machine learning workflows.
  • Troubleshoot and resolve issues related to data pipelines, ensuring continuous data availability and reliability to support data-driven decision-making processes.
  • Stay current with emerging technologies and industry trends, incorporating innovative solutions into data engineering practices, and effectively document and communicate technical solutions and processes.

Tools and skills you will use in this role:

  • Palantir Foundry
  • Python
  • PySpark
  • SQL
  • TypeScript

Required:

  • 5+ years of experience in data engineering, preferably within the pharmaceutical or life sciences industry;
  • Strong proficiency in Python and PySpark;
  • Proficiency with big data technologies (e.g., Apache Hadoop, Spark, Kafka, BigQuery, etc.);
  • Hands-on experience with cloud services (e.g., AWS Glue, Azure Data Factory, Google Cloud Dataflow);
  • Expertise in data modeling, data warehousing, and ETL/ELT concepts;
  • Hands-on experience with database systems (e.g., PostgreSQL, MySQL, NoSQL, etc.);
  • Proficiency in containerization technologies (e.g., Docker, Kubernetes);
  • Familiarity with ML Ops concepts, including model deployment and monitoring.
  • Basic understanding of machine learning frameworks such as TensorFlow or PyTorch.
  • Exposure to cloud-based AI/ML services (e.g., AWS SageMaker, Azure ML, Google Vertex AI).
  • Experience working with feature engineering and data preparation for machine learning models.
  • Effective problem-solving and analytical skills, coupled with excellent communication and collaboration abilities;
  • Strong communication and teamwork abilities;
  • Understanding of data security and privacy best practices;
  • Strong mathematical, statistical, and algorithmic skills.

Nice to have:

  • Certification in Cloud platforms, or related areas;
  • Experience with search engine Apache Lucene, Webservice Rest API;
  • Familiarity with Veeva CRM, Reltio, SAP, and/or Palantir Foundry;
  • Knowledge of pharmaceutical industry regulations, such as data privacy laws, is advantageous;
  • Previous experience working with JavaScript and TypeScript.





We offer:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

Similar Jobs

NVIDIA - Solution Architect - Auto

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Months ago
Canonical - Product Manager - Desktop

Canonical

(Remote)
2 Weeks ago
Applike Group - Director of Technology (f/m/d)

Applike Group

Hamburg, Hamburg, Germany (Hybrid)
7 Months ago
Reddit - Senior Staff Machine Learning Engineer, Ads Measurement

Reddit

United States (Remote)
1 Month ago
Google - Senior Software Engineer, Applied AI

Google

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
1 Month ago
Zazz - Marketing Data Specialist

Zazz

India (On-Site)
5 Months ago
Tap nation  - Product Manager Data Platform

Tap nation

United States (Hybrid)
2 Months ago
Voodoo - Senior Data Scientist - Ad network

Voodoo

Paris, Île-de-France, France (Hybrid)
3 Months ago
Krafton - Product Analyst

Krafton

Bengaluru, Karnataka, India (On-Site)
6 Months ago
bytedance - Data Analyst - Global Payment

bytedance

Singapore (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Penumbrainc - Procurement Process Excellence Principal

Penumbrainc

Alameda, California, United States (On-Site)
3 Weeks ago
Keen Games - Data Analyst

Keen Games

Frankfurt Am Main, Hessen, Germany (Remote)
11 Months ago
commerce iq - Senior Manager- Data Science

commerce iq

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Meta - Data Engineer, Product Analytics

Meta

Los Angeles, California, United States (On-Site)
6 Months ago
Motive - Engineering Manager, Backend (AI Dashcam Products)

Motive

Pakistan (Remote)
2 Weeks ago
Ethos Life - Senior Salesforce Business Systems Analyst

Ethos Life

Bengaluru, Karnataka, India (Hybrid)
2 Weeks ago
Mercury - Senior Product Manager - Risk

Mercury

(Remote)
1 Month ago
MIQ Digital - Senior Data Scientist

MIQ Digital

Bengaluru, Karnataka, India (Hybrid)
1 Week ago
Microsoft - Member of Technical Staff - Backend Growth Engineer

Microsoft

Mountain View, California, United States (Hybrid)
2 Months ago
Reddit - Machine Learning Manager - Ads Targeting Modeling

Reddit

United States (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Poland

Keywords Studios - AI - Senior Research Associate (Prompts)

Keywords Studios

Silesian Voivodeship, Poland (On-Site)
2 Months ago
Tesla - Sales Advisor

Tesla

Ząbki, Masovian Voivodeship, Poland (On-Site)
3 Months ago
Pocket Worlds - Staff Backend Engineer - Infrastructure

Pocket Worlds

Poland (On-Site)
2 Months ago
Haleon - Systems Engineer QMS

Haleon

Poznań, Greater Poland Voivodeship, Poland (Hybrid)
1 Month ago
Axel Springer News Media National - Prompt Engineer

Axel Springer News Media National

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Palo Alto Networks - Sr. Technical Support Engineer, Focused Services

Palo Alto Networks

Warsaw, Masovian Voivodeship, Poland (Remote)
1 Week ago
Springer Group - Digital Campaigns Executive

Springer Group

Warsaw, Masovian Voivodeship, Poland (Hybrid)
5 Days ago
Veeam Software - C++ Developer

Veeam Software

Poland (Remote)
1 Week ago
Room 8 Group - Engagement Manager

Room 8 Group

Poland (On-Site)
2 Weeks ago
Playtika - Service Operations Group Manager

Playtika

Poland (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Rackspace Technology - Marketing Operations Analyst II

Rackspace Technology

Gurugram, Haryana, India (Remote)
2 Months ago
Ion - Internship - Data Science

Ion

Pisa, Tuscany, Italy (On-Site)
7 Months ago
Amanotes - Product Data Analyst

Amanotes

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-Site)
1 Month ago
Netflix - Data Analyst, Production Finance Operations & Innovation

Netflix

Los Angeles, California, United States (On-Site)
1 Month ago
Riot Games - Insights Analyst III

Riot Games

Singapore (On-Site)
8 Months ago
Ubisoft - Regional Project Intelligence Director (Nordics & Romania)

Ubisoft

Malmö, Skåne County, Sweden (Hybrid)
1 Month ago
Netflix - Data Scientist (L5) - Product Promotion & Algorithm Performance

Netflix

Los Angeles, California, United States (On-Site)
1 Month ago
Ubisoft - Mobile Market Analyst Assistant Intern

Ubisoft

Paris, Île-de-France, France (On-Site)
1 Month ago
DraftKings - Senior Data Engineer, Platform

DraftKings

Boston, Massachusetts, United States (On-Site)
1 Month ago
PwC - D&A - GDC

PwC

Kolkata, West Bengal, India (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded