Lead Big Data Engineer

10 Minutes ago • 5 Years + • Data Analysis

Job Summary

Job Description

N-iX is seeking a proactive Senior Data Engineer to design, develop, and maintain sophisticated data pipelines, Ontology Objects, and Foundry Functions within Palantir Foundry. This role involves collaborating with cross-functional teams to understand data requirements, implementing scalable data pipelines, and optimizing workflows. The ideal candidate will have a strong background in cloud technologies, data architecture, and a passion for solving complex data challenges, with valuable experience in machine learning and data science to support AI-driven initiatives and efficient model deployment.
Must have:
  • Design, implement, and maintain scalable data pipelines in Palantir Foundry.
  • Gather and translate data requirements into robust and efficient solutions.
  • Develop, optimize, and maintain efficient and reliable data pipelines and ETL/ELT processes.
  • Monitor data pipeline performance and implement improvements.
  • Collaborate with Data Scientists for model deployment and integration.
  • Support basic ML Ops practices like model versioning and monitoring.
  • Optimize data pipelines to improve machine learning workflows.
  • Troubleshoot and resolve data pipeline issues.
  • Stay current with emerging technologies and document technical solutions.
  • 5+ years experience in data engineering, preferably in pharma/life sciences.
  • Strong proficiency in Python.
  • Hands-on experience with cloud services (AWS Glue, Azure Data Factory, Google Cloud Dataflow).
  • Expertise in data modeling, data warehousing, and ETL/ELT concepts.
  • Hands-on experience with database systems (PostgreSQL, MySQL, NoSQL).
  • Hands-on experience in containerization technologies (Docker, Kubernetes).
  • Familiarity with ML Ops concepts, including model deployment and monitoring.
  • Basic understanding of machine learning frameworks (TensorFlow or PyTorch).
  • Exposure to cloud-based AI/ML services (AWS SageMaker, Azure ML, Google Vertex AI).
  • Experience with feature engineering and data preparation for ML models.
  • Effective problem-solving, analytical, communication, and collaboration skills.
  • Understanding of data security and privacy best practices.
  • Strong mathematical, statistical, and algorithmic skills.
Good to have:
  • Certification in Cloud platforms or related areas.
  • Experience with search engine Apache Lucene, Web Service Rest API.
  • Familiarity with Veeva CRM, Reltio, SAP, and/or Palantir Foundry.
  • Knowledge of pharmaceutical industry regulations.
  • Previous experience working with JavaScript and TypeScript.
Perks:
  • Flexible working format (remote, office-based or flexible).
  • Competitive salary and good compensation package.
  • Personalized career growth.
  • Professional development tools (mentorship, tech talks, trainings, centers of excellence).
  • Active tech communities with regular knowledge sharing.
  • Education reimbursement.
  • Memorable anniversary presents.
  • Corporate events and team buildings.
  • Other location-specific benefits.

Job Details

N-iX is seeking a proactive Senior Data Engineer to join our vibrant team. As a Senior Data Engineer, you will play a critical role in designing, developing, and maintaining sophisticated data pipelines, Ontology Objects, and Foundry Functions within Palantir Foundry. Your background in machine learning and data science will be valuable in optimizing data workflows, enabling efficient model deployment, and supporting AI-driven initiatives. The ideal candidate will possess a robust background in cloud technologies, data architecture, and a passion for solving complex data challenges.

Key Responsibilities:

  • Collaborate with cross-functional teams to understand data requirements, and design, implement, and maintain scalable data pipelines in Palantir Foundry, ensuring end-to-end data integrity and optimizing workflows.
  • Gather and translate data requirements into robust and efficient solutions, leveraging your expertise in cloud-based data engineering. Create data models, schemas, and flow diagrams to guide development.
  • Develop, implement, optimize and maintain efficient and reliable data pipelines and ETL/ELT processes to collect, process, and integrate data to ensure timely and accurate data delivery to various business applications, while implementing data governance and security best practices to safeguard sensitive information.
  • Monitor data pipeline performance, identify bottlenecks, and implement improvements to optimize data processing speed and reduce latency.
  • Collaborate with Data Scientists to facilitate model deployment and integration into production environments.
  • Support the implementation of basic ML Ops practices, such as model versioning and monitoring.
  • Assist in optimizing data pipelines to improve machine learning workflows.
  • Troubleshoot and resolve issues related to data pipelines, ensuring continuous data availability and reliability to support data-driven decision-making processes.
  • Stay current with emerging technologies and industry trends, incorporating innovative solutions into data engineering practices, and effectively document and communicate technical solutions and processes.

Tools and skills you will use in this role:

  • Palantir Foundry
  • Python
  • PySpark
  • SQL
  • TypeScript

Required:

  • 5+ years of experience in data engineering, preferably within the pharmaceutical or life sciences industry;
  • Strong proficiency in Python;
  • Hands-on experience with cloud services (e.g., AWS Glue, Azure Data Factory, Google Cloud Dataflow);
  • Expertise in data modeling, data warehousing, and ETL/ELT concepts;
  • Hands-on experience with database systems (e.g., PostgreSQL, MySQL, NoSQL, etc.);
  • Hands-on experience in containerization technologies (e.g., Docker, Kubernetes);
  • Familiarity with ML Ops concepts, including model deployment and monitoring.
  • Basic understanding of machine learning frameworks such as TensorFlow or PyTorch.
  • Exposure to cloud-based AI/ML services (e.g., AWS SageMaker, Azure ML, Google Vertex AI).
  • Experience working with feature engineering and data preparation for machine learning models.
  • Effective problem-solving and analytical skills, coupled with excellent communication and collaboration abilities;
  • Strong communication and teamwork abilities;
  • Understanding of data security and privacy best practices;
  • Strong mathematical, statistical, and algorithmic skills.

Nice to have:

  • Certification in Cloud platforms, or related areas;
  • Experience with search engine Apache Lucene, Web Service Rest API;
  • Familiarity with Veeva CRM, Reltio, SAP, and/or Palantir Foundry;
  • Knowledge of pharmaceutical industry regulations, such as data privacy laws, is advantageous;
  • Previous experience working with JavaScript and TypeScript.

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Ukraine

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Data Analysis Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!