Senior HPC Data Engineer

2 Weeks ago • 2 Years + • Data Analyst

Job Summary

Job Description

This Senior HPC Data Engineer role involves designing and implementing a data service module for an HPC modeling project. Responsibilities include developing high-performance data storage and retrieval systems using HDF5 (or similar) with parallel I/O capabilities, ensuring tight integration with HPC and visualization workflows, optimizing I/O for CPU/GPU workflows, implementing caching and compression strategies, designing data structures for 3D grid data, ensuring data integrity, developing test cases, conducting benchmarking, and documenting the module. The ideal candidate will have strong Python and HDF5 expertise, experience with parallel I/O frameworks, and skills in performance optimization. SQL and RDMS knowledge are also beneficial. The role requires collaboration with development and visualization teams.
Must have:
  • HDF5 expertise & parallel I/O
  • Python proficiency
  • Data service design & implementation for HPC
  • Performance optimization skills
  • Experience with large datasets
Good to have:
  • C++/Rust experience
  • Knowledge of orchestration tools (Airflow, etc.)
  • Familiarity with scientific computing data formats
Perks:
  • Flexible working format
  • Competitive salary
  • Personalized career growth
  • Professional development tools
  • Education reimbursement
  • Corporate events

Job Details

About our customer:

Our client is a global company specializing in software development and consulting who combines science and technology with deep industry expertise to solve complex subsurface and surface challenges in the evolving energy sector.
The client headquartered is located in Canada, with offices around the world. The company serves organizations globally, providing cutting-edge software technology and unparalleled customer support.

Key Software Solutions:
Our client offers a range of reservoir simulation software, including:

  • IMEX – A black oil simulator for primary, secondary, and tertiary recovery processes.
  • GEM – An advanced simulator for compositional, chemical, and unconventional reservoir modeling.
  • STARS – The industry standard for thermal and advanced recovery processes.
  • CMOST – An intelligent optimization and analysis tool that integrates statistical analysis, machine learning, and unbiased data interpretation to determine optimal reservoir solutions.


The company invests in research and development, continuously improving its products and delivering state-of-the-art solutions for energy modeling and optimization.

About the Role:

We are looking for an experienced Data Service Module Engineer to develop and deploy the data service module for the HPC modeling project. This role focuses on implementing high-performance data storage and retrieval systems using HDF5 or similar, with parallel and concurrent I/O capabilities. The ideal candidate will have expertise in designing scalable data services optimized for HPC or distributed workflows, ensuring low latency and high throughput.

 

Key Responsibilities:

  • Design and implement the data service module using HDF5 for efficient data storage and retrieval.
  • Develop parallel and concurrent I/O mechanisms to optimize performance for large-scale datasets.
  • Ensure the module is tightly integrated with HPC and visualization workflows.
  • Optimize I/O operations for CPU/GPU-based workflows to minimize bottlenecks.
  • Implement caching, compression, and other strategies to enhance performance.
  • Design data structures and schemas suitable for storing 3D grid data and other simulation outputs.
  • Ensure data integrity and consistency during concurrent read/write operations.
  • Develop and execute test cases to validate module performance and reliability under various load conditions.
  • Conduct benchmarking to ensure scalability across different hardware configurations.
  • Document the architecture, APIs, and usage guidelines for the data service module.
  • Provide technical support to the development and visualization teams for data integration.

 

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or related fields.
  • 2+ years of experience in developing and deploying data services for HPC or similar systems.
  • Proven expertise in Big Data
  • Proven expertise with HDF5 or similar, in parallel I/O operations. Equivalent experience in distributed systems is also applicable.
  • Programming: Strong proficiency in Python. An additional experience with C++ and/or Rust is a nice to have and it will be considered a plus
  • HDF5 Expertise: In-depth knowledge of HDF5 APIs and advanced features like parallel HDF5.
  • Parallel I/O: Experience with MPI I/O, POSIX I/O, or similar frameworks for concurrent/parallel data access.
  • Performance Optimization: Skills in profiling and optimizing I/O operations for large datasets.
  • Proficiency in SQL and experience with any RDMS
  • Might be a plus: knowledge of at least one orchestration and scheduling tool, for example, Airflow, Prefect, Dagster, etc.
  • Strong problem-solving skills and ability to work in a multidisciplinary team.
  • Excellent communication skills for cross-team collaboration and documentation.

 

Preferred Qualifications:

  • Familiarity with data formats used in scientific computing, 3D visualization, and simulation workflows.

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

Similar Jobs

Spruce Systems - Software Engineer, Cross-Platform Rust

Spruce Systems

(Remote)
1 Year ago
ByteDance - Site Reliability Engineer - Network Automation - 2025 start

ByteDance

Singapore (On-Site)
3 Months ago
The Walt Disney Company - Artist / Painter - Full Time (HKD$6,000 Special Welcome Reward)

The Walt Disney Company

Hong Kong (On-Site)
5 Months ago
ByteDance - Service Framework Software Engineer Intern

ByteDance

Singapore (On-Site)
1 Month ago
Proof of Play - Senior Blockchain Infrastructure Engineer

Proof of Play

(Remote)
1 Month ago
Netflix - Data Scientist (L5) - App QoE

Netflix

United States (Remote)
2 Weeks ago
Activision - Game Security Analyst

Activision

Los Angeles, California, United States (Hybrid)
1 Week ago
Hawk Eye Innovations - Data Processing Assistant

Hawk Eye Innovations

Atlanta, Georgia, United States (On-Site)
1 Month ago
Meta - Data Engineer Intern

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
PlayStation Global - Analytics Engineering Manager (Partner Insights - Analytics and Visualisation)

PlayStation Global

London, England, United Kingdom (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Windranger Labs - Protocol Engineer

Windranger Labs

Central Sulawesi, Indonesia (Remote)
1 Month ago
OKX - Graduate Hire 2024/25 - Blockchain Engineer

OKX

Hong Kong (On-Site)
6 Months ago
Drive mode - Staff Backend Engineer

Drive mode

Mountain View, California, United States (Hybrid)
3 Months ago
ByteDance - Site Reliability Engineer Graduate (Technical Infrastructure) - 2025 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
ByteDance - Senior Software Development Engineer - Distributed KV System

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
ByteDance - Service Framework Software Engineer Intern

ByteDance

Singapore (On-Site)
1 Month ago
Coda - Senior/Staff Software Engineer

Coda

(Remote)
1 Year ago
Google - Information Security Engineer

Google

Munich, Bavaria, Germany (On-Site)
2 Days ago
Google - Software Engineering Manager II, Mobile (iOS), Core

Google

State Of Minas Gerais, Brazil (On-Site)
1 Week ago
Google - Senior Staff Software Engineer, Platform Security

Google

Mountain View, California, United States (On-Site)
2 Days ago

Get notifed when new similar jobs are uploaded

Jobs in Argentina

Devrev - Account Executive, SMB (US Territory)

Devrev

Buenos Aires, Buenos Aires, Argentina (Hybrid)
1 Day ago
Red Point Labs - 2D Artist (Remote OK)

Red Point Labs

Argentina (Remote)
11 Months ago
Evolution - Studio Manager

Evolution

Buenos Aires, Buenos Aires, Argentina (On-Site)
7 Months ago
GoFundMe - Senior Software Engineer (Frontend)

GoFundMe

Buenos Aires, Buenos Aires, Argentina (On-Site)
6 Hours ago
Evolution - Service Support Specialist (On-Site Canning, Ezeiza)

Evolution

Buenos Aires, Buenos Aires, Argentina (On-Site)
4 Months ago
Salesforce - Business Development Representative - Argentina

Salesforce

Buenos Aires, Buenos Aires, Argentina (On-Site)
4 Months ago
pixel owl  - Game Designer / Game Analyst

pixel owl

Buenos Aires, Buenos Aires, Argentina (On-Site)
1 Day ago
GoFundMe - Senior Software Engineer (Pricing)

GoFundMe

Buenos Aires, Buenos Aires, Argentina (On-Site)
6 Hours ago
Dialpad AI - Sr. Product Manager

Dialpad AI

Buenos Aires, Buenos Aires, Argentina (Hybrid)
20 Hours ago
Google - Solutions Consultant III

Google

Buenos Aires, Buenos Aires, Argentina (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Salesforce - Solution Architect - Tableau

Salesforce

Tokyo, Japan (Remote)
1 Month ago
Google - Data Engineer, Google Trust and Safety

Google

Austin, Texas, United States (On-Site)
2 Days ago
Dream Sports - SDE 3 - Backend (Platform)

Dream Sports

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Zazz - Data Engineer (6–8 Years) Adhoc

Zazz

India (On-Site)
4 Months ago
Google - Data Engineering Lead, Google Trust and Safety

Google

Austin, Texas, United States (On-Site)
2 Days ago
Playtika - Data Scientist

Playtika

Israel (On-Site)
1 Month ago
Gaming Innovation Group  - Data Science Apprentice

Gaming Innovation Group

St. Julian's, Malta (On-Site)
4 Weeks ago
Playrix - Senior Data Analyst (Attribution)

Playrix

Almaty, Almaty Region, Kazakhstan (Remote)
6 Months ago
Varonis  - C# Software Engineer

Varonis

Herzliya, Tel Aviv District, Israel (On-Site)
6 Months ago
Velotio Technologies - Data Architect

Velotio Technologies

Maharashtra, India (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded