Middle HPC Data Engineer

1 Month ago • 2 Years + • Data Analyst

Job Summary

Job Description

This role involves designing and implementing a high-performance data service module for an HPC modeling project. Responsibilities include developing parallel and concurrent I/O mechanisms using HDF5 for efficient data storage and retrieval of large-scale datasets. The engineer will optimize I/O operations for CPU/GPU workflows, implement caching and compression strategies, design data structures for 3D grid data, ensure data integrity, and conduct performance testing and benchmarking. The position requires strong programming skills (C++, Python, GoLang, or Fortran), expertise with HDF5 and parallel I/O frameworks (MPI I/O, POSIX I/O), and experience in optimizing I/O for large datasets. Collaboration with development and visualization teams is essential.
Must have:
  • HDF5 expertise for parallel I/O
  • C++, Python, GoLang, or Fortran proficiency
  • Experience with HPC data services
  • Parallel I/O framework knowledge (MPI, POSIX)
  • Performance optimization skills
Good to have:
  • Knowledge of Airflow, Prefect, or Dagster
  • Familiarity with scientific computing data formats
  • 3D visualization and simulation workflow experience
  • SQL and RDMS experience
Perks:
  • Flexible working format
  • Competitive salary and benefits
  • Personalized career growth
  • Professional development tools
  • Education reimbursement
  • Corporate events

Job Details

About our customer:

Our client is a global company specializing in software development and consulting who combines science and technology with deep industry expertise to solve complex subsurface and surface challenges in the evolving energy sector.
The client headquartered is located in Canada, with offices around the world. The company serves organizations globally, providing cutting-edge software technology and unparalleled customer support.

Key Software Solutions:
Our client offers a range of reservoir simulation software, including:

  • IMEX – A black oil simulator for primary, secondary, and tertiary recovery processes.
  • GEM – An advanced simulator for compositional, chemical, and unconventional reservoir modeling.
  • STARS – The industry standard for thermal and advanced recovery processes.
  • CMOST – An intelligent optimization and analysis tool that integrates statistical analysis, machine learning, and unbiased data interpretation to determine optimal reservoir solutions.


The company invests in research and development, continuously improving its products and delivering state-of-the-art solutions for energy modeling and optimization.

About the Role:

We are looking for an experienced Data Service Module Engineer to develop and deploy the data service module for the HPC modeling project. This role focuses on implementing high-performance data storage and retrieval systems using HDF5 or similar, with parallel and concurrent I/O capabilities. The ideal candidate will have expertise in designing scalable data services optimized for HPC or distributed workflows, ensuring low latency and high throughput.

 

Key Responsibilities:

  • Design and implement the data service module using HDF5 for efficient data storage and retrieval.
  • Develop parallel and concurrent I/O mechanisms to optimize performance for large-scale datasets.
  • Ensure the module is tightly integrated with HPC and visualization workflows.
  • Optimize I/O operations for CPU/GPU-based workflows to minimize bottlenecks.
  • Implement caching, compression, and other strategies to enhance performance.
  • Design data structures and schemas suitable for storing 3D grid data and other simulation outputs.
  • Ensure data integrity and consistency during concurrent read/write operations.
  • Develop and execute test cases to validate module performance and reliability under various load conditions.
  • Conduct benchmarking to ensure scalability across different hardware configurations.
  • Document the architecture, APIs, and usage guidelines for the data service module.
  • Provide technical support to the development and visualization teams for data integration.

 

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or related fields.
  • 2+ years of experience in developing and deploying data services for HPC or similar systems.
  • Proven expertise with HDF5 or similar, in parallel I/O operations. Equivalent experience in distributed systems is also applicable.
  • Programming: Strong proficiency in (at least one): C++, Python, GoLang, or Fortran.
  • HDF5 Expertise: In-depth knowledge of HDF5 APIs and advanced features like parallel HDF5.
  • Parallel I/O: Experience with MPI I/O, POSIX I/O, or similar frameworks for concurrent/parallel data access.
  • Performance Optimization: Skills in profiling and optimizing I/O operations for large datasets.
  • Proficiency in SQL and experience with any RDMS
  • Might be a plus: knowledge of at least one orchestration and scheduling tool, for example, Airflow, Prefect, Dagster, etc.
  • Strong problem-solving skills and ability to work in a multidisciplinary team.
  • Excellent communication skills for cross-team collaboration and documentation.

 

Preferred Qualifications:

  • Familiarity with data formats used in scientific computing, 3D visualization, and simulation workflows.

We offer:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

Similar Jobs

Skytex Unmanned Aerial Solutions - Unity (C#) Developer

Skytex Unmanned Aerial Solutions

Bengaluru, Karnataka, India (On-Site)
10 Months ago
Sperasoft - Senior Landscape Artist

Sperasoft

Vojvodina, Serbia (Hybrid)
1 Week ago
Google - Software Engineer, Early Career, iOS, Photos

Google

Sydney, New South Wales, Australia (On-Site)
2 Weeks ago
Canva - Staff Frontend Engineer - Apps API Platform

Canva

Sydney, New South Wales, Australia (Remote)
1 Month ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

(Remote)
1 Month ago
Rackspace Technology - Senior Practice Manager - Data Engineering

Rackspace Technology

California, United States (Remote)
4 Months ago
Google - Staff Data Scientist, Research, Search AI

Google

Mountain View, California, United States (On-Site)
2 Days ago
ION - Data Associate - KYC6

ION

Budapest, Hungary (On-Site)
6 Months ago
Google - Senior Data Scientist, Research, Global Networking and Infrastructure

Google

Atlanta, Georgia, United States (On-Site)
2 Days ago
Truecaller - Data Architect

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Staff Software Engineer, Infrastructure, Google Cloud

Google

San Francisco, California, United States (On-Site)
2 Days ago
Google - Tech Lead, Software Engineering, Black Community Inclusion

Google

Belo Horizonte, State Of Minas Gerais, Brazil (On-Site)
5 Months ago
Google - Software Engineer II, Pixel Graphics

Google

London, England, United Kingdom (On-Site)
2 Weeks ago
Google - Software Engineer II, User Protections, Core

Google

Zürich, Zurich, Switzerland (On-Site)
1 Week ago
Google - Senior Software Engineer, Authentication and Autofill, Android

Google

Beijing, Beijing, China (On-Site)
2 Weeks ago
Avalanche Studios Group - Senior Animator

Avalanche Studios Group

Malmö, Skåne County, Sweden (Hybrid)
5 Months ago
Google - Software Engineer III, Google Cloud Platforms

Google

(On-Site)
4 Months ago
Fluence - Lead Engineer - Advanced Battery Modules

Fluence

Houston, Texas, United States (Hybrid)
6 Months ago
Google - Staff Software Engineer, Google Cloud Business Platforms

Google

Kirkland, Washington, United States (On-Site)
1 Week ago
Bethesda - Senior Animator (Creatures)

Bethesda

Austin, Texas, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Argentina

GoFundMe - Senior Software Engineer (Classy Live)

GoFundMe

Buenos Aires, Buenos Aires, Argentina (On-Site)
5 Hours ago
Google - Technical Solutions Consultant, gTech Ads Sellside (Spanish, English)

Google

Buenos Aires, Buenos Aires, Argentina (On-Site)
2 Weeks ago
GoFundMe - Senior Cloud Ops Engineer

GoFundMe

Buenos Aires, Buenos Aires, Argentina (Hybrid)
5 Hours ago
GoFundMe - Senior Software Engineer (Pricing)

GoFundMe

Buenos Aires, Buenos Aires, Argentina (On-Site)
5 Hours ago
Devrev - Revenue Operations Analyst

Devrev

Buenos Aires, Buenos Aires, Argentina (On-Site)
1 Day ago
Go Fund Me - Senior Software Engineer (Frontend)

Go Fund Me

Buenos Aires, Buenos Aires, Argentina (On-Site)
4 Months ago
GoFundMe - Senior Software Engineer

GoFundMe

Buenos Aires, Buenos Aires, Argentina (Hybrid)
5 Hours ago
GoFundMe - Software Engineer (Backend)

GoFundMe

Buenos Aires, Buenos Aires, Argentina (On-Site)
5 Hours ago
GoFundMe - Staff Software Engineer

GoFundMe

Buenos Aires, Buenos Aires, Argentina (On-Site)
5 Hours ago
Evolution - Technical Integration Specialist

Evolution

Buenos Aires, Buenos Aires, Argentina (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Google - Energy Program Manager, Strategic Business Operations

Google

Paris, Île-de-France, France (On-Site)
1 Week ago
BabaEntertainment - BI Developer

BabaEntertainment

California, United States (On-Site)
2 Weeks ago
Velotio Technologies - Senior Engineer (Data Engineer)

Velotio Technologies

Maharashtra, India (Remote)
1 Month ago
Match Group - Data Science II (Marketing Analytics)

Match Group

San Francisco, California, United States (Hybrid)
6 Months ago
Playtika - Data Science Expert

Playtika

Israel (On-Site)
3 Months ago
Luxoft - Senior Data Ops Engineer

Luxoft

Delhi, India (On-Site)
5 Months ago
Google - Analytical Lead, Large Customer Sales

Google

Buenos Aires, Buenos Aires, Argentina (On-Site)
1 Week ago
Dream Sports - Senior Manager - Business Analytics

Dream Sports

Mumbai, Maharashtra, India (On-Site)
7 Months ago
ByteDance - Data Engineer, Cloud and System

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
PENN Interactive - Senior Technical Product Manager, Data

PENN Interactive

Philadelphia, Pennsylvania, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded