Company Description
Bosch Global Software Technologies Private Limited is a 100% owned subsidiary of Robert Bosch GmbH, one of the world's leading global suppliers of technology and services, offering end-to-end Engineering, IT and Business Solutions. With over 27,000 associates, it is Bosch's largest software development center outside Germany, making it the Technology Powerhouse of Bosch in India, with a global footprint and presence in the US, Europe and the Asia Pacific region.
Job Description
Roles & Responsibilities:
Data Architecture & Engineering
- Design and implement end-to-end data pipelines for ingestion, transformation, and storage of structured, semi-structured, and time-series data.
- Build both real-time and batch processing frameworks using Databricks, supporting scalable analytics and AI workloads (a minimal sketch follows this list).
- Develop and maintain ETL/ELT workflows using Python and SQL, ensuring reusability and maintainability.
- Architect and optimize data lakes/lakehouses (Azure Synapse, Delta Lake, BigQuery, or Snowflake) for efficient querying and cost control.
- Design and manage NoSQL databases (MongoDB) and time-series databases (InfluxDB, TimescaleDB, Azure Data Explorer) for sensor and operational data.
- Enable AI/ML readiness by developing feature pipelines, managing datasets, and integrating with model inference systems.
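To give a concrete flavour of this work, below is a minimal batch-ETL sketch in PySpark on Databricks that lands curated aggregates in Delta Lake. The paths, column names, and aggregation logic are illustrative assumptions, not prescribed by the role.

```python
from pyspark.sql import SparkSession, functions as F

# Assumes a Databricks-style environment where Delta Lake support is available;
# paths, columns, and metrics below are illustrative placeholders.
spark = SparkSession.builder.appName("sensor-etl").getOrCreate()

raw = (
    spark.read.json("/mnt/landing/sensor_events/")   # hypothetical landing zone
    .withColumn("event_ts", F.to_timestamp("event_ts"))
    .dropDuplicates(["device_id", "event_ts"])
)

daily = (
    raw.groupBy("device_id", F.to_date("event_ts").alias("event_date"))
    .agg(
        F.avg("temperature").alias("avg_temperature"),
        F.max("vibration").alias("max_vibration"),
    )
)

# Write curated aggregates as a Delta table, partitioned for efficient querying.
(
    daily.write.format("delta")
    .mode("overwrite")
    .partitionBy("event_date")
    .save("/mnt/curated/device_daily_metrics")
)
```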
Cloud & Integration
- Orchestrate and monitor data pipelines using Azure Data Factory, Azure Functions, and Event Hub for real-time ingestion and transformation.
- Build serverless, event-driven applications using Azure Functions (Python-based), AWS Lambda, or GCP Cloud Functions (see the sketch after this list).
- Implement hybrid data integration between edge, on-prem, and cloud using secure APIs, message queues, and connectors.
- Integrate data from IoT devices, ERP, MES, PLM, and simulation tools to enable enterprise-wide digital twin insights.
- Develop containerized microservices using Docker and Kubernetes to support portable, cloud-agnostic deployments across Azure, AWS, and GCP.
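As an illustration of the serverless, event-driven pattern above, here is a minimal sketch using the Azure Functions Python v2 programming model with an Event Hub trigger. The hub name, connection-setting name, and payload shape are assumptions.

```python
import json
import logging

import azure.functions as func

app = func.FunctionApp()

# Event hub name and connection-setting name are illustrative placeholders.
@app.event_hub_message_trigger(
    arg_name="event",
    event_hub_name="sensor-telemetry",
    connection="EVENTHUB_CONNECTION",
)
def ingest_telemetry(event: func.EventHubEvent) -> None:
    # Parse one telemetry message and log a summary; a real pipeline would
    # validate the payload and forward it to a time-series store.
    payload = json.loads(event.get_body().decode("utf-8"))
    logging.info(
        "Device %s reported %s readings",
        payload.get("deviceId"),
        len(payload.get("readings", [])),
    )
```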
Performance, Security & Governance
- Implement frameworks for data quality, lineage, and observability (Great Expectations, Azure Purview, OpenMetadata); an illustrative sketch follows this list.
- Enforce data governance, privacy, and compliance with standards such as GDPR, ISO 27001, and industry regulations.
- Optimize resource utilization and cost across compute, storage, and database layers.
- Establish data retention, access control, and lifecycle policies across multi-tenant environments.
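The data-quality item above is illustrated by the framework-agnostic sketch below: declarative, named expectations evaluated against a DataFrame. In practice this role would express such rules in a tool like Great Expectations; the column names and thresholds here are hypothetical.

```python
import pandas as pd

# Framework-agnostic sketch of declarative data-quality rules; production work
# would typically encode these in a dedicated tool such as Great Expectations.
RULES = {
    "device_id must be non-null":
        lambda df: df["device_id"].notna().all(),
    "temperature within plausible range":
        lambda df: df["temperature"].between(-40, 150).all(),
    "no duplicate (device_id, event_ts) pairs":
        lambda df: not df.duplicated(["device_id", "event_ts"]).any(),
}

def run_quality_checks(df: pd.DataFrame) -> dict[str, bool]:
    # Evaluate every rule and return a name -> pass/fail map for observability.
    return {name: bool(check(df)) for name, check in RULES.items()}

if __name__ == "__main__":
    sample = pd.DataFrame({
        "device_id": ["a1", "a2", None],
        "event_ts": pd.to_datetime(["2024-01-01", "2024-01-01", "2024-01-02"]),
        "temperature": [21.5, 300.0, 19.0],
    })
    print(run_quality_checks(sample))  # two of the three rules fail here
```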
Collaboration & Strategy
- Collaborate with cloud architects, AI/ML engineers, and domain experts to align data architecture with Industry 4.0 and Digital Twin goals.
- Evaluate and introduce emerging technologies such as vector databases, streaming analytics, and data mesh frameworks.
- Mentor junior engineers and promote best practices in Pythonic coding, DevOps, and GitOps workflows.
- Develop and maintain data engineering accelerators and reusable frameworks for internal adoption, as sketched below.
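As a toy example of such an accelerator, the sketch below standardises logging and timing for pipeline steps behind a single decorator. It is purely illustrative; any real internal framework would differ.

```python
import logging
import time
from functools import wraps
from typing import Callable

# Hypothetical internal accelerator: one decorator gives every pipeline step
# consistent logging, timing, and error reporting.
def pipeline_step(name: str) -> Callable:
    def decorator(fn: Callable) -> Callable:
        @wraps(fn)
        def wrapper(*args, **kwargs):
            start = time.perf_counter()
            logging.info("step %s started", name)
            try:
                result = fn(*args, **kwargs)
                logging.info("step %s finished in %.2fs",
                             name, time.perf_counter() - start)
                return result
            except Exception:
                logging.exception("step %s failed", name)
                raise
        return wrapper
    return decorator

@pipeline_step("normalise-readings")
def normalise(readings: list[dict]) -> list[dict]:
    # Example step: convert Fahrenheit readings to Celsius.
    return [{**r, "temperature_c": (r["temperature_f"] - 32) * 5 / 9}
            for r in readings]
```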
Qualifications
Required qualifications:
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
- 8+ years of experience in data engineering, analytics, or big data systems.
Mandatory skills:
- Strong programming skills in Python and SQL for data transformation, orchestration, and automation.
- Expertise in Azure data services (Synapse, Data Factory, Event Hub, Azure Functions, Databricks).
- Hands-on experience with MongoDB, Cosmos DB, and time-series databases such as InfluxDB, TimescaleDB, or Azure Data Explorer (ADX).
- Proven experience with streaming frameworks (Kafka, Event Hub, Kinesis) and workflow orchestrators (Airflow, Argo, or Prefect); see the DAG sketch after this list.
- Proficiency in Docker and Kubernetes for containerization and scalable deployment.
- Familiarity with data lake/lakehouse architectures, NoSQL models, and cloud-agnostic patterns.
- Knowledge of CI/CD pipelines and infrastructure-as-code tools (Terraform, Bicep, ARM templates).
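For orientation, a minimal Airflow DAG of the kind this skill set implies is sketched below; the DAG id, schedule, and task bodies are placeholders.

```python
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

# DAG id, schedule, and task bodies are illustrative placeholders.
def extract(**context):
    print("pull raw events from the source system")

def transform(**context):
    print("clean and aggregate the extracted events")

with DAG(
    dag_id="sensor_daily_etl",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",   # 'schedule_interval' on Airflow versions before 2.4
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    transform_task = PythonOperator(task_id="transform", python_callable=transform)
    extract_task >> transform_task   # transform runs only after extract succeeds
```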
Preferred skills:
- Experience with industrial IoT, Digital Twin data models, and protocols such as OPC-UA and MQTT (see the subscriber sketch after this list).
- Exposure to edge-to-cloud data flows and predictive maintenance or anomaly detection solutions.
- Knowledge of data quality, governance, and metadata management tools.
- Strong communication and analytical skills to align data solutions with business and operational KPIs.
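The MQTT item above might look like the following in practice: a small paho-mqtt subscriber for plant-floor telemetry. The broker address, topic layout, and payload shape are assumptions, and the snippet targets paho-mqtt 1.x.

```python
import json

import paho.mqtt.client as mqtt

# Broker address and topic layout are illustrative; assumes paho-mqtt 1.x
# (paho-mqtt 2.x additionally requires a CallbackAPIVersion argument to Client()).
def on_message(client, userdata, msg):
    reading = json.loads(msg.payload)
    print(f"{msg.topic}: temperature={reading.get('temperature')}")

client = mqtt.Client()
client.on_message = on_message
client.connect("broker.example.local", 1883)   # hypothetical plant-floor broker
client.subscribe("factory/+/telemetry")        # one topic per machine
client.loop_forever()
```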
Additional Information
Position Overview
We are seeking a highly skilled Senior Data Engineer to design, build, and optimize large-scale, cloud-native data platforms that power Digital Twin and Industrial AI solutions. This role focuses on developing high-performance data ingestion and transformation pipelines that unify IoT, enterprise, and AI/ML data, enabling real-time insights, scalability, and interoperability across hybrid environments.