Data Architect (Azure and Databricks)

1 Month ago • 6-8 Years • Data Analyst

Job Summary

Job Description

This role involves designing and implementing scalable, secure, and efficient data architectures on the Databricks platform. Responsibilities include leading the technical design of data warehouse/data lake migrations from legacy systems, developing data engineering frameworks, establishing CI/CD pipelines, implementing data catalog solutions and governance frameworks, creating technical specifications, providing technical leadership, collaborating with cross-functional teams, and ensuring data architectures meet security and compliance requirements. The ideal candidate will have extensive experience in data architecture design and implementation, strong software engineering skills (Python or Scala), experience with Databricks, and knowledge of healthcare data requirements and regulations.
Must have:
  • Extensive data architecture experience
  • Strong software engineering (Python/Scala)
  • Databricks Lakehouse expertise
  • CI/CD pipeline implementation
  • Data catalog & governance
  • Healthcare data knowledge
Good to have:
  • AWS, Azure, GCP experience
  • Terraform, CloudFormation
  • Collibra, Alation
  • HL7, FHIR knowledge

Job Details

Overview 
We are seeking an experienced Data Architect with extensive expertise in designing and implementing modern data architectures. This role requires strong software engineering principles, hands-on coding abilities, and experience building data engineering frameworks. The ideal candidate will have a proven track record of implementing Databricks-based solutions in the healthcare industry, with expertise in data catalog implementation and governance frameworks. 
 
About the Role 
As a Data Architect, you will be responsible for designing and implementing scalable, secure, and efficient data architectures on the Databricks platform. You will lead the technical design of data migration initiatives from legacy systems to modern Lakehouse architecture, ensuring alignment with business requirements, industry best practices, and regulatory compliance. 
 
Key Responsibilities 

Design and implement modern data architectures using Databricks Lakehouse platform 
Lead the technical design of Data Warehouse/Data Lake migration initiatives from legacy systems 
Develop data engineering frameworks and reusable components to accelerate delivery 
Establish CI/CD pipelines and infrastructure-as-code practices for data solutions 
Implement data catalog solutions and governance frameworks 
Create technical specifications and architecture documentation 
Provide technical leadership to data engineering teams 
Collaborate with cross-functional teams to ensure alignment of data solutions 
Evaluate and recommend technologies, tools, and approaches for data initiatives 
Ensure data architectures meet security, compliance, and performance requirements 
Mentor junior team members on data architecture best practices 
Stay current with emerging technologies and industry trends 

Qualifications 
Extensive experience in data architecture design and implementation 
Strong software engineering background with expertise in Python or Scala 
Proven experience building data engineering frameworks and reusable components 
Experience implementing CI/CD pipelines for data solutions 
Expertise in infrastructure-as-code and automation 
Experience implementing data catalog solutions and governance frameworks 
Deep understanding of Databricks platform and Lakehouse architecture 
Experience migrating workloads from legacy systems to modern data platforms 
Strong knowledge of healthcare data requirements and regulations 
Experience with cloud platforms (AWS, Azure, GCP) and their data services 
Bachelor's degree in Computer Science, Information Systems, or related field; advanced degree preferred 

Technical Skills 
Programming languages: Python and/or Scala (required) 
Data processing frameworks: Apache Spark, Delta Lake 
CI/CD tools: Jenkins, GitHub Actions, Azure DevOps 
Infrastructure-as-code (optional): Terraform, CloudFormation, Pulumi 
Data catalog tools: Databricks Unity Catalog, Collibra, Alation 
Data governance frameworks and methodologies 
Data modeling and design patterns 
API design and development 
Cloud platforms: AWS, Azure, GCP 
Container technologies: Docker, Kubernetes 
Version control systems: Git 
SQL and NoSQL databases 
Data quality and testing frameworks 

Optional - Healthcare Industry Knowledge 
Healthcare data standards (HL7, FHIR, etc.) 
Clinical and operational data models 
Healthcare interoperability requirements 
Healthcare analytics use cases 
undefinedundefinedundefined

Similar Jobs

Luxoft - DevOps Engineer with Azure

Luxoft

Pune, Maharashtra, India (On-Site)
4 Months ago
WinZO - Data Engineer

WinZO

New Delhi, Delhi, India (On-Site)
1 Day ago
Hitachi - Kubernetes Engineer

Hitachi

Pune, Maharashtra, India (On-Site)
6 Months ago
Starkflow - Principal Full Stack Developer

Starkflow

Karnataka, India (Hybrid)
1 Month ago
Conga - Staff Software Engineer

Conga

Bengaluru, Karnataka, India (On-Site)
19 Hours ago
DraftKings - Lead Data Science Engineer, Search

DraftKings

Boston, Massachusetts, United States (On-Site)
2 Months ago
ION - Data Associate - Wealthmonitor

ION

Budapest, Hungary (On-Site)
6 Months ago
Next Level Business Services - Information Management Architect (Full Time)

Next Level Business Services

Milford, Ohio, United States (On-Site)
6 Months ago
PwC - IN-Senior Associate_Azure data Engineer_Data &  Analytics_Advisory_PAN India

PwC

Bengaluru, Karnataka, India (On-Site)
7 Months ago
The Walt Disney Company - Sr Data Analyst

The Walt Disney Company

New York, New York, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Microsoft - Principal Data Science Manager

Microsoft

Redmond, Washington, United States (Hybrid)
3 Days ago
Actian - Sustenance Engineer - Actian Data Platform - Bangalore/Pune

Actian

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Velotio Technologies - Lead Engineer (Python)

Velotio Technologies

(Remote)
6 Days ago
Rennsportgg - Site Reliability Engineer

Rennsportgg

Munich, Bavaria, Germany (Remote)
1 Month ago
Rackspace Technology - Data Engineer III

Rackspace Technology

Vietnam (Remote)
1 Month ago
Microsoft - Test Engineer

Microsoft

Penang, Malaysia (On-Site)
3 Days ago
AppMySite - Lead Back-End Developer

AppMySite

Delhi, India (Remote)
9 Months ago
Microsoft - Senior Software Engineer – CIEng

Microsoft

Hyderabad, Telangana, India (On-Site)
1 Week ago
Inworld AI - Staff Platform Engineer, MLOps

Inworld AI

Vancouver, British Columbia, Canada (On-Site)
1 Week ago
Canva - Staff Technical Program Manager - Platform Org

Canva

Sydney, New South Wales, Australia (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in Gurugram, Haryana, India

Lakshya Digital - Senior 3D Art Lead - Character

Lakshya Digital

Haryana, India (On-Site)
1 Month ago
Assystems - Sr. ELV Engineer

Assystems

Gurugram, Haryana, India (On-Site)
6 Months ago
Nagarro - Staff Engineer, Java Fullstack

Nagarro

India (Remote)
6 Months ago
Luminar Technologies - Security Admin Engineer , Cybersecurity Operations

Luminar Technologies

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
PwC - IN_Director_Decarbonization_Decarbonization_Advisory_Mumbai

PwC

Mumbai, Maharashtra, India (On-Site)
6 Months ago
PhonePe - Software Engineer (Backend, 7-10 Yrs)

PhonePe

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Loyalty Juggernaut - Product Engineer (Python)

Loyalty Juggernaut

Hyderabad, Telangana, India (On-Site)
1 Year ago
Nagarro - Senior Staff Engineer, Delivery ETIL

Nagarro

India (Remote)
6 Months ago
PwC - Senior Associate - SAP ABAP - GDC

PwC

Kolkata, West Bengal, India (On-Site)
7 Months ago
Google - Senior Machine Learning Physical Design Engineer

Google

Bengaluru, Karnataka, India (On-Site)
2 Days ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

The Walt Disney Company - Manager, Software Engineering – Media and Playback Data Processing

The Walt Disney Company

Seattle, Washington, United States (On-Site)
1 Month ago
Nielsen Holdings - ETL (AWS Glue)/Informatica-Redshift ,Java ,SQL

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Google - Customer Engineer, Data Analytics, Google Cloud

Google

Austin, Texas, United States (On-Site)
2 Days ago
Go Fund Me - Senior Data Engineer

Go Fund Me

Buenos Aires, Buenos Aires, Argentina (Remote)
5 Months ago
Playrix - Senior Data Analyst (Attribution)

Playrix

Serbia (Remote)
6 Months ago
Nintendo - Compensation Analyst

Nintendo

Redmond, Washington, United States (Hybrid)
2 Weeks ago
ComeOn Group - Responsible Gaming Analyst

ComeOn Group

St. Julian's, Malta (Hybrid)
1 Month ago
Tesla - SQL Database Optimization Engineer

Tesla

Athens, Greece (On-Site)
2 Months ago
N-iX - Senior/Lead Data Engineer

N-iX

Ukraine (Remote)
2 Weeks ago
Fortis Games - Head of Analytics

Fortis Games

Canada (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded