Senior Telemetry Data Engineer

1 Month ago • 4-8 Years • Data Analyst • DevOps • Undisclosed

Job Summary

Job Description

The Senior Telemetry Data Engineer at Microsoft's Cloud Operations + Innovation (CO+I) team will design and deliver automated solutions for monitoring and alerting on data center critical environment resources. This role involves working with massive amounts of real-time data, leveraging machine learning models for anomaly detection, and utilizing cutting-edge technologies within a Lakehouse architecture. Responsibilities include designing telemetry data ingestion and processing systems, implementing anomaly detection systems, defining data models, and ensuring high-frequency, low-latency data pipelines. The engineer will collaborate with cross-functional teams, ensuring interoperability and high coverage of data center signals. Experience with KQL, Python, GoLang, or Spark is required, along with expertise in processing data from networking protocols.
Must have:
  • Subject matter expertise in machine learning models for anomaly detection
  • Experience with data lakes and real-time data streams
  • Proficiency in KQL, Python, GoLang, or Spark
  • Experience building AI/ML applications for IT operations
  • Bachelor's or master's degree in a related field
Good to have:
  • Experience designing telemetry systems for data center networks
  • Familiarity with HVAC, CRAC, AHU, Chillers, and other critical environment equipment
  • Knowledge of incident management and data center operations
  • Cloud computing certifications
Perks:
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

Job Details

Overview

Microsoft is on a mission to empower every person and every organization on the planet to achieve more. Our culture is centered on embracing a growth mindset, a theme of inspiring excellence, and encouraging teams and leaders to bring their best each day. In doing so, we create life-changing innovations that impact billions of lives around the world. You can help us achieve our mission.

Cloud Operations + Innovation (CO+I) is the engine that powers Microsoft’s core cloud platforms and services that millions of people use every day. With more than 95% of Fortune 500 businesses on Azure, 180 million using Office 365, and millions using other services – all running on Microsoft's cloud infrastructure – CO+I builds and operates the foundation upon which Microsoft’s mission to empower every person and organization comes to life.

Are you passionate about cloud computing? Do you get excited about taking a hands-on approach to transforming Microsoft’s most critical business through investigation, data analysis, and automation? If so, come and help us build the most reliable & efficient datacenter infrastructure on the planet. The CO+I Critical Environment Systems Intelligence (CESI) team is responsible for designing and delivering solutions to support global datacenter operations and to improve availability. CESI is helping to drive CO+I’s transition to a customer-centric, data-driven, observability-based, live-service culture. As a Data Engineer, you will be a key player in this transition.

As a Senior Data Engineer on the CO+I Critical Environment Service Intelligence (CESI) team, you partner and collaborate on the design and delivery of automated solutions to monitor, detect, and alert on data center critical environment mechanical and electrical resources. You will collaborate with other CO+I teams to contribute to and benefit from their work to ensure that we are constantly improving across the fleet. You will work with massive amounts of data with low latency requirements across cutting-edge technologies, with the potential for significant impact on both internal partners and external customers.

 

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day. 

Qualifications

Qualifications: 

  • Subject matter expertise in supervised and unsupervised machine learning models for anomaly detection.
  • Demonstrated subject matter expertise in utilizing data lakes within Lakehouse architectures to process, aggregate, and manage real-time data streams from cloud-based services. 
  • Demonstrated subject matter expertise in managing and processing large-scale data formats with a focus on real-time serialization and deserialization to ensure low latency during data handling. This includes advanced proficiency in Kusto Query Language (KQL) and proficiency in coding with Python, GoLang, or Spark.
  • Experience with generative AI or Copilots for troubleshooting data center environments. 
  • Expertise in processing data frames from networking layers and protocols, including BGP, TCP/IP, and GPRS Tunneling Protocol (GTP).
  • Proven experience building applications using artificial intelligence (AI) techniques, including machine learning (ML) and data science, to enhance and automate various IT operations (AIOps).
  • Bachelor's or master’s degree in computer science, data engineering, or a related field. 
  • Excellent problem-solving skills and attention to detail. 
  • Ability to work collaboratively with cross-functional teams. 
  • Strong written and verbal communication skills. 

Preferred Qualifications: 

  • In-depth experience in designing and implementing telemetry systems for data center networks.
  • Familiarity with HVAC, CRAC, AHU, Chillers, and other critical environment equipment. 
  • Knowledge of incident management and data center operations. 
  • Certifiable knowledge in cloud computing. 

 

About Us: We are committed to maintaining the highest standards of operational excellence in our data centers. Join us in our mission to enhance our telemetry capabilities and ensure the reliability and efficiency of our critical environments. 

 

 

Background Check Requirements: 

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

 

#COICareers  

 

Responsibilities

As a Senior Technical Program Manager in DC Critical Environments, you will: 

  • Drive a program that covers end-to-end monitoring and processing of critical environment (CE) infrastructure telemetry for all leased sites, to bring those sites on par with owned datacenter sites.
  • Design and implement telemetry data ingestion and data processing systems for leased sites. 
  • Prototype, pilot, and deploy multi-signal anomaly detection and prevention systems leveraging machine learning and statistical analysis for DC leased sites.
  • Define and drive an operationalization plan for the telemetry pipeline for leased sites. 
  • Ensure interoperability of detection methods, systems, and workflows by defining conceptual, logical, and physical data models. 
  • Understand the signals coming from the EPMS and BAS systems for leased sites. 
  • Ensure a high percentage of leased site signals are covered and mapped, including thermal, power, and other environmental conditions and data.
  • Define a set of reusable primitives for mapping the logical and physical topology of data center leased sites.
  • Ensure there is a high-frequency, high-volume, low-latency streaming and micro-batching capable pipeline to process DC CE telemetry from leased sites.  
  • Architect a staging model to ensure the onboarding of leased-site CE telemetry (thermal, power, and other environmental subjects).

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
