AI Infrastructure Engineer

1 Month ago • All levels • DevOps • $118,657 PA - $177,000 PA

Job Summary

Job Description

The AI Infrastructure Engineer will join the TIGE CDN Platform Dev team to develop a high-performing global multi-cloud CDN platform. The role involves integrating AI solutions to enhance platform automation, configuration, and incident management. Responsibilities include designing and developing AI-powered solutions for log analysis, root cause analysis, and automated troubleshooting. The engineer will also contribute to creating AI-driven configuration assistants, optimize AI model performance, and collaborate with various teams to apply AI technologies to solve business problems. This role demands strong analytical and communication skills, along with the ability to work in a dynamic, multi-cloud environment.
Must have:
  • Bachelor's or Master's degree in a related field
  • Proficient in Python and deep learning frameworks
  • Experience with AI model deployment and optimization
  • Basic understanding of distributed systems and cloud technologies
  • Excellent analytical and problem-solving skills
  • Strong communication and teamwork abilities
Good to have:
  • Internship experience with AI solutions
  • Familiarity with log analytics platforms
  • Experience with infrastructure-as-code tools
  • Coursework in MLOps practices

Job Details

Team Description

The TIGE CDN Platform Dev team provides a highly available, cost-efficient, and top-performing global multi-cloud CDN platform for ByteDance’s internal customers by integrating both self-built and commercial CDNs. The team continuously evolves the platform to include more cloud services beyond CDN, delivering a unified, secure, reliable, and high-performance multi-cloud PaaS solution. We are now expanding our capabilities by integrating AI-driven solutions to significantly enhance platform automation, configuration, and intelligent incident management.

Job Description

1. Participate in exploring, designing, and developing AI-powered solutions for intelligent log analysis, root cause analysis (RCA), and automated troubleshooting to enhance platform reliability and reduce MTTR.
2. Contribute to designing and implementing AI-driven multi-cloud configuration assistants, enabling intuitive and automated interfaces for platform configuration and customer self-service scenarios.
3. Work closely with senior engineers, product managers, and operations teams to identify business pain points and apply AI technologies to deliver rapid, tangible improvements.
4. Assist in optimizing AI model inference performance and deployment efficiency within a cloud-native, edge computing environment.

Qualifications

Minimum Qualifications:

1. Bachelor’s or Master’s degree in Computer Science, Electronics, Communication, Artificial Intelligence, or related fields.
2. Strong programming skills, proficient in Python, and familiarity with at least one deep learning framework such as PyTorch, TensorFlow, or JAX.
3. Academic or project-based experience with AI model deployment and optimization, especially large language models (LLMs), vector databases, RAG techniques, or prompt engineering.
4. Basic understanding of distributed systems, cloud-native technologies, and Kubernetes-based infrastructure.
5. Excellent analytical and problem-solving skills, with coursework or projects demonstrating the application of data-driven AI solutions.
6. Strong communication and teamwork abilities, comfortable collaborating across diverse teams.

Preferred Qualifications:

1. Internship experience or academic projects related to AI solutions within CDN, edge computing, or multi-cloud platforms.
2. Familiarity with log analytics platforms (e.g., Prometheus, ClickHouse, ELK) or automated incident management systems.
3. Experience with infrastructure-as-code tools (Terraform, OpenAPI) or automation frameworks.
4. Coursework or projects in MLOps practices or using deployment tools such as Kubeflow, Ray, or BentoML.

Similar Jobs

Activate Games - Game Facilitator (Store Associate)

Activate Games

Tustin, California, United States (On-Site)
1 Month ago
Sailpoint - Director of CAE

Sailpoint

United States (Remote)
3 Weeks ago
Bungie - Contract Associate Creator Marketing Manager

Bungie

(Hybrid)
6 Months ago
Granicus - Data Scientist 4

Granicus

Bengaluru, Karnataka, India (Remote)
1 Month ago
Tesla - Workshop Supervisor

Tesla

Holzwickede, North Rhine-Westphalia, Germany (On-Site)
4 Months ago
NVIDIA - Senior BMC Firmware Development Engineer - Platform Lead

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
AeroSpike - Solutions Architect

AeroSpike

Mumbai, Maharashtra, India (On-Site)
2 Months ago
Unity - Mobile Automation Engineer

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Nightfall AI - Senior ML Platform Backend Engineer

Nightfall AI

San Francisco, California, United States (Hybrid)
1 Month ago
Perplexity - AI Software Engineer - Evaluation Platform

Perplexity

San Francisco, California, United States (On-Site)
1 Month ago

Similar Skill Jobs

Riot Games - Principal Software Engineer, Gameplay - Teamfight Tactics

Riot Games

Dublin, County Dublin, Ireland (On-Site)
7 Months ago
Riot Games - Staff Software Engineer - VALORANT, Foundations, Build Platforms

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Vercel - Engineering Manager, SRE

Vercel

New York, New York, United States (Hybrid)
1 Month ago
Moloco - Growth Manager (Russian Speaking)

Moloco

London, England, United Kingdom (On-Site)
6 Days ago
Tesla - Service Advisor

Tesla

Vienna, Vienna, Austria (On-Site)
4 Months ago
Mapbox - Engineering Manager, Search

Mapbox

Japan (On-Site)
2 Months ago
Nice - HR Operations Specialist

Nice

Sandy, Utah, United States (On-Site)
3 Weeks ago
Sagecor - Software Integration Engineer 3

Sagecor

Fort Meade, Maryland, United States (On-Site)
1 Month ago
WongDoody - UI Designer

WongDoody

Hong Kong, Hong Kong (On-Site)
2 Months ago
Passive Logic - Technical Project Manager

Passive Logic

Salt Lake City, Utah, United States (On-Site)
9 Months ago

Jobs in San Jose, California, United States

Rippling - Staff Software Engineer - Device Products

Rippling

Seattle, Washington, United States (On-Site)
6 Days ago
UPF Industries  - Regional Truck Driver

UPF Industries

Granger, Indiana, United States (On-Site)
2 Months ago
whoop - Senior Test Development Engineer

whoop

Boston, Massachusetts, United States (On-Site)
2 Months ago
Nice - Senior Sales, Compensation, Financial and Data Analyst

Nice

United States (Remote)
3 Weeks ago
Advanced Systems Group, LLC - Audio & Visual Technical Coordinator

Advanced Systems Group, LLC

San Francisco, California, United States (On-Site)
2 Weeks ago
bytedance - Software Engineer, Inference

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago
CyberArk - Senior Implementation Engineer

CyberArk

United States (On-Site)
1 Week ago
Apple - Wireless PHY System Bringup Engineer

Apple

San Diego, California, United States (On-Site)
1 Month ago
Palo Alto Networks - Senior Analyst, SOX and External Reporting Assurance

Palo Alto Networks

Santa Clara, California, United States (On-Site)
1 Week ago
Nintendo - Localization Product Specialist III - Spanish

Nintendo

Redmond, Washington, United States (Hybrid)
7 Months ago

Devops Jobs

Veeam Software - DevOps Engineer

Veeam Software

Warsaw, Masovian Voivodeship, Poland (Remote)
3 Weeks ago
Palo Alto Networks - Senior Consulting Director, Cloud Security, Proactive Services (Unit 42)

Palo Alto Networks

New York, United States (Remote)
1 Week ago
Mendix - Senior Presales Solution Architect

Mendix

Bangkok, Thailand (Remote)
7 Months ago
USE Insider - DevOps Engineer

USE Insider

Istanbul, İstanbul, Türkiye (Remote)
7 Months ago
Interactive Brokers - Senior Platform Engineer

Interactive Brokers

Chicago, Illinois, United States (Hybrid)
5 Days ago
bytedance - Solutions Architect

bytedance

Taguig, Metro Manila, Philippines (On-Site)
4 Months ago
Playtika - SRE Group Manager

Playtika

Ukraine (On-Site)
5 Months ago
Bluevine - Senior DevOps Engineer

Bluevine

Bengaluru, Karnataka, India (Hybrid)
9 Months ago
Palo Alto Networks - Senior Staff Site Reliability Engineer (Cortex Observability)

Palo Alto Networks

Santa Clara, California, United States (On-Site)
2 Days ago
Brillio - Full Stack/Architect (Python, React, Strapi, AWS, Terraform)

Brillio

New York, United States (Remote)
6 Days ago

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.