Tech Lead

Metadomeai

Job Summary

This Tech Lead role involves driving the architecture and development of scalable distributed systems and complex data pipelines. The successful candidate will lead a team of engineers, focusing on high-performance backend services, scalable infrastructure, and resilient microservices architectures, combining technical excellence with leadership and strategic planning.

Must Have

  • Design and implement distributed data processing pipelines (Go, Kafka, queueing systems).
  • Architect microservices with Go frameworks (Gin/Echo/Fiber), implementing resilience patterns.
  • Design and implement horizontal and vertical scaling strategies for workloads.
  • Optimize resource utilization via load balancing, caching, and connection pooling.
  • Build event-driven architectures with Apache Kafka (topic design, partitioning).
  • Design polyglot persistence solutions (MongoDB, MySQL, Neo4j).
  • Implement comprehensive monitoring and observability (Prometheus, Grafana, ELK, tracing).
  • Design Kubernetes deployments with advanced patterns (StatefulSets, DaemonSets).
  • Implement auto-scaling strategies (HPA, VPA, cluster autoscaling).
  • Manage multi-environment deployments with GitOps workflows (ArgoCD/Flux).
  • Design disaster recovery and backup strategies across regions.
  • Implement cost optimization strategies for cloud resources.
  • Lead a team of 5-8 engineers, conducting 1:1s and career development.
  • Drive sprint planning, backlog grooming, and capacity planning.
  • Define and track team OKRs aligned with organizational goals.
  • Conduct performance reviews and create individual development plans.
  • Foster technical excellence through code and design reviews and knowledge sharing.
  • Manage stakeholder expectations and communicate technical decisions.
  • Lead incident response and post-mortem processes.
  • Collaborate with Product/Business to translate requirements into roadmaps.
  • Perform capacity planning for infrastructure and team resources.
  • Drive technical debt reduction initiatives while balancing feature delivery.
  • Establish engineering best practices, coding standards, and documentation guidelines.
  • Lead architectural decision records (ADRs) and technology evaluation.
  • Advanced proficiency in Go (concurrency, memory optimization, error handling).
  • Deep understanding of Kafka internals and exactly-once semantics.
  • Hands-on experience with service mesh, API gateways, distributed caching, saga patterns.
  • Expertise in load balancing, database sharding, read replicas, CDN integration.
  • Strong knowledge of CPU/memory profiling, GPU acceleration, resource optimization.
  • Experience with MySQL/PostgreSQL query optimization, indexing, connection pooling.
  • Production experience with Kubernetes (CRDs, operators, network policies, RBAC).
  • Expertise in distributed tracing, custom metrics, log aggregation, SLI/SLOs.
  • 7-10 years of software engineering experience.
  • 4+ years in distributed systems.
  • 2+ years of technical leadership or team lead experience.
  • Proven track record of leading projects from conception to production.
  • Experience managing cross-functional technical initiatives.
  • Background in 24x7 production support and on-call rotation.
  • Experience with compliance and security requirements (SOC2, GDPR).

Perks & Benefits

  • Medical insurance (Self + Spouse + Parents + Kids)
  • Accidental insurance (Self)
  • Employer's PF Contribution
  • Gratuity: As per the Gratuity Act

Job Description

About the Role

We're seeking an experienced Tech Lead to drive the architecture and development of scalable distributed systems and complex data pipelines. You'll lead a team of engineers while remaining hands-on: building high-performance backend services, implementing scalable infrastructure solutions, and designing resilient microservices architectures. This role combines technical excellence with team leadership and strategic planning.

Responsibilities

Technical Leadership

  • Design and implement distributed data processing pipelines using Go, Kafka, and various queueing systems (RabbitMQ, SQS, Redis Streams)
  • Architect microservices using Go with Gin/Echo/Fiber frameworks, implementing circuit breakers, retry mechanisms, and graceful degradation
  • Design and implement horizontal and vertical scaling strategies for compute-intensive workloads
  • Optimize resource utilization through intelligent load balancing, caching strategies, and connection pooling
  • Build event-driven architectures with Apache Kafka, including topic design, partition strategies, and consumer group management
  • Design polyglot persistence solutions across MongoDB (document modeling, aggregation pipelines), MySQL (query optimization, indexing strategies), and Neo4j (graph traversal optimization)
  • Implement comprehensive monitoring and observability using Prometheus, Grafana, ELK stack, and distributed tracing (Jaeger/Zipkin)

Infrastructure & DevOps

  • Design Kubernetes deployments with advanced patterns: StatefulSets, DaemonSets, Jobs, and CronJobs
  • Implement auto-scaling strategies: HPA, VPA, and cluster autoscaling based on custom metrics
  • Manage multi-environment deployments with GitOps workflows (ArgoCD/Flux)
  • Design disaster recovery and backup strategies across multiple regions
  • Implement cost optimization strategies for cloud resources

Requirements

Team Leadership & Management

  • Lead a team of 5-8 engineers, conducting regular 1:1s and career development discussions
  • Drive sprint planning, backlog grooming, and capacity planning for quarterly deliverables
  • Define and track team OKRs aligned with organizational goals
  • Conduct performance reviews and create individual development plans
  • Foster a culture of technical excellence through code reviews, design reviews, and knowledge sharing sessions
  • Manage stakeholder expectations and communicate technical decisions to non-technical audiences
  • Lead incident response and post-mortem processes

Strategic Planning

  • Collaborate with Product and Business teams to translate requirements into technical roadmaps
  • Perform capacity planning for infrastructure and team resources
  • Drive technical debt reduction initiatives while balancing feature delivery
  • Establish engineering best practices, coding standards, and documentation guidelines
  • Lead architectural decision records (ADRs) and technology evaluation processes

Required Technical Skills

Backend Development

  • Go Expertise: Advanced proficiency in concurrency patterns (worker pools, channels), memory optimization, and building production-grade middleware with proper error handling and context propagation.

Distributed Systems

  • Message Streaming: Deep understanding of Kafka internals including partitioning strategies, consumer group management, and implementing exactly-once semantics with proper data serialization.
  • Microservices Patterns: Hands-on experience with service mesh, API gateways, distributed caching, and implementing saga patterns for distributed transactions.

Scaling & Performance

  • Horizontal Scaling: Expertise in load balancing, database sharding, read replicas, and CDN integration for distributed workloads.
  • Vertical Scaling: Strong knowledge of CPU/memory profiling, GPU acceleration (e.g., NVIDIA L4/T4/A10) for compute-intensive tasks, and resource optimization strategies.

Data Systems

  • MySQL/PostgreSQL: Experience with query optimization, indexing strategies, and connection pooling for RDBMS performance tuning.

Cloud & Infrastructure

  • Kubernetes: Production experience with CRDs, operators, network policies, and implementing RBAC for multi-tenant environments.
  • Observability: Expertise in distributed tracing, custom metrics, log aggregation, and defining SLI/SLOs for system reliability.

Required Experience

  • 7-10 years of software engineering experience, including 4+ years in distributed systems
  • 2+ years of technical leadership or team lead experience
  • Proven track record of leading projects from conception to production
  • Experience managing cross-functional technical initiatives
  • Background in 24x7 production support and on-call rotation management
  • Experience with compliance and security requirements (SOC2, GDPR)

