Staff Engineer - Data

2 Months ago • 8 Years +
Data Analysis

Job Description

At SAFE Security, we champion a safer digital future through radical transparency, autonomy, and accountability. We foster a culture-first environment with unlimited vacation and a commitment to continuous learning. We seek a Staff Data Engineer to solve complex data challenges at scale. This role involves leading the design of data platforms, pipelines, and lakehouse architectures to fuel AI-driven cyber risk quantification globally, setting data strategy, and shaping large-scale data ecosystems.
Good To Have:
  • Exposure to CI/CD pipelines, automated testing, and infrastructure-as-code for data workflows.
  • Familiarity with real-time analytics engines (Druid, Pinot, Rockset) or machine learning data pipelines.
  • Contributions to open-source data projects or thought leadership in the data engineering community.
  • Prior experience in cybersecurity, risk quantification, or another high-scale SaaS domain.
Must Have:
  • Mentor engineers and champion data engineering best practices.
  • Design and lead petabyte-scale data ingestion, processing, and analytics platforms.
  • Build streaming and batch pipelines handling billions of events daily.
  • Define frameworks for data modeling, schema evolution, and data quality.
  • Write high-performance data processing jobs in Python, SQL, and Scala.
  • Architect data lakes and warehouse solutions balancing cost, performance, and scalability.
  • Debug and optimize long-running jobs, data skew, and high-volume ETL bottlenecks.
  • Collaborate with Product, AI/ML, and Platform teams for data solutions.
  • Evaluate and introduce emerging data technologies.
  • 8+ years of experience in data engineering, designing and scaling distributed data systems.
  • Deep expertise in big data processing frameworks like Apache Spark, Flink, and Airflow.
  • Strong hands-on experience with Snowflake, Iceberg, and Parquet.
  • Proficiency in Python, SQL, Scala, and Go/Node.js for ETL/ELT workloads.
  • Expertise in real-time data ingestion pipelines using Kafka or Kinesis.
  • Experience operating in cloud-native environments (AWS) and leveraging services like S3, Lambda, ECS, Glue, Athena.
  • Strong understanding of data modeling, schema design, indexing, and query optimization.
  • Proven leadership in mentoring engineers and driving architectural decisions.
  • Experience in streaming architectures, CDC pipelines, and data observability frameworks.
  • Ability to navigate ambiguous problems and lead teams toward innovative solutions.
  • Proficient in deploying containerized applications (Docker, Kubernetes, ECS).
  • Familiarity with AI coding assistants like Cursor, Claude Code, or GitHub Copilot.
Perks:
  • Unlimited vacation policy
  • High-trust work environment
  • Commitment to continuous learning

At SAFE Security, our vision is to be the Champions of a Safer Digital Future and the Catalysts of Change. We believe in empowering individuals and teams with the freedom and responsibility to align their goals, ensuring we all move forward together.

We operate with radical transparency, autonomy, and accountability—there’s no room for brilliant jerks. We embrace a culture-first approach, offering an unlimited vacation policy, a high-trust work environment, and a commitment to continuous learning. For us, Culture is Our Strategy—check out the Culture Memo to dive deeper into what makes SAFE unique.

We’re looking for a Staff Data Engineer who thrives on solving complex data challenges at scale. You’ll be the technical force multiplier, leading the design of data platforms, pipelines, and lakehouse architectures that fuel AI-driven cyber risk quantification globally. If you’ve been waiting for a role where you can set data strategy, lead bold ideas, and shape large-scale data ecosystems—this is it.

What You’ll Do:

  • Be the Data Tech Leader: Mentor engineers, champion data engineering best practices, and raise the bar for technical excellence across the org.
  • Architect at Scale: Design and lead petabyte-scale data ingestion, processing, and analytics platforms using Snowflake, Apache Spark, Iceberg, Parquet, and AWS-native services.
  • Own the Data Flow: Build streaming and batch pipelines handling billions of events daily, orchestrated through Apache Airflow for reliability and fault tolerance.
  • Set the Standards: Define frameworks for data modeling, schema evolution, partitioning strategies, and data quality/observability for analytics and AI workloads.
  • Code Like a Pro: Stay hands-on, writing high-performance data processing jobs in Python, SQL, and Scala, and conducting deep-dive reviews when it matters most.
  • Master the Lakehouse: Architect data lakes and warehouse solutions that balance cost, performance, and scalability, leveraging AWS S3 and Snowflake.
  • Solve Complex Problems: Debug and optimize long-running jobs, data skew, and high-volume ETL bottlenecks elegantly and efficiently.
  • Collaborate and Influence: Work with the Product, AI/ML, and Platform teams to ensure that data solutions directly power real-time cyber risk analytics.
  • Innovate Constantly: Evaluate and introduce emerging data technologies (e.g., Flink, Druid, Rockset) to keep SAFE at the forefront of data engineering innovation.

What We’re Looking For:

  • 8+ years of experience in data engineering, with a proven track record of designing and scaling distributed data systems.
  • Deep expertise in big data processing frameworks (Apache Spark, Flink) and workflow orchestration (Airflow).
  • Strong hands-on experience with data warehousing (Snowflake) and data lakehouse architectures (Iceberg, Parquet).
  • Proficiency in Python, SQL, Scala, and Go/Node.js, with the ability to optimize large-scale ETL/ELT workloads.
  • Expertise in real-time data ingestion pipelines using Kafka or Kinesis, handling billions of events daily.
  • Experience operating in cloud-native environments (AWS) and leveraging services like S3, Lambda, ECS, Glue, and Athena.
  • Strong understanding of data modeling, schema design, indexing, and query optimization for analytical workloads.
  • Proven leadership in mentoring engineers, driving architectural decisions, and aligning data initiatives with product goals.
  • Experience in streaming architectures, CDC pipelines, and data observability frameworks.
  • Ability to navigate ambiguous problems and high-scale challenges, and to lead teams toward innovative solutions.
  • Proficient in deploying containerized applications (Docker, Kubernetes, ECS).
  • Familiarity with AI coding assistants like Cursor, Claude Code, or GitHub Copilot.

Preferred Qualifications:

  • Exposure to CI/CD pipelines, automated testing, and infrastructure-as-code for data workflows.
  • Familiarity with real-time analytics engines (Druid, Pinot, Rockset) or machine learning data pipelines.
  • Contributions to open-source data projects or thought leadership in the data engineering community.
  • Prior experience in cybersecurity, risk quantification, or another high-scale SaaS domain.

If you’re passionate about cyber risk, thrive in a fast-paced environment, and want to be part of a team that’s redefining security—we want to hear from you! 🚀
