Data Engineer

1 Month ago • 1+ Years • Data Analyst

Job Summary

Job Description

Design and build scalable, reliable data infrastructure. Build applications for optimal data extraction, cleaning, transformation, and loading from diverse sources using big data technologies. Develop ETL/ELT pipelines and work with data lakes, warehouses, and BI tools. Utilize cloud services (AWS, GCP, Azure) for highly available data processing and storage. Implement data quality monitoring processes. Collaborate with business units to develop long-term data platform architecture. Establish operational excellence in data engineering. Evaluate and integrate tools to accelerate data engineering, data science, and analytics. Write unit/integration tests. Contribute to design documents.
Must have:
  • Build scalable data infrastructure
  • ETL/ELT pipeline development
  • Big data technologies (Spark, Hadoop)
  • Cloud experience (AWS, GCP, Azure)
  • Data quality monitoring
  • Python, Scala, or Java proficiency
  • Database design experience
Good to have:
  • Redshift/Snowflake/BigQuery experience
  • Data Lake/warehouse experience
  • Airflow/similar workflow tools
  • Machine learning model deployment
  • Agile methodologies

Job Details

Description

Velotio Technologies is a product engineering company working with innovative startups and enterprises. We are a certified Great Place to Work® and recognized as one of the best companies to work for in India. We have provided full-stack product development for 110+ startups across the globe building products in the cloud-native, data engineering, B2B SaaS, IoT & Machine Learning space. Our team of 400+ elite software engineers solves hard technical problems while transforming customer ideas into successful products.

Requirements

  • Design and build scalable data infrastructure with efficiency, reliability, and consistency to meet rapidly growing data needs
  • Build the applications required for optimal extraction, cleaning, transformation, and loading of data from disparate data sources and formats using the latest big data technologies
  • Build ETL/ELT pipelines and work with other data infrastructure components, like Data Lakes, Data Warehouses and BI/reporting/analytics tools
  • Work with various cloud services like AWS, GCP, Azure to implement highly available, horizontally scalable data processing and storage systems and automate manual processes and workflows
  • Implement processes and systems to monitor data quality, to ensure data is always accurate, reliable, and available for the stakeholders and other business processes that depend on it
  • Work closely with different business units and engineering teams to develop a long-term data platform architecture strategy and thus foster data-driven decision-making practices across the organization
  • Help establish and maintain a high level of operational excellence in data engineering
  • Evaluate, integrate, and build tools to accelerate Data Engineering, Data Science, Business Intelligence, Reporting, and Analytics as needed
  • Practice test-driven development by writing unit and integration tests
  • Contribute to design documents and engineering wiki

You will enjoy this role if you...

  • Like building elegant, well-architected software products with enterprise customers
  • Want to learn to leverage public cloud services & cutting-edge big data technologies, like Spark, Airflow, Hadoop, Snowflake, and Redshift
  • Work collaboratively as part of a close-knit team of geeks, architects, and leads

Desired Skills & Experience:

  • 1+ years of data engineering experience, or equivalent knowledge and ability
  • 1+ years of software engineering experience, or equivalent knowledge and ability
  • Strong proficiency in at least one of the following programming languages: Python, Scala, or Java
  • Experience designing and maintaining at least one type of database (Object Store, Columnar, In-memory, Relational, Tabular, Key-Value Store, Triple-store, Tuple-store, Graph, and other related database types)
  • Good understanding of star/snowflake schema designs
  • Extensive experience working with big data technologies like Spark, Hadoop, Hive
  • Experience building ETL/ELT pipelines and working on other data infrastructure components like BI/reporting/analytics tools
  • Experience working with workflow orchestration tools like Apache Airflow, Oozie, Azkaban, NiFi, Airbyte, etc.
  • Experience building production-grade data backup/restore strategies and disaster recovery solutions
  • Hands-on experience with implementing batch and stream data processing applications using technologies like AWS DMS, Apache Flink, Apache Spark, AWS Kinesis, Kafka, etc.
  • Knowledge of best practices in developing and deploying applications that are highly available and scalable
  • Experience with or knowledge of Agile Software Development methodologies
  • Excellent problem-solving and troubleshooting skills
  • Process-oriented with excellent documentation skills

Bonus points if you:

  • Have hands-on experience using one or more cloud service providers like AWS, GCP, and Azure, and have worked with specific products like EMR, Glue, Dataproc, Databricks, Data Studio, etc.
  • Have hands-on experience working with Redshift, Snowflake, BigQuery, Azure Synapse, or Athena and understand the inner workings of these cloud storage systems
  • Have experience building data lakes, scalable data warehouses, and data marts
  • Have familiarity with tools like Jupyter Notebooks, Pandas, NumPy, SciPy, scikit-learn, Seaborn, SparkML, etc.
  • Have experience building and deploying Machine Learning models to production at scale
  • Possess excellent cross-functional collaboration and communication skills

Benefits

Our Culture:

  • We have an autonomous and empowered work culture encouraging individuals to take ownership and grow quickly
  • Flat hierarchy with fast decision making and a startup-oriented “get things done” culture
  • A strong, fun & positive environment with regular celebrations of our success. We pride ourselves on creating an inclusive, diverse & authentic environment

At Velotio, we embrace diversity. Inclusion is a priority for us, and we are eager to foster an environment where everyone feels valued. We welcome applications regardless of ethnicity or cultural background, age, gender, nationality, religion, disability or sexual orientation.

About The Company

Velotio Technologies is a leading product engineering and digital solutions company working with innovative startups and enterprises across the globe. We specialize in Full-Stack development, Web & Mobile App Development, Cloud & DevOps, Data Engineering, AI/ML, UI/UX, and Quality Assurance. Since our inception in 2016, we have worked with over 110 global customers, including NASDAQ-listed enterprises, unicorn startups, Y Combinator and Sequoia-funded companies, and cutting-edge product companies.