Sr Big Data Engineer - Oozie and Pig (GCP)

2 Months ago • All levels • Data Analysis • $116,100 PA - $198,440 PA

Job Summary

Job Description

This role requires a Senior Big Data Engineer with expertise in distributed systems, batch data processing, and large-scale data pipelines. Responsibilities include designing and developing scalable batch processing systems using Hadoop, Oozie, Pig, Hive, MapReduce, and HBase; writing efficient, production-ready code; developing and optimizing complex data workflows within the Apache Hadoop ecosystem; leveraging GCP tools (Dataproc, GCS, Composer); implementing DevOps and automation best practices (CI/CD, IaC); and collaborating with cross-functional teams. The ideal candidate possesses strong hands-on experience with Oozie, Pig, the Apache Hadoop ecosystem, and programming proficiency in Java or Python. A deep understanding of data structures and algorithms is essential. This is a fully remote position.
Must have:
  • Oozie, Pig, Hadoop expertise
  • Java/Python proficiency
  • Data structure & algorithm knowledge
  • GCP experience
  • DevOps & CI/CD skills
  • Batch processing system design
  • Scalable data pipeline development
Good to have:
  • Airflow
  • BigTable
  • Redis
  • Spark

Job Details

About the Role 

We are seeking a Senior Big Data Engineer with deep expertise in distributed systems, batch data processing, and large-scale data pipelines. The ideal candidate has strong hands-on experience with Oozie, Pig, the Apache Hadoop ecosystem, and programming proficiency in Java (preferred) or Python. This role requires a deep understanding of data structures and algorithms, along with a proven track record of writing production-grade code and building robust data workflows. 

This is a fully remote position and requires an independent, self-driven engineer who thrives in complex technical environments and communicates effectively across teams. 

Work Location: US-Remote, Canada-Remote 

Key Responsibilities:

    • Design and develop scalable batch processing systems using technologies like Hadoop, Oozie, Pig, Hive, MapReduce, and HBase, with hands-on coding in Java or Python. 
    • Write clean, efficient, and production-ready code with a strong focus on data structures and algorithmic problem-solving applied to real-world data engineering tasks. 
    • Develop, manage, and optimize complex data workflows within the Apache Hadoop ecosystem, with a strong focus on Oozie orchestration and job scheduling. 
    • Leverage Google Cloud Platform (GCP) tools such as Dataproc, GCS, and Composer to build scalable and cloud-native big data solutions. 
    • Implement DevOps and automation best practices, including CI/CD pipelines, infrastructure as code (IaC), and performance tuning across distributed systems. 
    • Collaborate with cross-functional teams to ensure data pipeline reliability, code quality, and operational excellence in a remote-first environment. 

Qualifications:

    • Bachelor's degree in Computer Science, Software Engineering, or a related field of study.
    • Experience with managed cloud services and an understanding of cloud-based batch processing systems are critical.
    • Proficiency in Oozie, Airflow, MapReduce, and Java.
    • Strong programming skills with Java (specifically Spark), Python, Pig, and SQL.
    • Expertise in public cloud services, particularly in GCP.
    • Proficiency in the Apache Hadoop ecosystem, including Oozie, Pig, Hive, and MapReduce.
    • Familiarity with BigTable and Redis.
    • Experience applying infrastructure and DevOps principles in daily work, using tools for continuous integration and continuous deployment (CI/CD) and Infrastructure as Code (IaC), such as Terraform, to automate and improve development and release processes.
    • Proven experience in engineering batch processing systems at scale.

The following information is required by pay transparency legislation in the following states: CA, CO, HI, NY, and WA. This information applies only to individuals working in these states.
 
    • The anticipated starting pay range for Colorado is: $116,100 - $170,280.
    • The anticipated starting pay range for the states of Hawaii and New York (not including NYC) is: $123,600 - $181,280.
    • The anticipated starting pay range for California, New York City and Washington is: $135,300 - $198,440.

Unless already included in the posted pay range and based on eligibility, the role may include variable compensation in the form of bonus, commissions, or other discretionary payments. These discretionary payments are based on company and/or individual performance and may change at any time. Actual compensation is influenced by a wide array of factors including but not limited to skill set, level of experience, licenses and certifications, and specific work location. Information on benefits offered is here.

About Rackspace Technology
We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.
 
 
More on Rackspace Technology
Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.
 
 
