Junior Data Engineer

2 Weeks ago • 1-2 Years • Data Analyst

About the job

Job Description

This remote Junior Data Engineer role at Patterned Learning involves designing, implementing, and managing data pipelines using ETL tools. Responsibilities include data modeling, creating efficient data structures, ensuring data quality and consistency through cleansing and validation processes, and integrating data from various sources (structured and unstructured). The role also requires utilizing big data technologies (e.g., Kafka) for processing large datasets, implementing data security measures, and optimizing pipeline performance. Experience with Python, Java, SQL, and database management (relational and non-relational) is essential. The ideal candidate will be a team player with excellent communication and problem-solving skills.
Must have:
  • 2+ years programming (Python, Java, SQL)
  • 2+ years ETL & database experience
  • Data pipeline development & management
  • Data modeling & quality assurance
  • Data integration & security
Good to have:
  • Big data technologies (PySpark, Databricks, Azure Synapse)
  • Cloud platform experience

This is a remote position.

Junior Data Engineer  - Remote Job, 1+ Year Experience


Annual Income: $63K - $77K


A valid work permit is necessary in the US

About us: Patterned Learning is a platform that aims to help developers code faster and more efficiently. It offers features such as collaborative coding, real-time multiplayer editing, and the ability to build, test, and deploy directly from the browser. The platform also provides tightly integrated code generation, editing, and output capabilities.




Position Summary

Join the fast-paced, innovative, and collaborative environment focused on providing an AIOps platform that enhances the intelligence of the CVS Health infrastructure. Work closely with subject matter experts and colleagues to build and scale out machine learning and AI solutions that will detect, predict, and recommend solutions to correct issues before system impact and enhance the efficiency, reliability, and performance of CVS Health’s IT operations. 

Key Responsibilities include:

  • Data pipeline development: Designed, implemented, and managed data pipelines for extracting, transforming, and loading data from various sources into data lakes for processing, analytics, and correlation.

  • Data modeling: Create and maintain data models ensuring data quality, scalability, and efficiency

  • Develop and automate processes to clean, transform, and prepare data for analytics, ensuring data accuracy and consistency

  • Data Integration: Integrate data from disparate sources, both structured and unstructured to provide a unified view of key infrastructure platform and application data

  • Utilize big data technologies such as Kafka to process and analyze large volumes of data efficiently

  • Implement data security measures to protect sensitive information and ensure compliance with data and privacy regulation

  • Create/maintain documentation for data processes, data flows, and system configurations

  • Performance Optimization- Monitor and optimize data pipelines and systems for performance, scalability and cost-effectiveness

Characteristics of this role:

  • Team Player: Willing to teach, share knowledge, and work with others to make the team successful.

  • Communication: Exceptional verbal, written, organizational, presentation, and communication skills.

  • Creativity: Ability to take written and verbal requirements and come up with other innovative ideas.

  • Attention to detail: Systematically and accurately research future solutions and current problems.

  • Strong work ethic: The innate drive to do work extremely well.

  • Passion: A drive to deliver better products and services than expected to customers.


Required Qualifications

  • 2+ years of programming experience in languages such as Python, Java, SQL

  • 2+ years of experience with ETL tools and database management (relational, non-relational)

  • 2+ years of experience in data modeling techniques and tools to design efficient scalable data structures

  • Skills in data quality assessment, data cleansing, and data validation


Preferred Qualifications

  • Knowledge of big data technologies and cloud platforms

  • Experience with technologies like PySpark, Databricks, and Azure Synapse.


Education

Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent working experience


Why Patterned Learning LLC?


Patterned Learning can provide intelligent suggestions, automate repetitive tasks, and assist developers in writing code more effectively. This can help reduce coding errors, improve productivity, and accelerate the development process.


The pattern recognition is particularly relevant in the context of coding. Neural networks, especially deep learning models, are commonly employed for pattern detection and classification tasks. These models simulate human decision-making and can identify patterns in data, making them well-suited for tasks like code analysis and generation.



View Full Job Description
$63.0K - $77.0K/yr (Outscal est.)
$70.0K/yr avg.
Worldwide

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

Similar Jobs

Luxoft - Regular Android HMI Architect

Luxoft, Egypt (On-Site)

Microsoft - ROP - Software Engineer II

Microsoft, India (On-Site)

Technorizen Software Solutions - Exp. Android Developer (1-2 years)

Technorizen Software Solutions, India (On-Site)

ByteDance - Risk Control Business Partner - Dubai

ByteDance, United Arab Emirates (On-Site)

Playrix - Senior Data Analyst (Game)

Playrix, Serbia (Remote)

GoTo Group - Geospatial Analyst

GoTo Group, Indonesia (On-Site)

The Walt Disney Company - Manager Data Science

The Walt Disney Company, United States (On-Site)

Tangelo Games Corp - Data & Analytics Engineer

Tangelo Games Corp, Spain (Remote)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Dream Sports - SDE 1 - React Native with Android

Dream Sports, India (On-Site)

Moon Active - Unity Developer

Moon Active, Israel (Hybrid)

ION - Technical Consultant - Endur

ION, United States (On-Site)

bosh group india - Data Engineer--Scala development

bosh group india, India (On_site)

ION - Senior Java Developer - Italy

ION, Italy (On-Site)

Logitech - Sr Integration Engineer

Logitech, India (On-Site)

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

Netflix - Indirect Demand Specialist

Netflix, United Kingdom (On-Site)

Miniclip - Data Engineer

Miniclip, Portugal (On-Site)

Netflix - Data Engineer (L5) - Product (Device)

Netflix, United States (Remote)

GeoYeti - LIDAR/EO Photogrammetrist - Junior

GeoYeti, United States (On-Site)

Varonis  - BI Developer

Varonis , United States (On-Site)

Equivalent Jobs - C++ SOFTWARE ENGINEER (MARKET DATA)

Equivalent Jobs, (Remote)

Zinrelo - Data Scientist

Zinrelo, India (Hybrid)

Get notifed when new similar jobs are uploaded