Senior Data Engineer

1 Month ago • 3-5 Years • Data Analysis

Job Summary

Job Description

Progress is seeking a Senior Data Engineer to join their AI First team. The role involves transforming unstructured content like documentation and knowledge-base articles into high-quality, AI-ready data for their products. Responsibilities include designing and maintaining scalable data pipelines, processing and analyzing large datasets for AI model embedding and fine-tuning, validating and cleaning data, preparing data for Retrieval-Augmented Generation (RAG) workflows, building and optimizing database pipeline architectures, implementing data governance and security, collaborating with cross-functional teams, and contributing to ground-truth datasets for Gen-AI features.
Must have:
  • Bachelor's or Master's degree in Computer Science, Data Science, or related field.
  • 3-5 years of experience as a Data Engineer or Data Scientist.
  • Proficiency with big data systems (Spark, Kafka, VectorDB).
  • Strong SQL and Python skills.
  • Hands-on experience with MS Azure.
  • Solid understanding of machine learning fundamentals.
Good to have:
  • Experience in a cloud environment.
  • Knowledge of data science workflows and libraries.
  • Familiarity with text-processing/ML libraries (pandas, PySpark, Hugging Face).
  • Exposure to vector search technologies (pgvector, Pinecone, Milvus, Azure AI Search).
  • Effective communication and collaboration abilities.
Perks:
  • Generous remuneration package
  • Employee Stock Purchase Plan
  • 30 days paid annual vacation
  • Extra day off for birthday
  • 2 additional days off for volunteering
  • Premium healthcare and dental care coverage
  • Additional pension insurance
  • Well-equipped gym on-site with CrossFit equipment and a climbing wall
  • Co-funded Multisport card
  • Daycare Center onsite
  • Flexible working hours
  • Opportunity to work from home
  • Free underground parking with designated space for bikes and electric scooters

Job Details

We are Progress (Nasdaq: PRGS) - an experienced, trusted provider of products designed with customers in mind so they can develop the applications they need, deploy where and how they want, and manage it all safely and securely.   
We’re proud to have a diverse, global team where we value the individual and enrich our culture by considering varied perspectives because we believe people power progress. Join us as a Senior Data Engineer and help us do what we do best: propelling business forward. 
 
We are seeking an experienced and driven Data Engineer to join our AI First team, part of Web Components and Tools division. In this role, you will turn large volumes of unstructured content - documentation, knowledge-base articles, API references, etc. into high-quality, AI-ready data that powers our products. 
 
In this role, you will:
  • Design, develop, and maintain scalable data pipelines for collecting, ingesting, transforming and monitoring text data from multiple sources.
  • Process and analyze large datasets to support AI model embedding and fine-1922574463tuning.
  • Validate, clean and categorize data to ensure quality, security and usability; surface anomalies and gaps through automated checks.
  • Prepare data for RAG (Retrieval-Augmented Generation) workflows by splitting and chunking documents, managing metadata, and ensuring smooth integration with vector stores.
  • Build and optimize database pipeline architectures for performance and reliability
  • Implement data governance and security best practices.
  • Collaborate with cross-functional teams to deliver data solutions aligned with business goals.
  • Contribute to the creation of ground-truth datasets and evaluation harnesses for future Gen-AI features.
  • Stay current with emerging AI tools and trends to drive innovation.
Your background:
  • Bachelor’s or Master’s degree in Computer Science, Data Science, or a related field.
  • 3–5 years of experience as a Data Engineer or Data Scientist, preferably in a cloud environment.
  • Proficiency with big data systems and tools (e.g., Apache Spark, Kafka, VectorDB).
  • Knowledge of data science workflows and libraries
  • Strong SQL and Python skills; familiarity with text-processing/ML libraries (pandas, PySpark, Hugging Face).
  • Hands-on experience with cloud platform, especially MS Azure.
  • Exposure to vector search technologies (e.g., pgvector, Pinecone, Milvus, Azure AI Search).
  • Solid understanding of machine learning fundamentals
  • Strong analytical and problem-solving skills.
  • Effective communication and collaboration abilities.
If this sounds like you and fits your experience and career goals, we’d be happy to chat.  
 
What we offer in return is the opportunity to experience a great company culture with wonderful colleagues to learn from and collaborate with and also to enjoy:  
 
Compensation
  • Generous remuneration package
  • Employee Stock Purchase Plan Enrollment 
Family, and Health 
  • 30 days paid annual vacation
  • An extra day off for your birthday
  • 2 additional days off for volunteering
  • Premium healthcare and dental care coverage
  • Additional pension insurance
  • Well-equipped gym on-site with CrossFit equipment and a climbing wall
  • Co-funded Multisport card
  • Daycare Center for your little ones onsite
  • Flexible working hours and the opportunity to work from home.
  • Free underground parking with a designated space for bikes and electric scooters 

Apply now!

#LI-DG1
#LI-Hybrid

Similar Jobs

Aerovect - AV System Architect

Aerovect

United States (Remote)
1 Week ago
zeta - Project Manager - Talent Acquisition

zeta

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Ajmera Infotech - Sr. Backend Engineer - Node Expert

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
11 Months ago
WongDoody - (CX) CUSTOMER EXPERIENCE CONSULTANT

WongDoody

Australia (On-Site)
9 Months ago
DWS Group - Data Scientist

DWS Group

Mumbai, Maharashtra, India (Hybrid)
8 Months ago
Glean - Senior/Staff Data Scientist, Core Product

Glean

Palo Alto, California, United States (Hybrid)
2 Months ago
Cognite - Senior Data Engineer

Cognite

Oslo, Oslo, Norway (Hybrid)
1 Year ago
PwC - Senior Associate AI Engineer - Data and Analytics - Advisory

PwC

Hyderabad, Telangana, India (On-Site)
3 Days ago
NVIDIA - Senior Data Scientist and System Architect

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

kaizen gaming  - Social Media Manager

kaizen gaming

Berlin, Berlin, Germany (On-Site)
1 Month ago
Scopely - Data Science Manager, Marketing Analytics

Scopely

Barcelona, Catalonia, Spain (Hybrid)
5 Months ago
DevRev - Demand Generation Content Writer

DevRev

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
ShyftLabs - MicroStrategy Reporting Engineer

ShyftLabs

Noida, Uttar Pradesh, India (Hybrid)
1 Month ago
bytedance - Infrastructure Software Engineer in Edge Cloud

bytedance

Seattle, Washington, United States (On-Site)
3 Months ago
Ziff Davis - Services Product Manager

Ziff Davis

Málaga, Andalusia, Spain (Remote)
5 Days ago
Apple - Machine Learning Manager - Apple Ads

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Blenheim Chalcot India - Lead Data Engineer

Blenheim Chalcot India

Mumbai, Maharashtra, India (On-Site)
2 Months ago
Palo Alto Networks - Consulting Director, Proactive Services - Cloud and AI (Unit 42)

Palo Alto Networks

Drenthe, Netherlands (Remote)
1 Month ago
Hawkeye Innovations - Senior Data Test Automation Engineer

Hawkeye Innovations

Basingstoke, England, United Kingdom (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sofia, Sofia City Province, Bulgaria

Tide - Escalations Associate with German

Tide

Sofia, Sofia City Province, Bulgaria (Hybrid)
1 Month ago
Vertexbee studios - Senior 3D Character Artist

Vertexbee studios

Sofia, Sofia City Province, Bulgaria (Hybrid)
1 Year ago
Workato - Staff Software Engineer

Workato

Sofia, Sofia City Province, Bulgaria (Remote)
1 Week ago
luxsoft - Regular Build System Engineer with DevOps on Linux

luxsoft

Madan, Smoljan, Bulgaria (Remote)
3 Weeks ago
CyberArk - Software Engineer

CyberArk

Bulgaria (On-Site)
2 Weeks ago
CyberArk - Senior Software Engineer, Python

CyberArk

Bulgaria (Hybrid)
1 Week ago
creative assembly - Audio Programmer

creative assembly

Sofia, Sofia City Province, Bulgaria (On-Site)
3 Months ago
kaizen gaming  - Senior Backend Engineer

kaizen gaming

Sofia, Sofia City Province, Bulgaria (Hybrid)
1 Month ago
Tide - Social Media Associate with German

Tide

Bulgaria (On-Site)
1 Week ago
Progress - Senior Marketing Operations Analyst

Progress

Sofia, Sofia City Province, Bulgaria (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Discord - Data Scientist, Analytics - Safety Response

Discord

San Francisco, California, United States (On-Site)
1 Week ago
blend - Senior Data Scientist

blend

Hyderabad, Telangana, India (On-Site)
3 Weeks ago
ness digital  - Senior Data Scientist

ness digital

Iași, Iași County, Romania (On-Site)
1 Month ago
Discord - Senior Financial Analyst, Business Partnership

Discord

San Francisco, California, United States (On-Site)
1 Month ago
Addepar - Portfolio Data Operations Analyst

Addepar

Edinburgh, Scotland, United Kingdom (On-Site)
1 Week ago
Cognite - Senior Data Engineer

Cognite

Oslo, Oslo, Norway (Hybrid)
1 Year ago
Roblox - Senior Data Scientist - Creator

Roblox

San Mateo, California, United States (On-Site)
1 Month ago
Roblox - Senior Data Scientist - Ecosystem and Learning Platform

Roblox

San Mateo, California, United States (On-Site)
1 Month ago
truecaller - Senior Data Scientist

truecaller

Bengaluru, Karnataka, India (On-Site)
2 Months ago
N-ix - Senior Data Engineer

N-ix

Brazil (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Progress (Nasdaq: PRGS) empowers organizations to achieve transformational success in the face of disruptive change. Our software enables our customers to develop, deploy and manage responsible AI-powered applications and experiences with agility and ease. Customers get a trusted provider in

Progress, with the products, expertise and vision they need to succeed. Over 4 million developers and technologists at hundreds of thousands of enterprises depend on Progress. Learn more at www.progress.com.

Brno, South Moravian Region, Czechia (On-Site)

United States (Remote)

Limerick, County Limerick, Ireland (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Sofia, Sofia City Province, Bulgaria (Hybrid)

Sofia, Sofia City Province, Bulgaria (Hybrid)

Raleigh, North Carolina, United States (Hybrid)

Burlington, Massachusetts, United States (Hybrid)

Burlington, Massachusetts, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by progress

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug