Research Software Developer II

2 Weeks ago • 2-4 Years • Data Analyst • $98,300 PA - $208,800 PA

About the job

Job Description

The Microsoft Translation team seeks a Research Software Developer II to enhance data collection and processing for machine translation models. Responsibilities include discovering and evaluating data sources, creating and evaluating algorithms for data processing (text extraction, sentence alignment, normalization), productizing research prototypes into scalable pipelines, and maintaining data infrastructure for large-scale text data. The role involves working with large datasets, applying data science, machine learning, and NLP techniques to improve the quality and coverage of translation models. This position requires strong coding skills (C#, Python), experience with large-scale data processing tools, and collaboration with research scientists.
Must have:
  • Bachelor's Degree in CS or related field
  • 2+ years experience in software development (C# or Python)
  • Experience with web-scale data mining/data science
  • AI/ML/NLP experience (preferably in production)
  • Data processing pipeline development and maintenance
Good to have:
  • Experience in cloud ecosystems (Azure, AWS)
  • Linguistics/NLP coursework
  • Master's Degree in Computer Science
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investments
  • Maternity/paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Overview

The Microsoft Translation team is on a mission to enable communication without language barriers. We offer state-of-the-art machine translation for more than 130 languages. Recently, the team introduced new features such as Document Translation, online and offline containers, and custom neural dictionaries.

Training world-class models requires world-class data. Even the best model architectures are useless if you don’t have the data to train them with.

As we work to expand our language coverage and quality, our needs for data quantity and quality are growing. We are looking for a talented Research Software Developer II to join our small data team and help us identify and collect high-quality data at large scale. The ideal candidate will have a passion for analyzing and experimenting with large-scale data and for writing quality code, along with a knack for developing systems that are testable, redundant, and scalable. You will work in the fields of data science, data mining, machine learning, deep neural networks, and natural language processing. You will collaborate directly with experienced Machine Learning, NLP, and Machine Translation scientists. This position requires work in both research and engineering domains.

The ideal candidate must be comfortable both exploring new ideas and algorithms and implementing them in a robust and scalable manner. This is a fantastic opportunity to make a real difference in the quality of our system. If that excites you, we would love to hear from you.

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required/Minimum Qualifications 

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C# or Python.
    • OR equivalent experience.
  • At least one of the following:
    • Experience with web-scale data mining / data science (Hadoop MapReduce, Scope, Spark, R, etc.)
    • OR experience doing AI, Machine Learning, NLP, Machine Translation, preferably in a production environment

Other Requirements 

  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening:
    • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Additional or Preferred Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Experience programming in cloud ecosystems (Azure Storage, ADLS, Cosmos or their AWS equivalents)
  • Linguistics and/or NLP coursework or experience 

Software Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:


Microsoft will accept applications for the role until January 16, 2025. 

Responsibilities

  • Discover new data sources and evaluate their quality to enrich our data coverage 
  • Work with researchers to create and evaluate prototypes for new algorithms 
  • Explore and evaluate new data processing tools and algorithms (for example text extraction, sentence extraction, parallel data alignment, normalization, duplicate identification) to improve the existing data processing pipelines.  
  • Productize research prototypes into end-to-end pipelines 

  • Maintain the existing pipelines for:
    • Automated discovery and identification of language data from multiple domains at web scale
    • Text data processing (text extraction, sentence extraction, parallel data alignment, normalization, duplicate identification)
    • Large scale text data storage infrastructure (import, export, query)
    • Data cleaning and filtering

Embody our culture and values

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
$98.3K - $208.8K/yr (Outscal est.)
$153.6K/yr avg.
Redmond, Washington, United States

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.
