Research Software Developer II

26 Minutes ago • 2-4 Years • Data Analyst

About the job

Job Description

The Microsoft Translation team seeks a Research Software Developer II to enhance data collection and processing for machine translation models. Responsibilities include discovering and evaluating data sources, creating and evaluating algorithms for data processing (text extraction, sentence alignment, normalization), productizing research prototypes into scalable pipelines, and maintaining data infrastructure for large-scale text data. The role involves working with large datasets, applying data science, machine learning, and NLP techniques to improve the quality and coverage of translation models. This position requires strong coding skills (C#, Python), experience with large-scale data processing tools, and collaboration with research scientists.
Must have:
  • Bachelor's Degree in CS or related field
  • 2+ years experience in software development (C# or Python)
  • Experience with web-scale data mining/data science
  • AI/ML/NLP experience (preferably in production)
  • Data processing pipeline development and maintenance
Good to have:
  • Experience in cloud ecosystems (Azure, AWS)
  • Linguistics/NLP coursework
  • Master's Degree in Computer Science
Perks:
  • Industry-leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investments
  • Maternity/paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Overview

The Microsoft Translation team is on a mission to enable communication without language barriers. We offer state-of-the-art machine translation for more than 130 languages. Recently the team has introduced new features such as Document Translation, online and offline containers, as well as custom neural dictionaries. 

  

Training world class models requires world-class data. Even the best model architectures are useless if you don’t have the data to train them with.

 

As we work to expand our language coverage and quality, our needs for data quantity and quality are expanding. We are looking for a talented Research Software Developer II to join our small data team and help us to identify and collect high-quality data at large scale. The ideal candidate will have a passion for analyzing and experimenting with large-scale data, writing quality code, and a knack for developing systems that are testable, redundant and scalable. You will be working in the fields of data science, data mining, machine learning, deep neural networks and natural language processing. You will directly collaborate with experienced Machine Learning, NLP and Machine Translation scientists. This position will require work in both research and engineering domains.

 

The ideal candidate must be comfortable both with exploring new ideas and algorithms and implementing them in a robust and scalable manner. This is a fantastic opportunity to make a real difference in the quality of our system. If you are excited about making a real difference in the quality of our system, we would love to hear from you. 

 

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required/Minimum Qualifications 

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C# or Python.
    • OR equivalent experience.
  • At least one of the following: 
  • Experience with web-scale data mining / data science (Hadoop MapReduce, Scope, Spark, R, etc.)

OR

  • Experience doing AI, Machine Learning, NLP, Machine Translation, preferably in a production environment

Other Requirements 

 

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: 
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. 

 

 

Additional or Preferred Qualification:

  • Bachelor's Degree in Computer Science
    • OR related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript,
    • OR Python
    • OR Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • Experience programming in cloud ecosystems (Azure Storage, ADLS, Cosmos or their AWS equivalents)
  • Linguistics and/or NLP coursework or experience 

Software Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:


Microsoft will accept applications for the role until January 16, 2025. 

 

Responsibilities

  • Discover new data sources and evaluate their quality to enrich our data coverage 
  • Work with researchers to create and evaluate prototypes for new algorithms 
  • Explore and evaluate new data processing tools and algorithms (for example text extraction, sentence extraction, parallel data alignment, normalization, duplicate identification) to improve the existing data processing pipelines.  
  • Productize research prototypes into end-to-end pipelines 

Maintain the existing pipelines for: 

  • Automated discovery and identification of language data from multiple domains at web scale 
  • Text data processing (text extraction, sentence extraction, parallel data alignment, normalization, duplicate identification) 
  • Large scale text data storage infrastructure (import, export, query) 
  • Data cleaning and filtering 

Embody our and

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$98.3K - $208.8K/yr (Outscal est.)
$153.6K/yr avg.
Redmond, Washington, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Bengaluru, Karnataka, India (On-Site)

Singapore (On-Site)

Taipei City, Taiwan (Hybrid)

Bucharest, Bucharest, Romania (On-Site)

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

Next Level Business Services - Java Developer

Next Level Business Services, United States (On-Site)

ICE - Engineer II, QA

ICE, India (Hybrid)

Baazi Games - Senior Software Engineer

Baazi Games, India (Hybrid)

Playtech - Junior Java Developer

Playtech, (On_site)

Casumo - Data Analyst

Casumo, (Hybrid)

Evernorth Health Services - Software Engineering Advisor [T500-10128]

Evernorth Health Services, India (On-Site)

Lytx,  Inc  - Staff Data Engineer

Lytx, Inc , India (Hybrid)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

SKYDANCE - ShotGrid Production Manager

SKYDANCE, United States (Hybrid)

Infoblox - Senior Enterprise Account Executive

Infoblox, United States (On-Site)

Cadence - Lead Software Engineer, Synthesis

Cadence, United States (On-Site)

Netflix - Creative Design Manager – Design Foundations

Netflix, United States (Remote)

ION - Senior UX Product Designer, New York

ION, United States (On-Site)

The Walt Disney Company - Senior Manager Digital Video Newsroom

The Walt Disney Company, United States (On-Site)

Gala - Senior Talent Acquisition Specialist

Gala, United States (On-Site)

Info Stretch - Senior .NET Developer

Info Stretch, United States (On-Site)

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

LSEG (London Stock Exchange Group) - Data Scientist

LSEG (London Stock Exchange Group), India (Hybrid)

Zeta - Sr. Associate - Data Privacy

Zeta, India (On-Site)

Fluence - Battery Data Scientist

Fluence, India (Hybrid)

Meta - Global Sales Analytics Lead

Meta, United States (Remote)

Chiselon Technologies   - Data Scientist

Chiselon Technologies , India (Hybrid)

Life church - Director of Analytics and Decision Support

Life church, United States (On-Site)

Coinbase - Senior Data Scientist

Coinbase, India (Remote)

 Sagecor Solutions - Software Engineer 3 (IDN - 075)

Sagecor Solutions, United States (On-Site)

Get notifed when new similar jobs are uploaded