Senior Data R&D Engineer (Python, Research Tools Development)

10 Minutes ago • 5 Years +
Data Analysis

Job Description

The Senior Data R&D Engineer will join the Speech Input Division's Data Team, focusing on the data backend for speech recognition, NLU, and AI models. This role involves ensuring systems are updated with named entities from multiple sources, maintaining data purity, relevance, and optimizing data pipelines for performance. Key responsibilities include text processing in multiple languages, content extraction from databases, CI/CD for data pipelines, web scraping, and developing research tools.
Good To Have:
  • Elasticsearch
  • Data Visualization
  • Experience/Knowledge with cloud computing infrastructure
  • Experience in text data parsing and data mining
  • Experience with data versioning tools such as GIT, DVC
  • Knowledge of orchestration frameworks such as Airflow
  • Text encoding
  • Background in natural language understanding, data science, or speech recognition
  • Knowledge of multiple languages (Indian languages as well as international)
Must Have:
  • Develop and maintain data backend for AI models.
  • Manage named entities from multiple data sources.
  • Optimize data pipelines for performance and purity.
  • Perform text processing in multiple languages.
  • Implement CI/CD for data pipelines.
  • Develop web scrapers and research tools.
  • Test and monitor team's infrastructure.
  • MSc/MTech in Computer Science or equivalent.
  • Minimum 5 years of work experience.
  • 2+ years of Python experience, including Pandas.
  • Proficiency in JavaScript, Spark/Pyspark.
  • Experience with Unix/Linux.
  • Ability to query SQL and RDF databases.
Perks:
  • Annual bonus opportunity
  • Insurance coverage (medical, life, and disability)
  • Paid time off
  • Paid holidays
  • Company contribution to the RRSP (Registered Retirement Savings Plan)
  • Equity awards for certain positions and levels
  • Remote and/or hybrid work available depending on the position

Add these skills to join the top 1% applicants for this job

data-analytics
github
game-texts
linux
unix
data-visualization
elasticsearch
spark
data-science
pandas
ci-cd
git
python
sql
javascript

A Moving Experience.

Do you have a passion for pushing the boundaries of innovation? Are you excited about AI’s potential to improve the human experience? Then come join the ride!

Who is Cerence AI?

Cerence AI is the global leader in AI for transportation, specialized in building AI and voice-powered companions for cars, two-wheelers, and more that enable people to focus on what matters most. With over 500 million cars shipped with Cerence technology, we partner with leading automakers (such as Volkswagen, Mercedes, Audi, Toyota and many more), mobility providers, and technology companies to power intuitive, integrated experiences that create safer, more connected, and more enjoyable journeys for drivers and passengers alike.

Our Driving Force

Our team is dedicated to pushing the boundaries of AI innovation, working around the globe with headquarters in Burlington, Massachusetts, USA and 16 other offices across Europe, Asia, and North America. We bring together diverse backgrounds and varied skill sets with the shared goal of advancing the next generation of transportation user experiences. Our culture is customer-centric, collaborative, fast-paced, and fun, with continuous opportunities for learning and development to support your career growth.

Interested in having a significant impact in a dynamic industry with a high-performing global team?

The Speech Input Division is looking for an excellent Senior software engineer or Senior developer to join the Data Team. We are responsible for the data backend of speech recognition, NLU and AI models. We need you to help ensuring that our systems are always aware of named entities such as artists, music or movies which are aggregated from multiple sources. A key aspect is to ensure the content is pure, matching what end users speak, and sorted by relevance. You will also work on keeping the data pipelines organized well and performant.

Your Impact:

  • Text processing in multiple languages
  • Extract content from databases
  • CI/CD for data pipelines
  • Write web scrapers and spiders
  • Development of research tools
  • Testing and monitoring of the team’s infrastructure

What You Bring:

  • Education: MSc / MTech in Computer Science, Engineering, or equivalent.
  • Excellent BSc / BTech candidates can be considered.
  • 2+ years of experience utilizing Python
  • Minimum years of work experience: 5
  • Scripting skills in Python, especially with Pandas library
  • JavaScript & Spark/Pyspark
  • Unix/Linux on user level
  • Querying large databases in SQL, RDF or similar
  • Excellent written and spoken English
  • Comfortable working in an international, distributed team
  • Elasticsearch & Data Visualization(preferred skill)
  • Experience/Knowledge with cloud computing infrastructure (preferred skill)
  • Experience in text data parsing and data mining (preferred skill)
  • Experience with data versioning tools such as GIT, DVC (preferred skill)
  • Knowledge of orchestration frameworks such as Airflow (preferred skill)
  • Text encoding (preferred skill)
  • Background in natural language understanding, data science, or speech recognition (preferred skill)
  • Knowledge of multiple languages is a plus, Indian languages as well as international (preferred skill)

What we offer

We offer a generous compensation and benefits package (in addition to the base salary), including:

  • Annual bonus opportunity
  • Insurance coverage (medical, life, and disability)
  • Paid time off
  • Paid holidays
  • Company contribution to the RRSP (Registered Retirement Savings Plan)
  • Equity awards for certain positions and levels
  • Remote and/or hybrid work available depending on the position
  • All compensation and benefits are subject to the terms and conditions of the underlying plans or programs, as applicable, and may be amended, terminated, or replaced from time to time.

Cerence Inc. (Nasdaq: CRNC and www.cerence.com) is the global industry leader in creating unique, moving experiences for the automotive world. Spun out from Nuance in October 2019, Cerence is a new, independent company that has quickly gained traction as a leader in the automotive voice assistant space, working with all of the world’s leading automakers – from Ford and Fiat Chrysler to Daimler, Audi and BMW to Geely and SAIC – to transform how a car feels, responds and learns. Its track record is built on more than 20 years of industry experience and leadership and more than 500 million cars on the road today across more than 70 languages.

As Cerence looks to the future and continues an ambitious growth agenda, we need someone to join the team and help build the future of voice and AI in cars. This is an exciting opportunity to join Cerence’s passionate, dedicated, global team and be a part of meaningful innovation in a rapidly growing industry.

EQUAL OPPORTUNITY EMPLOYER

Cerence is firmly committed to Equal Employment Opportunity (EEO) and to compliance with all federal, state and local laws that prohibit employment discrimination on the basis of age, race, color, gender, gender identity, gender expression, sex, sex stereotyping, pregnancy, national origin, ancestry, religion, physical or mental disability, medical condition, marital status, citizenship status, sexual orientation, protected military or veteran status, genetic information and other protected classifications. Cerence Equal Employment Opportunity Policy Statement.

All prospective and current Employees need to remain vigilant when it comes to executing security policies in the workplace. This includes:

  • Following workplace security protocols and training programs to familiarize with the ways to maintain a safe workplace.
  • Following security procedures to report any suspicious activity.
  • Having respect for corporate security procedures to allow those procedures to be effective.
  • Adhering to company's compliance and regulations.
  • Encouraging to follow a zero tolerance for workplace violence.
  • Basic knowledge of information security and data privacy requirements (e.g., how to protect data & how to be handling this data).
  • Demonstrative knowledge of information security through internal training programs.

About Us

At Cerence AI, we help the world's leading automotive and technology brands leverage AI to create safer, more productive and more joy-filled brand and user experiences. We're looking for motivated, collaborative individuals who come alive with big challenges and are excited about AI’s potential to shape the future of how people experience the world.

Set alerts for more jobs like Senior Data R&D Engineer (Python, Research Tools Development)
Set alerts for new jobs by Cerence
Set alerts for new Data Analysis jobs in India
Set alerts for new jobs in India
Set alerts for Data Analysis (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙