Middle Machine Learning Engineer (Document Management System)

1 Month ago • 3 Years + • Artificial Intelligence

About the job

Summary

As a Middle Machine Learning Engineer, you will join Sigma Software's ML Competence Centre and contribute to the development and implementation of innovative NLP-based solutions for a leading provider of IT products in the Swedish and Danish public sectors. You will work under the supervision of the client's research team, validating ideas through PoCs, implementing and training learning models, and collaborating on optimal model selection. You will be responsible for applying best practices to integrate models into secure pipelines, ensuring the effective delivery of ML-based solutions. This role requires strong expertise in NLP techniques and hands-on experience with LLMs, frameworks like Langchain/Llamaindex, and libraries like NLTK, spaCy, and Hugging Face Transformers.
Must have:
  • 3+ years of hands-on experience in ML with a focus on NLP
  • Proficiency in Python
  • Proven experience with LLMs APIs and/or open-weight models
  • Proven experience with NLP techniques like tokenization, stemming, lemmatization, and NER
  • Experience with NLP libraries and frameworks like NLTK, spaCy, and Hugging Face Transformers
  • Excellent communication skills
  • Strong problem-solving skills
  • Ability to troubleshoot issues in model development and deployment
  • Upper-Intermediate level of English
Good to have:
  • Bachelor's or Master's degree in Computer Science, Data Science, Machine Learning, or a related field
  • Experience with PoC development and prototyping in enterprise solutions
  • Experience with LLM evaluation frameworks
  • Experience with fine-tuning open-source LLMs for domain and tasks
  • Knowledge of MLOPs pipeline and instruments
  • Familiarity with deep learning frameworks such as TensorFlow, PyTorch, or Keras
  • Familiarity with the deployment of NLP models in production environments
  • Knowledge of cloud platforms (AWS, Google Cloud, Azure) and their machine-learning services
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

Company Description

We invite you to join our ML Competence Centre, a key part of Sigma Software’s dynamic organizational structure that integrates diverse clients, intriguing projects, and opportunities to enhance your professional skills.

Your initial project will place you in the ML R&D Centre on the client’s side, where you will help verify and implement groundbreaking ideas driven by advancements in ML, particularly NLP, to enhance the customer’s products and deliver added value to their clients. 

CUSTOMER

Our client is a leading provider of high-quality IT products in the Swedish and Danish public sectors, with over 12 years of successful cooperation with us. Their automation solutions serve 80% of government agencies in Sweden. The company specializes in document management, enterprise content management (ECM), data sharing, digital preservation, GDPR compliance, ERP solutions, and more. They cater to various sectors, including government, banking, retail, manufacturing, and life sciences.

PROJECT

Currently, our client is establishing a PoC initiative to explore innovative ideas for potential advancements and improvements

Job Description

  • Work under the supervision of the client’s research team to validate various ideas through PoCs, and to implement, train, test, and tune different learning models
  • Collaborate with the research team to brainstorm and experiment with optimal models and open-source alternatives for NLP-based ideas to achieve the best possible results
  • Apply best practices to effectively and securely implement pipelines for integrating models and ML-based solutions

Qualifications

  • 3+ years of hands-on experience in ML, with a focus on NLP (text extraction & classification, anonymization and pseudonymization, document processing, clustering and other NLP tasks) 
  • Proficiency in Python
  • Proven experience with LLMs APIs and/or open-weight models (e.g.: LLaMA, Mistral), with the use of frameworks like Langchain/Llamaindex for implementing solutions/features
  • Proven experience with NLP techniques such as tokenization, stemming, lemmatization, and named entity recognition (NER)
  • Experience with NLP libraries and frameworks like NLTK, spaCy, and Hugging Face Transformers
  • Excellent communication skills for effective collaboration with cross-functional teams
  • Strong problem-solving skills and ability to troubleshoot issues in model development and deployment
  • At least an Upper-Intermediate level of English

WOULD BE A PLUS

  • Bachelor's or Master's degree in Computer Science, Data Science, Machine Learning, or a related field
  • Experience with PoC development and prototyping in enterprise solutions
  • Experience with LLM evaluation frameworks (e.g.: DeepEval, MLFlow, RAGAs, Deepchecks, etc.)
  • Experience with fine-tuning open-source LLMs for domain and tasks
  • Knowledge of MLOPs pipeline and instruments
  • Familiarity with deep learning frameworks such as TensorFlow, PyTorch, or Keras
  • Familiarity with the deployment of NLP models in production environments
  • Knowledge of cloud platforms (AWS, Google Cloud, Azure) and their machine-learning services 
View Full Job Description

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

Masovian Voivodeship, Poland (On-Site)

View All Jobs

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug