Senior AI/NLP Engineer

1 Month ago • 4-5 Years • Research Development

Job Summary

Job Description

We are looking for a skilled Document AI / NLP Engineer to develop intelligent systems that extract meaningful data from documents such as PDFs, scanned images, and forms. In this role, you will build document processing pipelines using OCR and NLP technologies, fine-tune ML models for tasks like entity extraction and classification, and integrate those solutions into scalable cloud-based applications. You will collaborate with cross-functional teams to deliver high-performance, production-ready pipelines and stay up to date with advancements in the document understanding and machine learning space.
Must have:
  • Design, build, and optimize document parsing pipelines using tools like Amazon Textract, Azure Form Recognizer, or Google Document AI.
  • Perform data preprocessing, labeling, and annotation for training machine learning and NLP models.
  • Fine-tune or train models for tasks such as Named Entity Recognition (NER), text classification, and layout understanding using PyTorch, TensorFlow, or HuggingFace Transformers.
  • Integrate document intelligence capabilities into larger workflows and applications using REST APIs, microservices, and cloud components (e.g., AWS Lambda, S3, SageMaker).
  • Evaluate model and OCR accuracy, applying post-processing techniques or heuristics to improve precision and recall.
  • Collaborate with data engineers, DevOps, and product teams to ensure solutions are robust, scalable, and meet business KPIs.
  • Monitor, debug, and continuously enhance deployed document AI solutions.
  • Maintain up-to-date knowledge of industry trends in OCR, Document AI, NLP, and machine learning.

Job Details

Project description

We are looking for a skilled Document AI / NLP Engineer to develop intelligent systems that extract meaningful data from documents such as PDFs, scanned images, and forms. In this role, you will build document processing pipelines using OCR and NLP technologies, fine-tune ML models for tasks like entity extraction and classification, and integrate those solutions into scalable cloud-based applications.

You will collaborate with cross-functional teams to deliver high-performance, production-ready pipelines and stay up to date with advancements in the document understanding and machine learning space.

Responsibilities

  • Design, build, and optimize document parsing pipelines using tools like Amazon Textract, Azure Form Recognizer, or Google Document AI.
  • Perform data preprocessing, labeling, and annotation for training machine learning and NLP models.
  • Fine-tune or train models for tasks such as Named Entity Recognition (NER), text classification, and layout understanding using PyTorch, TensorFlow, or HuggingFace Transformers.
  • Integrate document intelligence capabilities into larger workflows and applications using REST APIs, microservices, and cloud components (e.g., AWS Lambda, S3, SageMaker).
  • Evaluate model and OCR accuracy, applying post-processing techniques or heuristics to improve precision and recall.
  • Collaborate with data engineers, DevOps, and product teams to ensure solutions are robust, scalable, and meet business KPIs.
  • Monitor, debug, and continuously enhance deployed document AI solutions.
  • Maintain up-to-date knowledge of industry trends in OCR, Document AI, NLP, and machine learning.

Skills

Must have

  • 4-5 years of hands-on experience in machine learning, document AI, or NLP-focused roles.
  • Strong expertise in OCR tools and frameworks, especially Amazon Textract, Azure Form Recognizer, Google Document AI, or open-source tools like Tesseract, LayoutLM, or PaddleOCR.
  • Solid programming skills in Python and familiarity with ML/NLP libraries: scikit-learn, spaCy, transformers, PyTorch, TensorFlow, etc.
  • Experience working with structured and unstructured data formats, including PDF, images, JSON, and XML.
  • Hands-on experience with REST APIs, microservices, and integrating ML models into production pipelines.
  • Working knowledge of cloud platforms, especially AWS (S3, Lambda, SageMaker) or their equivalents.
  • Understanding of NLP techniques such as NER, text classification, and language modeling.
  • Strong debugging, problem-solving, and analytical skills.
  • Clear verbal and written communication skills for technical and cross-functional collaboration.

Nice to have

  • N/A

Other

  • Languages: English: B2 Upper Intermediate
  • Seniority: Senior

Similar Jobs

Apple - Display Algorithm Engineer

Apple

San Diego, California, United States (On-Site)
2 Months ago
GHX - Senior Process Automation Program Manager

GHX

Hyderabad, Telangana, India (On-Site)
2 Months ago
Apple - GPU Silicon Triage Engineer

Apple

Austin, Texas, United States (On-Site)
3 Months ago
Tesla - Supply Chain Program Manager

Tesla

Brandenburg, Germany (On-Site)
6 Months ago
Scout - Staff Technical Program Manager

Scout

Fremont, California, United States (On-Site)
3 Months ago
rivos - Deep Learning Libraries Engineer

rivos

United Kingdom (Hybrid)
1 Year ago
Nagarro - Staff Engineer, Machine Learning

Nagarro

Gurugram, Haryana, India (On-Site)
10 Months ago
Apple - Engineering Program Manager, AIML Privacy and Regulatory Compliance

Apple

Cupertino, California, United States (On-Site)
3 Months ago
Kaedim - Machine Learning Engineer

Kaedim

London, England, United Kingdom (On-Site)
1 Year ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Hyderabad, Telangana, India (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Rockstar Games - Director of Security Operations

Rockstar Games

New York, United States (On-Site)
3 Months ago
Sonar Source - LLM Engineer

Sonar Source

Singapore (On-Site)
5 Months ago
Axon - Senior Product Manager

Axon

London, England, United Kingdom (On-Site)
1 Month ago
Corsair - DIY Marketing Specialist, Vietnam

Corsair

Vietnam (On-Site)
4 Months ago
New Globe - Front-end Engineer - Software Development

New Globe

Lagos, Lagos, Nigeria (Hybrid)
3 Weeks ago
IT Gurus Software - ETL Test Automation Engineer (ETL Tester)

IT Gurus Software

Pune, Maharashtra, India (On-Site)
10 Months ago
Funko - Buyer

Funko

London, England, United Kingdom (On-Site)
2 Months ago
Figma - Account Executive, SMB

Figma

San Francisco, California, United States (Hybrid)
1 Month ago
Pomelo - Payer Sales Executive

Pomelo

United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Capgemini - Powerflex Engineer

Capgemini

Gurugram, Haryana, India (On-Site)
3 Months ago
Salesforce - Territory Account Executive - SMB

Salesforce

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Any Desk - Channel Business Development Associate

Any Desk

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Google - Software Engineer, PhD

Google

Bengaluru, Karnataka, India (On-Site)
3 Months ago
AiDash - Software Development Engineer - III (DevOps)

AiDash

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Qualcomm - AI ML Engineer

Qualcomm

Hyderabad, Telangana, India (On-Site)
3 Months ago
Saama - Technical Project Manager

Saama

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
Accenture - Int Controls & Compliance Specialist

Accenture

Gurugram, Haryana, India (On-Site)
4 Months ago
Paytm - Key Account Manager

Paytm

Delhi, India (On-Site)
2 Months ago
Springer Group - Implementation Specialist, Systems Optimisation

Springer Group

Pune, Maharashtra, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

Apple - Generative AI, Machine Learning Engineer

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Voki games - Machine Learning Engineer

Voki games

Kyiv, Kyiv City, Ukraine (Hybrid)
1 Month ago
ALTEN - Stage Innovation: Artificial Intelligence / Computer Vision Engineer

ALTEN

Toulouse, Occitanie, France (On-Site)
2 Months ago
bytedance - Machine Learning Scientist, Scaling AI for Biology

bytedance

Seattle, Washington, United States (On-Site)
9 Months ago
gismart - Machine Learning Engineer

gismart

(On-Site)
1 Month ago
Apple - AIML Triage and Diagnostic Tooling Engineer, AIML Integration and Delivery

Apple

Santa Clara, California, United States (On-Site)
3 Months ago
bytedance - Machine Learning Engineer - MLDev

bytedance

San Jose, California, United States (On-Site)
4 Months ago
Egnyte - Sr Software Engineer - AI

Egnyte

India (Remote)
9 Months ago
Scale AI - Applied AI Engineer

Scale AI

San Francisco, California, United States (On-Site)
3 Months ago
Bosch Group - AI Research Scientist – GenAI

Bosch Group

Sunnyvale, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Empower your future with Luxoft: Innovate, thrive and grow in a software-defined world.

Kraków, Lesser Poland Voivodeship, Poland (Remote)

Wrocław, Lower Silesian Voivodeship, Poland (Remote)

Gdańsk, Pomeranian Voivodeship, Poland (Remote)

Warsaw, Masovian Voivodeship, Poland (Remote)

Bengaluru, Karnataka, India (On-Site)

Chennai, Tamil Nadu, India (On-Site)

View All Jobs

Get notified when new jobs are added by luxsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug