We are seeking a Senior AI Engineer with strong expertise in Generative AI (GenAI), Databricks, and end-to-end ML/LLM systems. This role involves designing, building, and deploying intelligent, scalable GenAI solutions integrated into enterprise-grade data and analytics platforms. The ideal candidate will lead AI agentic workflows, data pipeline optimization, and model-driven automation using Databricks, MLflow, and Azure/Snowflake ecosystems, combining strong software engineering, MLOps, and LLM engineering experience.
Must Have:- Design and implement end-to-end Generative AI solutions on Databricks, leveraging Unity Catalog, MLflow, Delta Lake, and Vector Search.
- Architect LLM-based multi-agent frameworks for intelligent automation, chatbot systems, and document reasoning tasks.
- Integrate Cortex AI, OpenAI, or Anthropic APIs for retrieval-augmented generation (RAG), conversational reasoning, and workflow orchestration.
- Fine-tune and evaluate LLMs and domain-specific NLP models (NER, Risk Assessment, Question Answering).
- Develop pipelines for prompt engineering, context management, model evaluation, and hallucination detection.
- Collaborate with data engineering teams to ensure clean, well-governed, and vectorized data pipelines.
- Build and maintain feature stores and embeddings stores using Databricks or Snowflake.
- Implement data validation, lineage, and monitoring using Delta Live Tables and Unity Catalog.
- Build reusable ML pipelines using Databricks Repos, MLflow, and Feature Store.
- Automate deployment, monitoring, and retraining workflows for continuous model improvement.
- Strong background in Machine Learning, NLP, and LLMs (Transformers, RAG, embedding models).
- Proven experience fine-tuning or implementing models using Hugging Face, LangChain, LlamaIndex, or OpenAI API.
- Expertise in Databricks (Delta Lake, MLflow, Unity Catalog, Feature Store, Vector Search).
- Strong proficiency in Python, SQL, PySpark, and Databricks Notebooks.
- Experience building modular codebases, deploying APIs, and working with CI/CD pipelines (GitHub Actions, Azure DevOps).
- Hands-on with MLflow tracking, model registry, and experiment management.
- Need GenAI Data Scientist – Databricks certified ML Engineer and work closely with customers.
- Use case will involve data extract from pdf-based documents.
- Leverage Databricks native solutions.