Senior Software Engineer II -Machine Learning
Sumo logic
Job Summary
As a Senior Machine Learning Engineer at Sumo Logic, you will design, implement, and optimize agentic AI components for next-generation AI-powered analytics in observability and security. This role involves leveraging machine learning, large language models, and GPU-accelerated inference to create autonomous AI agents that process massive, heterogeneous log data in real-time, contributing to cutting-edge LLM infrastructure and defining best practices in context engineering and AI observability.
Must Have
- Design, implement, and optimize agentic AI components (tools, memory management, prompts).
- Develop and maintain golden datasets by defining sourcing strategies and ensuring quality.
- Prototype and evaluate novel prompting strategies and reasoning chains.
- Collaborate cross-functionally with product, data, and infrastructure teams.
- Operate autonomously in a fast-paced, ambiguous environment.
- Ensure reliability, performance, and observability of deployed agents.
- Maintain a strong bias for action, delivering incremental improvements.
- B.Tech, M.Tech, or Ph.D. in Computer Science, Data Science, or a related field.
- 6+ years of hands-on industry experience with demonstrable ownership and delivery.
- Strong understanding of machine learning fundamentals, data pipelines, and model evaluation.
- Proficiency in Python and ML/data libraries (NumPy, pandas, scikit-learn).
- Working knowledge of LLM core concepts, prompt design, and agentic design patterns.
- Experience with distributed systems and dealing with large amounts of data.
- Strong communication skills and a passion for shaping emerging AI paradigms.
Good to Have
- Experience leading a team of engineers.
- Prior experience building and deploying AI agents or LLM applications in production.
- Familiarity with modern agentic AI frameworks (e.g., LangGraph, LangChain, CrewAI).
- Experience with ML infrastructure and tooling (PyTorch, MLflow, Airflow, Docker, AWS).
- Exposure to LLM Ops (infrastructure optimization, observability, latency, and cost monitoring).
- Familiarity with JVM languages.
Job Description
Want to build the next generation of AI-powered analytics and intelligent agents for observability and security use cases? Passionate about leveraging cutting-edge machine learning, large language models, and GPU-accelerated inference to transform how organizations understand and act on their logs data? Come talk with us!
As a Senior Machine Learning Engineer, you’ll build the intelligence behind the next generation of agentic AI systems that reason over massive, heterogeneous log data. You’ll combine machine learning, prompt engineering, context engineering and rigorous evaluation to create autonomous AI agents that help organizations understand and act on their data in real time.
You’ll be part of a small, high-impact team shaping how AI agents understand complex machine data. This is an opportunity to work on cutting-edge LLM infrastructure and contribute to defining best practices in context engineering and AI observability.
Responsibilities
- Design, implement, and optimize agentic AI components, including tools, memory management, and prompts.
- Develop and maintain golden datasets by defining sourcing strategies, working with data vendors, and ensuring quality and representativeness at scale.
- Prototype and evaluate novel prompting strategies and reasoning chains for model reliability and interpretability.
- Collaborate cross-functionally with product, data, and infrastructure teams to deliver end-to-end AI-powered insights.
- Operate autonomously in a fast-paced, ambiguous environment - defining scope, setting milestones, and driving outcomes.
- Ensure reliability, performance, and observability of deployed agents through rigorous testing and continuous improvement.
- Maintain a strong bias for action—delivering incremental, well-tested improvements that directly enhance customer experience.
Required Qualifications
- B.Tech, M.Tech, or Ph.D. in Computer Science, Data Science, or a related field.
- 6+ years of hands-on industry experience with demonstrable ownership and delivery.
- Strong understanding of machine learning fundamentals, data pipelines, and model evaluation.
- Proficiency in Python and ML/data libraries (NumPy, pandas, scikit-learn); familiarity with JVM languages is a plus.
- Working knowledge of LLM core concepts, prompt design, and agentic design patterns.
- Experience with distributed systems and dealing with large amounts of data.
- Strong communication skills and a passion for shaping emerging AI paradigms.
Desired Qualifications
- Experience leading a team of engineers.
- Prior experience building and deploying AI agents or LLM applications in production.
- Familiarity with modern agentic AI frameworks (e.g., LangGraph, LangChain, CrewAI).
- Experience with ML infrastructure and tooling (PyTorch, MLflow, Airflow, Docker, AWS).
- Exposure to LLM Ops - infrastructure optimization, observability, latency, and cost monitoring.
About Us
Sumo Logic, Inc. empowers the people who power modern, digital business. Sumo Logic enables customers to deliver reliable and secure cloud-native applications through its Sumo Logic SaaS Analytics Log Platform, which helps practitioners and developers ensure application reliability, secure and protect against modern security threats, and gain insights into their cloud infrastructures. Customers worldwide rely on Sumo Logic to get powerful real-time analytics and insights across observability and security solutions for their cloud-native applications. For more information, visit www.sumologic.com.
Sumo Logic Privacy Policy. Employees will be responsible for complying with applicable federal privacy laws and regulations, as well as organizational policies related to data protection.
Create a Job Alert
Interested in building your career at Sumo Logic? Get future opportunities sent straight to your email.
Apply for this job
- indicates a required field
Autofill with MyGreenhouse
First Name*
Last Name*
Preferred First Name
Email*
Phone
Country
Phone
Resume/CV
AttachAttach
Dropbox
Google Drive
Enter manuallyEnter manually
Accepted file types: pdf, doc, docx, txt, rtf
Cover Letter
AttachAttach
Dropbox
Google Drive
Enter manuallyEnter manually
Accepted file types: pdf, doc, docx, txt, rtf
- * *
LinkedIn Profile
How did you hear about this job?
What is your current location?*
How many years of experience you have?*
Select...
Do you have extensive experience on AI/ML or GenAI projects?*
Select...
Current CTC*
Expected CTC*
How long is the notice period?*
Select...
Submit application