Job Title: Data Engineer
Company: P99Soft Pvt Ltd
Location: Hyderabad/Bangalore/Pune
Job Type: Full-time
About Us:
P99Soft Pvt Ltd is an innovative tech company at the forefront of AI-driven solutions, empowering businesses with cutting-edge technologies. We specialize in delivering data-driven insights, and our mission is to build scalable, intelligent systems that solve real-world challenges. We are looking for a passionate and skilled Data Engineer to join our growing team and help shape the future of data science and AI.
Role Overview:
We are seeking a highly motivated, hands-on Data Engineer with expertise in large language models (LLMs), OpenAI tools, and Generative AI (GenAI) technologies. This role requires a strong foundation in data engineering practices, along with deep familiarity with machine learning, AI model integration, and large-scale data processing for AI-driven applications.
As a Data Engineer at P99Soft, you will work closely with data scientists, software engineers, and AI researchers to build scalable data infrastructure and deploy cutting-edge AI models, including OpenAI's tools and custom Generative AI solutions.
Key Responsibilities:
- Design, develop, and maintain robust data pipelines to support machine learning and AI applications, ensuring data integrity and efficiency.
- Collaborate with cross-functional teams to collect, process, and analyze data from various sources for AI model training, validation, and deployment.
- Work with large language models (LLMs) and generative AI technologies (e.g., OpenAI GPT) to implement intelligent systems and applications.
- Optimize and scale AI/ML pipelines to handle vast amounts of structured and unstructured data efficiently.
- Support the deployment of AI models to production environments and their ongoing monitoring, ensuring high performance and low latency.
- Implement best practices for data management, data quality, and data security in AI-driven systems.
- Troubleshoot and resolve data-related issues and bottlenecks across the AI infrastructure.
- Contribute to the design of data architecture to support the integration of AI models and machine learning frameworks.
Key Requirements:
- 2-5 years of hands-on experience as a Data Engineer or in a similar role.
- Proven experience working with large language models (LLMs), OpenAI technologies, and Generative AI (GenAI) frameworks.
- Strong proficiency in Python, SQL, and other programming languages commonly used in data engineering.
- Experience with data processing and streaming frameworks (e.g., Apache Spark, Apache Kafka, Hadoop) and cloud platforms (e.g., AWS, GCP, Azure).
- Familiarity with machine learning models, model deployment, and API integration.
- Solid understanding of data warehousing, ETL processes, and data architecture.
- Knowledge of data security, privacy standards, and compliance regulations.
- Strong problem-solving skills and ability to work in a fast-paced, collaborative environment.
- Excellent communication skills to work with cross-functional teams and stakeholders.
Preferred Skills:
- Experience with OpenAI’s GPT models and API integrations.
- Familiarity with advanced AI techniques such as reinforcement learning and unsupervised learning.
- Knowledge of containerization and orchestration (e.g., Docker, Kubernetes) and CI/CD pipelines.
- Experience working with both structured and unstructured data (e.g., text, images, audio).
- Exposure to advanced data analytics and visualization tools (e.g., Tableau, Power BI).