Staff Machine Learning Engineer (Video Insights)

JioStar · Bengaluru / Mumbai

Job Summary: As a Staff Machine Learning Engineer, you will demonstrate strong independence and technical proficiency while collaborating effectively within the team. You will uphold a standard of excellence, ensuring high-quality work and timely delivery of projects. Additionally, you will serve as the functional lead within your domain, providing guidance and expertise to team members.

About the team: Join our Video CoE team as a Senior ML Engineer focused on fine-tuning and optimizing large video models. You'll work with cutting-edge multimodal AI technology, experimenting with the latest video understanding models and customizing them for real-world use cases. Your work will directly impact millions of users by enabling smarter video insights and content understanding. Be part of a team that combines deep ML expertise with the infrastructure to deploy at scale.

Join our cutting-edge technology team focused on advancing video understanding through custom-tuned large video models. If you want to tackle hard and interesting ML problems at scale and create an impact within an entrepreneurial environment, join us!

Key responsibilities:

Fine-tune large video models (Vid-LLMs) using advanced techniques such as LoRA, QLoRA, and PEFT for specific video understanding tasks
Design and implement efficient model adaptation pipelines for domain-specific video content and use cases
Optimize model inference performance through quantization, knowledge distillation, and hardware-specific optimizations
Conduct extensive experimentation and ablation studies to identify optimal model configurations and hyperparameters
Build robust evaluation frameworks and metrics to assess model quality, generalization, and edge case performance
Collaborate with research and product teams to translate business requirements into model tuning objectives
Develop and maintain documentation of tuning methodologies, lessons learned, and best practices for the team
Contribute to open-source projects and stay current with the latest advancements in multimodal AI and video understanding

Skills and attributes for success:

6+ years of professional experience in machine learning engineering, with specific focus on deep learning and model fine-tuning

Advanced proficiency in Python and hands-on experience with deep learning frameworks (PyTorch preferred)

Hands-on experience fine-tuning large language models and multimodal models using PEFT, LoRA, and similar techniques

Strong understanding of video codecs, video processing pipelines, and streaming technologies

Solid foundation in computer vision and deep learning fundamentals (CNNs, Transformers, attention mechanisms)

Experience with model evaluation frameworks, A/B testing, and continuous experimentation infrastructure

Proficiency with GPU-based training and inference optimization using CUDA or similar frameworks

Excellent problem-solving skills and ability to debug complex ML systems in production

Experience with version control (Git) and MLOps tools (MLflow, Weights & Biases, or similar)

Preferred education and experience:

BE/B.Tech in Computer Science, Electrical Engineering, AI, or a related technical field with 7 to 9 Yrs of experience MS or PhD in ML/AI a plus

Apply →