AI Evaluations Lead, Global Manual Evaluations

Uber · Hyderabad, Telangāna, India

**About the Role** The AI Evaluations Team Lead II is responsible for leading the successful delivery of AI evaluation programs, ensuring high-quality execution, operational excellence, and alignment across multiple workstreams. This role oversees a team of AI Evaluations Specialists and SMEs, managing capacity, prioritisation, and execution to ensure evaluation outputs are delivered accurately, consistently, and at scale. As the primary operational lead for the program, the Team Lead partners closely with program teams to align on priorities, manage delivery commitments, escalate risks, and ensure evaluation insights translate into measurable improvements across AI-powered support experiences. Their impact extends beyond team performance to driving program effectiveness, scalability, and business insights. **What the Candidate Will Do** - Lead and develop a team of AI Evaluations Specialists and SMEs, fostering a high-performance culture focused on quality, accountability, and continuous improvement. - Own delivery outcomes across evaluation programs, ensuring work is prioritized, executed, and completed against agreed timelines, quality standards, and stakeholder expectations. - Manage workforce planning, capacity allocation, and workload prioritization across multiple evaluation workstreams and business priorities. - Partner with Program teams to align on upcoming initiatives, sprint planning, evaluation requirements, and delivery commitments. - Act as the primary escalation point for operational risks, delivery blockers, resource constraints, and cross-functional dependencies. - Ensure evaluation findings, insights, and recommendations are effectively communicated to stakeholders and translated into actionable improvement opportunities. - Drive operational governance across evaluation programs, including performance reviews, delivery tracking, quality oversight, and risk management. - Monitor program health and performance metrics, identifying trends, gaps, and opportunities to improve efficiency, quality, and business impact. - Coordinate bug identification, issue escalation, and follow-through with Product, Engineering, and Triage teams to support timely resolution and validation. - Support the continuous improvement of evaluation methodologies, workflows, quality frameworks, and operational processes. - Lead hiring, onboarding, coaching, and performance management activities to build team capability and support organizational growth. - Represent the evaluations function in cross-functional forums, ensuring stakeholder alignment on priorities, risks, dependencies, and outcomes. **Basic Qualifications** - Demonstrated experience leading teams responsible for delivering operational, quality, analytics, support, risk, trust & safety, or similar programs in complex environments. - Proven ability to manage capacity planning, workload prioritization, and resource allocation across multiple concurrent workstreams. - Strong stakeholder management skills, with experience partnering effectively with Product, Engineering, Operations, Policy, Quality, or equivalent cross-functional teams. - Experience driving operational delivery against defined goals, timelines, service levels, or business outcomes. - Strong program and project management capabilities, including risk identification, dependency management, escalation handling, and execution tracking. - Demonstrated ability to translate business priorities into clear operational plans and execution strategies. - Strong analytical and problem-solving skills, with the ability to assess operational challenges, identify solutions, and make data-informed decisions. - Experience managing performance, coaching team members, and developing talent within high-performing teams. - Excellent communication skills, including the ability to influence stakeholders, align priorities, and communicate complex topics clearly across technical and non-technical audiences. - Experience operating in fast-paced, ambiguous environments where priorities, products, and processes evolve rapidly. **Preferred Qualifications** - Experience working with AI-powered products, AI quality programs, customer support operations, Trust & Safety, or digital customer experience programs. - Familiarity with AI evaluation methodologies, quality assurance frameworks, policy governance, or root cause analysis practices. - Experience working with Jira, dashboards, workforce planning tools, and operational reporting systems. - Understanding of common GenAI concepts and failure modes, including hallucinations, retrieval failures, grounding issues, and instruction-following errors. - Experience supporting global or multi-regional programs involving multiple stakeholders and operational dependencies. Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuelds progress. What moves us, moves the world - let’s move it forward, together. Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role. \*Accommodations may be available based on religious and/or medical conditions, or as required by applicable law. To request an accommodation, please reach out to [[email protected]](mailto:[email protected]).

Operations pay context

Based on 4,455 disclosed Operations salaries on RoleSuite, the role pays a median of $110K/year, with most offers between $83K and $145K (10th–90th percentile: $66K–$184K).

See the full Operations salary breakdown →

Apply →

AI Evaluations Lead, Global Manual Evaluations

Operations pay context

Other roles at Uber

More Operations roles