Senior Machine Learning Engineer

Rubrik · Palo Alto, CA

About the Team & Role:

We're building SAGE, Rubrik's Semantic AI Governance Engine, which is the first system designed to monitor, govern, and remediate autonomous AI agents in real time. SAGE powers Rubrik Agent Cloud: enterprises define governance policies in natural language, and SAGE's custom small language models act as judges on every agent action. These models are fast enough to sit in the live request path and accurate enough that customers trust them with allow/block decisions on production traffic.

At its core, SAGE is "LLM-as-judge" applied to AI governance, utilizing the same technique most teams use for offline evaluation but productionized for real-time enforcement at enterprise scale. Our first-generation SLM Policy Guard already outperforms the larger frontier models we've benchmarked against on accuracy while running approximately 5x faster on the same workload. We're hiring to push that lead even further.

As an Applied ML Engineer on the SAGE team, you'll work end-to-end across the model lifecycle: curating data, training small models, serving them at production latency, and closing the feedback loop with real customer signals. The models you build don't just enforce policies in the live request path; they will also drive Agent Rewind, Rubrik's capability to instantly and precisely undo destructive autonomous-agent actions and restore the affected data to a trusted state.

We're a collaborative, applied team that ships models to enterprise customers within weeks, and we're passionate about proving that small, specialized models can outperform frontier LLMs at the problems that matter most for AI safety and governance.

Nature of the Specialized Duties

➢ Training, Fine-Tuning, and Distilling Production Small Language Models and Classifiers (25% of time)

Owning the full training lifecycle for the SLMs and classifiers in SAGE's real-time enforcement path, including base-model selection, supervised fine-tuning, preference optimization (DPO/RLAIF), and distillation from frontier teacher models.
Training anomaly and action-severity models that catch novel agent-side attack patterns at real-time decision latency, such as supply-chain compromises or emergent destructive behaviors not covered by any explicit policy. Severity scores route the highest-impact events to Agent Rewind for precise remediation.
Designing adversarial training pipelines like purpose-built adversarial agents and automated red-teams whose outputs feed directly into the next training run, turning every discovered weakness into a permanent model improvement.
Pushing the pareto frontier of accuracy, latency, and cost for governance-specific tasks through deliberate post-training choices (LoRA, quantization-aware training, distillation recipes, GRPO, etc.) and validating the wins on production traffic patterns.

➢ Engineering High-Performance Model Serving and Inference Infrastructure (25% of time)

Designing multi-stage inference pipelines that handle both real-time enforcement (inline prompt, response, and tool-call blocking) and high-throughput batch workloads (offline scoring, back-testing, corpus mining) while processing billions of tokens daily across Global 2000 customer agent fleets.
Optimizing live deployments through shared GPU pools, KV-cache-aware routing, continuous batching, FP8/INT8 quantization, and speculative decoding to minimize inference cost while holding sub-second P99 SLOs.
Building serving-layer infrastructure that lets SAGE block agent prompts, responses, and tool calls in real time without becoming a latency bottleneck. This includes model gateway design, request routing, and graceful degradation.
Owning canary, shadow, and A/B traffic patterns so new model variants are validated against live customer traffic before they take enforcement decisions.

➢ Building Synthetic Data Pipelines and Online + Offline Evaluation Frameworks (20% of time)

Designing automated data curation pipelines that mine live customer environments (with privacy and tenancy guarantees) for high-value per tenant training examples, such as long-tail violations, near-miss policy edges, or novel agent behaviors, and routing them back into the training loop for each customer.
Building automated policy back-testing by replaying historical agent traffic against new model and policy versions to catch regressions and recommend policy improvements before customer-visible deployment.
Building online evaluation systems for live model decisions, including shadow scoring, drift detection, calibration monitoring, and policy-coverage gap analysis, ensuring quality regressions surface in minutes rather than weeks.
Generating synthetic data using frontier teachers (adversarial prompts, policy-edge cases, multi-turn interactions) with evaluation that confirms synthetic data improves downstream quality, not just dataset size.

➢ Insights Mining, Failure Diagnosis, and Adaptive Model Improvement (15% of time)

Building memory and context harnesses that fuse data sensitivity, identity, and historical agent behavior into real-time enforcement decisions to ensure SAGE reasons from each customer's specific context.
Mining agent insights across millions of sessions to surface security gaps, which are then turned into new policy proposals, refinements to existing policies, and signals about upstream issues across the agent ecosystem (Google ADK, Azure AI Foundry, Vertex AI, and others).
Building feedback loops that turn production decisions, customer-flagged false positives, and missed violations into one-click natural-language policy refinements to drive false-positive rates down without sacrificing recall.
Diagnosing model failures end-to-end and distinguishing data, training-recipe, architecture, and serving-layer root causes so fixes land in the right layer the first time.

➢ Cross-Functional Collaboration and Translating Customer Reality into Modeling Problems (15% of time)

Providing technical leadership on a pillar of the SAGE model stack (training infrastructure, eval methodology, serving architecture, or insights pipeline), mentoring engineers ramping into ML, and shaping the team's technical roadmap.
Partnering with Product Management, customer-facing teams, and security analysts to translate customer agent-governance requirements into well-scoped modeling problems, and pushing back when ML is the wrong tool.
Communicating model behavior, tradeoffs, and limitations clearly to non-ML stakeholders, such as product managers and enterprise security leaders, so model decisions are made with full context.
Collaborating with Agent Cloud platform, security engineering, and AI research teams to integrate new SLMs into the real-time enforcement path with the right latency, observability, rollback, and tenancy guarantees.

Minimum Requirements for the Position

Education: A Bachelor's degree (or higher) in Computer Science, Machine Learning, Computer Engineering, Statistics, or a closely related technical field is required. Designing production SLM training and serving systems requires a deep theoretical understanding of modern deep learning, optimization, and systems performance.

Specialized Technical Knowledge:

2+ years of professional ML experience with demonstrable end-to-end production ownership; you have taken models from training to serving real customer traffic and stayed accountable for them through post-launch iteration.
Proficiency in Python and PyTorch (or equivalent) for production-grade training and evaluation.
Hands-on experience training, fine-tuning, or distilling language models or classifiers in a production setting, including SFT and at least one preference-optimization technique (DPO, RLAIF, or RLHF).
Production experience with serving frameworks (vLLM, SGLang, TensorRT-LLM, or equivalent), including optimization involving continuous batching, KV-cache strategy, and inference-time quantization.
Experience designing closed-loop ML systems, including the eval, telemetry, data-curation, and synthetic-data infrastructure that turns production signals back into training data and the next model release. You have built (not just used) at least one such loop.
Comfort operating at production scale, including debugging models that handle high QPS in safety-critical request paths where errors have customer-visible consequences.

Preferred Qualifications:

Deep background in AI safety and red-teaming, including hands-on experience with adversarial ML, prompt injection defense strategies, and automated evaluation suites for enterprise-grade LLM safety.
Expertise in model evaluation methodology, specifically building "LLM-as-judge" pipelines, calibration monitoring, and adversarial benchmarks that surface the subtle failure modes static metrics often overlook.
Experience with context-fusion and retrieval systems that synthesize disparate signals - such as data sensitivity, user identity, and behavioral history - into high-fidelity model decisions.
Production experience with low-latency inference for streaming or safety-critical request paths where model throughput and P99 SLOs are paramount.
Mastery of label-efficient training and data mining, utilizing weak supervision, active learning, and embedding-based retrieval to surface the production examples that drive the most significant quality improvements.
Hands-on knowledge distillation experience, successfully transferring capabilities from frontier teacher models to specialized, small-scale student models for production serving.
Familiarity with the agentic ecosystem, including tool-use frameworks, model gateway architectures (MCP, LiteLLM, or equivalent), and autonomous agent patterns.
Active open-source contributions to mainstream ML training, serving, or evaluation libraries.

The minimum and maximum base salaries for this role are posted below; additionally, the role is eligible for bonus potential, equity and benefits. The range displayed reflects the minimum and maximum target for new hire salaries for the role based on U.S. location. Within the range, the salary offered will be determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

US Pay Range

$188,500—$282,700 USD

Join Us in Securing and Accelerating the World's AI Transformation

Rubrik (RBRK), the Security and AI Operations Company, leads at the intersection of data protection, cyber resilience, and enterprise AI acceleration. Rubrik Security Cloud delivers complete cyber resilience by securing, monitoring, and recovering data, identities, and workloads across clouds. Rubrik Agent Cloud accelerates trusted AI agent deployments at scale by monitoring and auditing agentic actions, enforcing real-time guardrails, fine-tuning for accuracy and undoing agentic mistakes.

Linkedin | X (formerly Twitter) | Instagram | Rubrik.com

Inclusion @ Rubrik

At Rubrik, we are dedicated to fostering a culture where people from all backgrounds are valued, feel they belong, and believe they can succeed. Our commitment to inclusion is at the heart of our mission to secure the world’s data.

Our goal is to hire and promote the best talent, regardless of background. We continually review our hiring practices to ensure fairness and strive to create an environment where every employee has equal access to opportunities for growth and excellence. We believe in empowering everyone to bring their authentic selves to work and achieve their fullest potential.

Our inclusion strategy focuses on three core areas of our business and culture:

Our Company: We are committed to building a merit-based organization that offers equal access to growth and success for all employees globally. Your potential is limitless here.
Our Culture: We strive to create an inclusive atmosphere where individuals from all backgrounds feel a strong sense of belonging, can thrive, and do their best work. Your contributions help us innovate and break boundaries.
Our Communities: We are dedicated to expanding our engagement with the communities we operate in, creating opportunities for underrepresented talent and driving greater innovation for our clients. Your impact extends beyond Rubrik, contributing to safer and stronger communities.

Equal Opportunity Employer/Veterans/Disabled

Rubrik is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.

Rubrik provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, Rubrik complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

Federal law requires employers to provide reasonable accommodation to qualified individuals with disabilities. Please contact us at [email protected] if you require a reasonable accommodation to apply for a job or to perform your job. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.

EEO IS THE LAW

NOTIFICATION OF EMPLOYEE RIGHTS UNDER FEDERAL LABOR LAWS

AI Engineering pay context

Based on 607 disclosed AI Engineering salaries on RoleSuite, the role pays a median of $203K/year, with most offers between $165K and $243K (10th–90th percentile: $132K–$285K).

This posting lists $189K–$283K, above the $203K market median.

See the full AI Engineering salary breakdown →

Apply →