DevJobs
RoleSuite
CompaniesRemoteAboutMethodologyContactPrivacy
Updated 2026-06-15 11:00 UTC·© 2025–2026 RoleSuite
← Back to listings

Software Engineer - Hosted Model Infrastructure

Palantir · Palo Alto, CA

A World-Changing Company
 
Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more.

The Role

We are a software engineering team with expertise in enabling ML models in production. We deploy AI models to run in variety of environments: air-gapped government networks, forward-deployed defense environments, edge nodes, and enterprises with strict data sovereignty requirements. Our customers rely on us for frontier AI capabilities running on hardware they control, often with constrained GPU resources and limited direct access. Rising to that challenge and meeting those expectations is what Palantir's excels at.

We treat models like any other software: continuously tested, continually delivered, packaged for reproducible deployment, and built for long-term maintainability. You will own services end-to-end, and work across the full stack, from inference engines, GPU scheduling to deployment pipelines, observability, and integration with Palantir's platform. The goal is to deliver new models and capabilities quickly and continuously.

Join us if you want to solve problems at the intersection of infrastructure and machine learning that directly enable critical customers.

Technologies We Use

  • Different backend languages, including Java, Rust, Python and Go
  • Model serving engines for GPU-accelerated inference
  • Docker and Kubernetes for containerization and orchestration
  • Industry-standard build tooling, including Gradle and GitHub
  • Core Responsibilities

  • Building high-performance model serving infrastructure that integrates with security models, hardware constraints, and different inference engines
  • Designing intelligent request handling including authentication, rate limiting, concurrency control, and audit logging for multi-tenant model access
  • Building and maintaining packaging and deployment pipelines enabling fast, secure, and reliable model rollouts across on-premises and air-gapped environments
  • Developing observability for production AI systems to enable easy service monitoring and fast incident triage and resolution
  • Debugging complex issues and performance problems throughout the stack, including open source inference engines, container runtimes, and GPU drivers, in environments you cannot always access directly
  • Designing and running testing and benchmarking infrastructure that validates model deployments across varying GPU hardware before they reach production
  • Working with product teams and customers to understand requirements, debug production issues, and deliver the models and capabilities they need
  • Integrating hosted model infrastructure with Palantir's deployment, configuration, and identity systems
  • What We Value

  • Ownership mindset and bias toward quality. Our software runs in environments where direct access for debugging is limited or unavailable.
  • High empathy for customer needs and drive to deliver reliable, easy-to-use models
  • Ability to work effectively across multiple languages and layers of the stack, from backend services and ML tooling to container orchestration and deployment configuration
  • Strong debugging skills and motivation to trace problems from application code through containers, orchestration, and hardware
  • Curiosity about emerging AI capabilities and the ability to quickly evaluate and integrate new models and technologies as the landscape evolves
  • Active US Security clearance, or eligibility and willingness to obtain a US Security clearance is beneficial, but not necessary
  • What We Require

  • 4+ years of professional software engineering experience building and operating production systems
  • Engineering background in Computer Science, Mathematics, Software Engineering, Physics, or similar field
  • Strong coding skills with demonstrated proficiency in programming languages, such as Java, C++, Python, Rust, or similar languages. Familiarity with the Python ML ecosystem is valuable.
  • Experience with containers, Kubernetes, and deploying backend services in production environments
  • Strong written and verbal communication skills and ability to iterate quickly with teammates, incorporating feedback and holding a high bar for quality
  • Software pay context

    Based on 7,134 disclosed Software salaries on RoleSuite, the role pays a median of $157K/year, with most offers between $123K and $199K (10th–90th percentile: $101K–$235K).

    This posting lists $145K–$200K, in line with the $157K market median.

    See the full Software salary breakdown →
    Apply →

    Other roles at Palantir

    • Software Engineer - Defense ApplicationsNew York, NY
    • Software Engineer - Core InterfacesNew York, NY
    • Operations Analyst - US Government SecurityNew York, NY
    • Commercial Administrative Business PartnerMiami, FL
    • Commercial Administrative Business PartnerWashington, D.C.
    • Commercial Administrative Business PartnerNew York, NY
    • Talent SourcerLondon, United Kingdom
    • Software Engineer - Hosted Model InfrastructureWashington, D.C.
    • Software Engineer - Hosted Model InfrastructureNew York, NY
    • Forward Deployed Software Engineer, Internship - AUS GovernmentSydney, Australia

    More Software roles

    • Head of Customer Engineering, SLED Midwest, Public SectorGoogle · Chicago, IL, USA
    • Engineering Analyst, Cloud AI AbuseGoogle · Seattle, WA, USA
    • Software Engineering Intern, Summer 2027Google · Bengaluru, Karnataka, India
    • Staff Software Engineer, Knowledge Catalog, AIGoogle · Sunnyvale, CA, USA
    • Software Engineer (Distributed Systems, Java)Apple · London
    • Software Engineer III, Technical InfrastructureGoogle · Sunnyvale, CA, USA
    • Software Engineer II, Compute Infrastructure and Spatial FlexibilityGoogle · Warsaw, Poland
    • Computer Vision Software Engineer, Calibration and Spatial SensingGoogle · Zürich, Switzerland
    • Software Engineer - Apple TV AppApple · Seattle
    • Senior Software Engineer, Growth PlatformsLyft · New York, NY