AIEngJobs
RoleSuite
CompaniesRemoteAboutMethodologyContactPrivacy
Updated 2026-06-10 19:00 UTC·© 2025–2026 RoleSuite
← Back to listings

Deep Learning Performance Architect

NVIDIA · China, Shanghai

NVIDIA is developing processor and system architectures that accelerate deep learning on edge devices, workstations, and data center GPUs for a variety of applications including automotive, robotics, large language models and AI generative models. We are looking for an expert deep learning system performance architect to join our deep learning modelling, performance optimization, projections, and analysis effort. In this position, you will have the chance to optimize deep learning hardware and software architecture and make the significant impact in a dynamic technology focused company

What you’ll be doing:

  • Benchmark and analyze performance of various machine learning/deep learning workloads across GPU- and NPU-based architectures

  • Build and validate performance models, and deliver performance projections and insights for deep learning (LLM/GenAI) workloads on emerging architectures

  • Identify architecture, software and system performance bottlenecks and propose actionable optimizations

  • Explore and evaluate new software/hardware capabilities and translate them into measureable application gains

  • Leverage AI agents to accelerate performance investigation and engineering workflows

What we need to see:

  • BSc. MS or PhD in relevant discipline (CS, EE, Math, etc.,)

  • 3+ years of working experience in relevant directions will be a plus

  • Familiar with GPU or Accelerator-based deep learning platform and software stack

  • A strong background in computer architecture

  • Familiar with LLM or generative AI deep learning algorithms and kernel optimizations

  • Experience in system architecture design and performance optimization

  • Familiar with machine learning and deep learning frameworks

  • Hands-on experience using AI agents to assist daily engineering work

AI Engineering pay context

Based on 639 disclosed AI Engineering salaries on RoleSuite, the role pays a median of $201K/year, with most offers between $162K and $246K (10th–90th percentile: $131K–$286K).

See the full AI Engineering salary breakdown →
Apply →

Other roles at NVIDIA

  • Interconnect NPI Product EngineerIsrael, Yokneam
  • ASIC Verification EngineerIndia, Hyderabad
  • ASIC Verification EngineerIndia, Hyderabad
  • Human Resources GeneralistTaiwan, Taipei
  • Verification Engineer, PCIEIndia, Bengaluru
  • Verification Engineer - PCIEIndia, Bengaluru
  • Senior Scientist, Synthetic Data and PrivacyUS, CA, Santa Clara
  • Senior Scientist, Synthetic Data GenerationUS, CA, Santa Clara
  • Security and Safety Circuit EngineerChina, Shanghai
  • Deep Learning Performance ArchitectChina, Shanghai

More AI Engineering roles

  • Field Sales Representative, AI/ML, Public SectorGoogle · London, UK
  • Senior Staff Machine Learning Engineer, Menu PersonalisationHellofresh · Warszawa, Masovian Voivodeship, Poland
  • Senior Staff Machine Learning Engineer, Menu PersonalisationHellofresh · Toronto, Ontario, Canada
  • Staff Research Engineer, Applied AI, DeepMindGoogle · Mountain View, CA, USA
  • Senior ML Engineer (ML/AI)Jobgether · US
  • Sr/Staff Application & AI Engineer (AI Center of Excellence)Jobgether · US
  • AI EngineerRemoFirst · Egypt / Ukraine / Poland / Slovakia / Slovenia / Romania / South Africa / Tunisia / North Macedonia / Kazakhstan / Uzbekistan / Azerbaijan / Georgia / Armenia / Bulgaria
  • Senior AI/ML EngineerJobgether · US
  • Applied AI EngineerSnowflake · PL-Warsaw
  • Lead Full Stack Machine Learning EngineerCerebras Systems · Bengaluru, Karnataka, India