DevJobs
RoleSuite
CompaniesRemoteAboutMethodologyContactPrivacy
Updated 2026-06-20 05:00 UTC·© 2025–2026 RoleSuite
← Back to listings

AI Computing Software Development Engineer, LLM Inference

NVIDIA · China, Shanghai

We are now looking for a Software Development Engineer to help TensorRT LLM and TensorRT Edge LLM projects! NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and GenerativeAI that has put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which will be used across our product lines! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Performance analysis, optimization and tuning

  • Closely follow academic developments in the field of artificial intelligence and large language models

  • Provide feedback into the architecture and hardware design and development

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

  • Publish key results in scientific conferences

What we need to see:

  • Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)

  • 2+ years of relevant software development experience.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

Software pay context

Based on 7,496 disclosed Software salaries on RoleSuite, the role pays a median of $157K/year, with most offers between $123K and $198K (10th–90th percentile: $101K–$235K).

See the full Software salary breakdown →
Apply →

Other roles at NVIDIA

  • Senior Systems Architect - DatacenterUS, CA, Santa Clara
  • Principal Architect, System Software - Orbital Data CenterUS, CA, Santa Clara
  • Senior Systems Software Engineer, AI Stack and Performance - DGX StationUS, CA, Santa Clara
  • Executive Assistant, EMEA - WWFOUK, Remote
  • Senior Software Engineer - NVLink Rack Scale Stability and ReliabilityUS, CA, Santa Clara
  • Principal Software Engineer - Rack Scale Systems InfrastructureUS, CA, Santa Clara
  • Principal Release Infrastructure ArchitectUS, CA, Santa Clara
  • Engineering Manager, CPU Bootloader Firmware - SBIOSUS, CA, Santa Clara
  • Senior Software Engineer, GoLang - DSX MaxQUS, CA, Santa Clara
  • Senior Manager, Software Development - GPU Accelerated StorageUS, CA, Santa Clara

More Software roles

  • Software Engineer, Inference Platform Cerebras Systems · Sunnyvale, CA
  • Forward Deployed EngineerMachina Labs · Dayton, Ohio
  • Senior ServiceNow DeveloperEncora · Mexico
  • Senior ServiceNow DeveloperEncora · Peru
  • Senior ServiceNow DeveloperEncora · Costa Rica
  • Senior ServiceNow DeveloperEncora · Colombia
  • Senior ServiceNow DeveloperEncora · Brazil
  • Sênior Software Engineer - ElixirStone Co · Remoto
  • System Engineer II - Release Readiness & Metrics (Python)Torc Robotics · Remote - U.S, Ann Arbor, MI, Fort Worth, TX, Blacksburg, VA
  • Staff Software Engineer - Payments GatewayNubank · Brazil, Belo Horizonte; Brazil, Sao Paulo