DevJobs
RoleSuite
CompaniesRemoteAboutMethodologyContactPrivacy
Updated 2026-06-15 10:00 UTC·© 2025–2026 RoleSuite
← Back to listings

ML Software Engineer, Data Plane

Amazon · Tel Aviv-Yafo, Tel Aviv, ISR

The MLIL DataPlane team is looking for a Software Development Engineer to own the design and implementation of our inference data plane. We build the software that makes large models run efficiently on custom hardware - spanning model execution, memory management, data movement, and serving integration.
Our work covers the full inference path: integrating serving engines with custom hardware, developing high-performance compute kernels, enabling efficient data movement, and driving models from early validation through production. We operate at frontier scale with large distributed models.
This is a ground-up effort with rapidly evolving hardware and software. We are looking for an IC who can write and optimize low-level code for custom hardware, validate model architectures end-to-end, build test and profiling infrastructure, and drive performance across the stack.

Key job responsibilities
- Develop and optimize compute kernels for a custom ML accelerator architecture, targeting production-level performance for large language model inference.
- Implement and validate LLM architectures (decoder-only, mixture-of-experts) end-to-end - from PyTorch model definition through distributed execution on custom hardware.
- Integrate custom accelerator backends into open-source ML serving frameworks (vLLM, PyTorch), including scheduler extensions, memory management, and model parallelism.
- Build and maintain test infrastructure for model correctness validation across CPU, GPU, simulator, and hardware targets.
- Profile and optimize inference workloads - identify bottlenecks, instrument critical paths, and drive latency and throughput improvements from simulation through hardware bringup.
- Own features end-to-end: from design through implementation, testing, and integration into the broader software stack.
- Contribute to CI/CD pipelines that gate model and kernel changes on correctness and performance regressions.- Bachelor's degree or equivalent
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Knowledge of computer architecture, operating systems, and parallel computing- Knowledge of Machine Learning and LLM fundamentals, including transformer architecture, training/inference lifecycles, and optimization techniques
- Knowledge of ML frameworks including JAX, PyTorch, vLLM, SGLang, Dynamo, TorchXLA, and TensorRT
- Experience in developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Software pay context

Based on 7,123 disclosed Software salaries on RoleSuite, the role pays a median of $157K/year, with most offers between $123K and $199K (10th–90th percentile: $101K–$235K).

See the full Software salary breakdown →
Apply →

Other roles at Amazon

  • Sr Manager, Software, Payload SofwareSunnyvale, California, USA
  • Software Development Engineer, Content and Channel Tech TeamBeijing, CHN
  • Sr. Business Development Mgr, Amazon Japan PrimeTokyo, JPN
  • Software Development Engineer, Japan Seller Services TechTokyo, JPN
  • Software Development Engineer, Japan Seller Services TechTokyo, JPN
  • Software Development Engineer, Japan Seller Services TechTokyo, JPN
  • Product Manager-Tech III, Finance AutomationBengaluru, Karnataka, IND
  • Software Development Manager, Payables TechHyderabad, Telangana, IND
  • System Development Engineer II, GREF Tech, Finance AutomationHyderabad, Telangana, IND
  • Data Engineering Manager for Accounts Payables Technology, FinAutoHyderabad, Telangana, IND

More Software roles

  • Head of Customer Engineering, SLED Midwest, Public SectorGoogle · Chicago, IL, USA
  • Engineering Analyst, Cloud AI AbuseGoogle · Seattle, WA, USA
  • Software Engineering Intern, Summer 2027Google · Bengaluru, Karnataka, India
  • Staff Software Engineer, Knowledge Catalog, AIGoogle · Sunnyvale, CA, USA
  • Software Engineer (Distributed Systems, Java)Apple · London
  • Software Engineer III, Technical InfrastructureGoogle · Sunnyvale, CA, USA
  • Software Engineer II, Compute Infrastructure and Spatial FlexibilityGoogle · Warsaw, Poland
  • Computer Vision Software Engineer, Calibration and Spatial SensingGoogle · Zürich, Switzerland
  • Software Engineer - Apple TV AppApple · Seattle
  • Senior Software Engineer, Growth PlatformsLyft · New York, NY