HPC Engineer

Sandisk · Bengaluru, KA, India

Sandisk understands how people and businesses consume data and we relentlessly innovate to deliver solutions that enable today’s needs and tomorrow’s next big ideas. With a rich history of groundbreaking innovations in Flash and advanced memory technologies, our solutions have become the beating heart of the digital world we’re living in and that we have the power to shape.

Sandisk meets people and businesses at the intersection of their aspirations and the moment, enabling them to keep moving and pushing possibility forward. We do this through the balance of our powerhouse manufacturing capabilities and our industry-leading portfolio of products that are recognized globally for innovation, performance and quality.

Sandisk has two facilities recognized by the World Economic Forum as part of the Global Lighthouse Network for advanced 4IR innovations. These facilities were also recognized as Sustainability Lighthouses for breakthroughs in efficient operations. With our global reach, we ensure the global supply chain has access to the Flash memory it needs to keep our world moving forward.

Role Overview

Experienced Senior HPC Engineer / Architect specializing in Linux-based high-performance computing (HPC) environmentsEDA workflows, and automation-driven infrastructure. Proven expertise in designing, managing, and optimising large-scale distributed HPC clusters supporting ASIC EDA workloads.

Key Responsibilities

  • Architect, deploy, and manage large-scale distributed HPC environments across global locations, supporting ASIC and GPU compute clusters 
  • Design and implement infrastructure automation using Ansible, Shell, and Python for system lifecycle management
  • Administer and optimize workload schedulers (LSF, Slurm, NC) including queue configuration, fair-share policies, and job prioritization
  • Perform deep troubleshooting and root cause analysis across compute, storage, networking, and scheduler layers
  • Collaborate with engineering teams to improve EDA workload performance and efficiency in global HPC environments 
  • Develop and deploy self-service automation solutions to reduce manual effort and improve system reliability
  • Manage and support EDA ecosystem including tool deployment (Cadence, Synopsys), licensing, and workflow optimization
  • Implement monitoring & observability frameworks using tools like Splunk, Grafana for proactive issue detection
  • Drive capacity planning, performance tuning, and resource optimization for HPC workloads
  • Create and maintain technical documentation, runbooks, and operational standards
  • Provide technical leadership and mentoring, influencing HPC architecture and long-term strategy

Techncal Skills

HPC & Scheduling: LSF, Slurm, Network Computer (NC), Grid/Batch scheduling

Operating Systems: RedHat Enterprise Linux (RHEL), CentOS

Automation & Scripting: Ansible, Shell/Bash, Python

EDA Tools: Cadence, Synopsys, EDA workflows & design environments

Monitoring & Observability: Splunk, Grafana, Prometheus

Storage & Filesystems: NFS, AutoFS, distributed storage systems

Authentication & Access: UNIX/Linux integrated with Active Directory

Infrastructure: On-premises & Hybrid HPC environments

Remote Access & VDI: Exceed TurboX, VNC, nomachine

Preferred Skills

  • Extensive experience with job schedulers such as LSF, Slurm, or equivalent platforms
  • Experience supporting EDA / semiconductor design environments
  • Exposure to GPU computing and accelerator-based workloads
  • Knowledge of EDA licensing systems and optimization
  • Experience with Infrastructure as Code (IaC) and platform standardization
  • Familiarity with cloud or hybrid HPC architectures (AWS/Azure HPC)
  • Bachelor’s degree in Computer Science, Engineering, or equivalent experience 
  • 8+ years of experience in Linux system administration (RHEL/CentOS)
  • Strong expertise in HPC cluster management and workload schedulers (LSF/Slurm)
  • Proven experience in automation and scripting (Ansible, Shell, Python and AI integration)
  • Hands-on experience managing large-scale HPC or EDA environments
  • Strong skills in performance tuning, capacity planning, and workload optimization
  • Excellent troubleshooting and problem-solving skills in complex production environments
  • Ability to lead projects end-to-end and work with cross-functional teams 

Sandisk thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution.

Sandisk is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at [email protected] to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

Software pay context

Based on 7,522 disclosed Software salaries on RoleSuite, the role pays a median of $155K/year, with most offers between $123K and $196K (10th–90th percentile: $101K–$232K).

See the full Software salary breakdown →
Apply →