Acceleration Kernel Developer Intern
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
As an Acceleration Kernel Developer Intern at Tenstorrent, you will help optimize low-level workloads and develop kernels for machine learning applications. You will work closely with engineers across software and machine learning to improve performance, reliability, and efficiency across the stack.
This role is a 6-month full-time, on-site internship based in our Gdańsk, Poland office.
Who You Are
- A student pursuing Computer Science, Software Engineering, Computer Engineering, Engineering Science, or a related field (math, physics) with software development experience. .
- Strong in low-level algorithm implementation and optimization, with experience in parallel acceleration on CPUs or GPUs.
- Proficient in C/C++ and Python, with familiarity with machine learning frameworks and profiling or debugging tools.
- Self-driven, detail-oriented, collaborative, and comfortable solving complex technical problems.
- Develop and optimize kernels and kernel libraries for machine learning and HPC workloads.
- Improve tensor compute and data movement performance through low-level optimization.
- Profile bottlenecks, debug issues, and build tests that improve software stability and reliability.
- Build infrastructure and documentation that help the team analyze test data and scale development.
- How low-level kernel development impacts machine learning performance on Tenstorrent systems.
- How tensor compute and data movement optimizations are designed, tested, and benchmarked.
- How optimized kernels integrate into machine learning frameworks and software pipelines.
- How software, machine learning, and infrastructure teams work together on real performance challenges.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
Software pay context
Based on 7,287 disclosed Software salaries on RoleSuite, the role pays a median of $158K/year, with most offers between $123K and $199K (10th–90th percentile: $101K–$236K).
See the full Software salary breakdown →