NVIDIA Corporation
DL Performance Software Engineer - LLM Inference (Finance)
As a member of the LLM inference team, you will help build innovative software that makes LLM inference more efficient, scalable, and accessible. Are you interested in architecting and implementing the best inference stacks in the LLM world? You will collaborate with a diverse set of teams working on resource orchestration, distributed systems, inference engine optimization, and high-performance GPU kernels. Come join our team and contribute to pioneering accelerated computing and AI.
What you'll be doing:
What we need to see:
Ways to stand out from the crowd:
We strongly encourage you to include sample projects (e.g. GitHub) that demonstrate the qualifications above.
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. We are building many of the most important AI technologies and infrastructure around the world. Are you passionate about AI systems, efficiency, and performance? Join us to push the frontier of accelerated computing together!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 120,000 USD - 189,750 USD for Level 2, and 148,000 USD - 235,750 USD for Level 3.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until September 28, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.