Job Details:
Job Description:
The Intel NPU organization is dedicated to the development of hardware accelerators for the future of AI, enabling unprecedented machine intelligence on billions of edge, desktop, and mobile computers. Doing what's never been done before takes vision, innovation, and the world's best talent. You'll be immersed in a diverse, supportive environment where everyone is encouraged to do their best work. Join the adventure of harnessing the potential of state-of-the-art deep neural networks and the most advanced AI hardware accelerator in the world.
Intel is hiring software engineers for its deep learning compiler team. Our customers around the world are using NPUs to power a revolution in deep learning, enabling breakthroughs in problems ranging from image classification and speech recognition to natural language processing. As deep learning models become more versatile, using compiler technologies to achieve high performance becomes essential.
We are looking for individuals who are interested in building the world's leading deep learning compiler for current and future Intel NPUs. This is a product development position with the end goal being high-quality, high-performance, secure product software. You will start optimizing pre-silicon and have access to hardware shortly after it is powered on.
You will be responsible for analyzing deep learning networks and developing compiler optimization algorithms. You will collaborate with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software. The scope of these efforts includes performance tuning and analysis, developing and implementing compiler and optimization techniques for neural networks, and other general software engineering work.
You will conduct software development and optimizations in the following areas:
Develop MLIR-based compiler technology for deep learning workloads on Intel NPUs.
Develop AI execution middleware based on the Intel OpenVINO toolkit.
Develop large-scale production software with validation and continuous integration in mind.
Collaborate with frameworks teams to develop compiler optimizations for the deep learning domain.
Collaborate and coordinate, internally and externally, with geographically distributed teams working on the same project, such as the execution runtime software, NPU hardware, infrastructure, and front-end teams.