Intel Software

Using Intel® NPUs to Prototype and Deploy Large Language Models

Thursday, June 27, 2024, from 9:00am to 11:00am PT

Time zone converter

Intel NPUs meet the challenge of delivering local AI PC capabilities.

This workshop demystifies Intel NPUs providing examples with large language models (LLMs) and case studies. The fundamental architecture of NPUs is explained, and the capabilities of the technology revealed, offering a clear picture of the role of neural processors in an AI system and the acceleration benefits.

Real-world examples show how AI applications integrate LLMs with Intel NPUs, including Chatbot, Retrieval Augmented Generation (RAG), Stable Diffusion, and Speech 2 Text. Developers gain insights into how this technology improves performance and efficiency, allowing AI operations to be performed effectively on PCs rather than the cloud.

Areas covered in the workshop include:

  • Understanding the fundamentals of Intel NPUs and the acceleration offered by Intel® Core™ Ultra processors
  • Learning the significance of LLMs
  • Creating rapid prototypes with the Intel NPU Acceleration Library
  • Building LLM system partitions and applications
  • Leveraging OpenVINO and the NPU plugin for enhanced performance

Hands-on demonstrations of these techniques require an Intel® Tiber™ Developer Cloud account. If you don’t have one, get one here.

Novice developers as well as experienced professionals interested in NPUs will benefit from this workshop.

Error: Please enter a first name.
Error: First name must be at least 2 characters long.
Error: First name must be less than 250 characters long.
Error: Please enter a first name.
Error: Please enter a last name.
Error: Last name must be at least 2 characters long.
Error: Last name must be less than 250 characters long.
Error: Please enter a last name.
Error: Please enter an email address.
Error: Please enter a valid email address.
Error: Email Address must be less than 250 characters.
Error: Please select a country/region.
Your registration cannot proceed. The materials on this site are subject to U.S. and other applicable export control laws and are not accessible from all locations.
Error: Please select a profession.
Error: Please enter a business phone.
Error: Business phone must be at least 2 characters long.
Error: Business phone must be less than 250 characters long.
Error: Please enter a business phone.
Error: Please enter a valid business phone.
Error: Please enter a company name.
Error: Company name must be at least 2 characters long.
Error: Company name must be less than 250 characters long.
Error: Please enter a company name.
Error: Please select a developer role.
Error: Please select a development language.
Error: Please select at least one option.
Error: Please select an industry.
Error: Please select at least one operating system.

Intel strives to provide you with a great, personalized experience, and your data helps us to accomplish this.

Error: Above consent required for submission.
Error: Above consent required for submission.

By submitting this form, you are confirming you are age 18 years or older. Intel may contact you for marketing-related communications. You can opt out at any time. To learn more about Intel's practices, including how to manage your preferences and settings, visit Intel's Privacy Notice.

By submitting this form, you are confirming you are age 18 years or older. Intel will process your Personal Data for the purpose of this business request. To learn more about Intel's practices, including how to manage your preferences and settings, visit Intel's Privacy Notice.

By submitting this form, you are confirming you are age 18 years or older. Intel may contact you for marketing-related communications. You can opt out at any time. To learn more about Intel's practices, including how to manage your preferences and settings, visit Intel's Privacy Notice.

Speakers

Alessandro Palla

Presenter

Machine Learning Engineer, Intel

Soumendu Ghosh

Q&A

Deep Learning R&D Architect, Intel