Tame, Train and Deploy Large Language Models | Kisaco Research

Throughout the AI Hardware & Edge AI Summit, we ALL will be talking about the wonders of large language models and why not! They offer tremendous advantage to enterprises and organizations, offloading work and resources to deliver enhanced value, efficiencies and end-user experiences. During this session, Sree Ganesan, head of software product with Habana, an Intel company, and Vasudev Lal, AI/ML research scientist with Intel Labs, will share their first-hand experiences in training and deploying large language models on Gaudi2 accelerators. They’ll introduce a variety of approaches they’ve taken to tame the LLM process, from training to fine-tuning to inference. To make it all real, we’ll focus on high-value ecosystem partners and share LLM demos that show our latest innovations.  

Sponsor(s): 
Habana
Speaker(s): 

Author:

Sree Ganesan

Head of Software Products
Habana Labs

Sree Ganesan leads Software Product Management at Habana Labs, working alongside a diverse global team to deliver state-of-the-art deep learning capabilities of the Habana SynapseAI® software suite to the market. Previously, she was Engineering Director in Intel’s AI Products Group, where she was responsible for AI software strategy and deep learning framework integration for Nervana NNP AI accelerators.  Ms. Ganesan joined Intel in 2001 and has held a variety of technical and management roles in software engineering, VLSI CAD and SOC design methodology. Ms. Ganesan received a bachelor’s degree in electrical engineering from the Indian Institute of Technology Madras, India and a PhD in computer engineering from the University of Cincinnati, Ohio.

Sree Ganesan

Head of Software Products
Habana Labs

Sree Ganesan leads Software Product Management at Habana Labs, working alongside a diverse global team to deliver state-of-the-art deep learning capabilities of the Habana SynapseAI® software suite to the market. Previously, she was Engineering Director in Intel’s AI Products Group, where she was responsible for AI software strategy and deep learning framework integration for Nervana NNP AI accelerators.  Ms. Ganesan joined Intel in 2001 and has held a variety of technical and management roles in software engineering, VLSI CAD and SOC design methodology. Ms. Ganesan received a bachelor’s degree in electrical engineering from the Indian Institute of Technology Madras, India and a PhD in computer engineering from the University of Cincinnati, Ohio.

Author:

Vasudev Lal

AI/ML Research Scientist
Intel Labs

Vasudev Lal is an AI Research Scientist at Intel Labs where he leads the Multimodal Cognitive AI team. His team develops AI systems that can synthesize concept-level understanding from multiple modalities: vision, language, video and audio. His current research interests include equipping deep learning with mechanisms to inject external knowledge; self-supervised training at scale for continuous and high dimensional modalities like images, video and audio; mechanisms to combine deep learning with symbolic compute.  Prior to joining Intel, Vasudev obtained his PhD in Electrical and Computer Engineering from the University of Michigan, Ann Arbor.

Vasudev Lal

AI/ML Research Scientist
Intel Labs

Vasudev Lal is an AI Research Scientist at Intel Labs where he leads the Multimodal Cognitive AI team. His team develops AI systems that can synthesize concept-level understanding from multiple modalities: vision, language, video and audio. His current research interests include equipping deep learning with mechanisms to inject external knowledge; self-supervised training at scale for continuous and high dimensional modalities like images, video and audio; mechanisms to combine deep learning with symbolic compute.  Prior to joining Intel, Vasudev obtained his PhD in Electrical and Computer Engineering from the University of Michigan, Ann Arbor.