| Page 861 | Kisaco Research

Pre-training Foundation Models is prohibitively expensive and therefore impossible for many companies. This is especially true if the models are Large Language Models (LLMs). However, people hope that Foundation Models will live up to the promise of learning more generally than classical Artificial Intelligence (AI) models. The dream is that if you provide just a few examples to Foundation Models, they could extrapolate the high-level, abstract representation of the problem and learn how to accomplish tasks that they have never been trained to execute before. So, the question is, how can you lower the cost of fine-tuning pre-trained Foundation Models for your needs? This is what we will discuss in this panel. We make available to you our personal experience, synthetized in a set of principles, so that you can discover how we found ways to lower the cost of fine-tuning pre-trained Foundational Models across multiple domains. 

Moderator

Author:

Fausto Artico

Head of Innovation and Data Science
GSK

Fausto has two PhDs (Information & Computer Science respectively), earning his second master’s and PhD at the University of California, Irvine. Fausto also holds multiple certifications from MIT, Columbia University, London School of Economics and Political Science, Kellogg School of Management, University of Cambridge and soon also from the University of California, Berkeley. He has worked in multi-disciplinary teams and has over 20 years of experience in academia and industry.

As a Physicist, Mathematician, Engineer, Computer Scientist, and High-Performance Computing (HPC) and Data Science expert, Fausto has worked on key projects at European and American government institutions and with key individuals, like Nobel Prize winner Michael J. Prather. After his time at NVIDIA corporation in Silicon Valley, Fausto worked at the IBM T J Watson Center in New York on Exascale Supercomputing Systems for the US government (e.g., Livermore and Oak Ridge Labs).

Fausto Artico

Head of Innovation and Data Science
GSK

Fausto has two PhDs (Information & Computer Science respectively), earning his second master’s and PhD at the University of California, Irvine. Fausto also holds multiple certifications from MIT, Columbia University, London School of Economics and Political Science, Kellogg School of Management, University of Cambridge and soon also from the University of California, Berkeley. He has worked in multi-disciplinary teams and has over 20 years of experience in academia and industry.

As a Physicist, Mathematician, Engineer, Computer Scientist, and High-Performance Computing (HPC) and Data Science expert, Fausto has worked on key projects at European and American government institutions and with key individuals, like Nobel Prize winner Michael J. Prather. After his time at NVIDIA corporation in Silicon Valley, Fausto worked at the IBM T J Watson Center in New York on Exascale Supercomputing Systems for the US government (e.g., Livermore and Oak Ridge Labs).

Panellists

Author:

Lisa Cohen

Director of Data Science for Gemini, Google Assistant, and Search Platforms
Google

Lisa Cohen is Director of Data Science for Gemini (formerly "Bard"), Google Assistant, and Search Platforms. She leads an organization of data scientists at Google, responsible for using data to create excellent user experiences across these products, and partnering closely with Product, Engineering, and User Experience Research. Formerly, Lisa was Head of Data Science and Engineering for Twitter, helping drive the strategy and direction of the Twitter product, through machine learning, metric development, experimentation and causal analyses. Before Twitter, Lisa led the Azure Customer Growth Analytics organization as part of Microsoft Cloud Data sciences. Her team was responsible for analyzing OKRs, informing data-driven decisions, and developing data science models to help customers be successful on Azure. Lisa worked at Microsoft for 17yrs, and also helped develop multiple versions of Visual Studio. She holds Bachelor and Masters degrees from Harvard in Applied Mathematics. You can follow Lisa on LinkedIn and Medium.

Lisa Cohen

Director of Data Science for Gemini, Google Assistant, and Search Platforms
Google

Lisa Cohen is Director of Data Science for Gemini (formerly "Bard"), Google Assistant, and Search Platforms. She leads an organization of data scientists at Google, responsible for using data to create excellent user experiences across these products, and partnering closely with Product, Engineering, and User Experience Research. Formerly, Lisa was Head of Data Science and Engineering for Twitter, helping drive the strategy and direction of the Twitter product, through machine learning, metric development, experimentation and causal analyses. Before Twitter, Lisa led the Azure Customer Growth Analytics organization as part of Microsoft Cloud Data sciences. Her team was responsible for analyzing OKRs, informing data-driven decisions, and developing data science models to help customers be successful on Azure. Lisa worked at Microsoft for 17yrs, and also helped develop multiple versions of Visual Studio. She holds Bachelor and Masters degrees from Harvard in Applied Mathematics. You can follow Lisa on LinkedIn and Medium.

Author:

Jeff Boudier

Product Director
Hugging Face

Jeff Boudier is a product director at Hugging Face, creator of Transformers, the leading open-source NLP library. Previously Jeff was a co-founder of Stupeflix, acquired by GoPro, where he served as director of Product Management, Product Marketing, Business Development and Corporate Development.

Jeff Boudier

Product Director
Hugging Face

Jeff Boudier is a product director at Hugging Face, creator of Transformers, the leading open-source NLP library. Previously Jeff was a co-founder of Stupeflix, acquired by GoPro, where he served as director of Product Management, Product Marketing, Business Development and Corporate Development.

Author:

Helen Byrne

VP, Solution Architect
Graphcore

Helen leads the Solution Architects team at Graphcore, helping innovators build their AI solutions using Graphcore’s Intelligence Processing Units (IPUs). She has been at Graphcore for more than 5 years, previously leading AI Field Engineering and working in AI Research, working on problems in Distributed Machine Learning. Before landing in the technology industry, she worked in Investment Banking. Her background is in Mathematics and she has a MSc in Artificial Intelligence.

Helen Byrne

VP, Solution Architect
Graphcore

Helen leads the Solution Architects team at Graphcore, helping innovators build their AI solutions using Graphcore’s Intelligence Processing Units (IPUs). She has been at Graphcore for more than 5 years, previously leading AI Field Engineering and working in AI Research, working on problems in Distributed Machine Learning. Before landing in the technology industry, she worked in Investment Banking. Her background is in Mathematics and she has a MSc in Artificial Intelligence.

 

(Moderator) Varun Mehta

Executive Director, Head of ESG Data and Technology Product Management
Morgan Stanley

(Moderator) Varun Mehta

Executive Director, Head of ESG Data and Technology Product Management
Morgan Stanley

(Moderator) Varun Mehta

Executive Director, Head of ESG Data and Technology Product Management
Morgan Stanley

Abstract coming soon...

Author:

Wayne Wang

Founder & CEO
Moffett AI

Wayne Wang is the Founder & CEO of Moffett AI, and is an expert in digital-analog hybrid circuits in Silicon Valley with 15 years of experience. His main experience is as a CPU high-speed link architect.

He has several years of experience in semiconductor entrepreneurship in Silicon Valley. He used to be the core architect of Intel and Qualcomm, and participated in the development of five generations of Intel CPU processors, with a cumulative mass production of over 5 billion pieces.

Wayne Wang

Founder & CEO
Moffett AI

Wayne Wang is the Founder & CEO of Moffett AI, and is an expert in digital-analog hybrid circuits in Silicon Valley with 15 years of experience. His main experience is as a CPU high-speed link architect.

He has several years of experience in semiconductor entrepreneurship in Silicon Valley. He used to be the core architect of Intel and Qualcomm, and participated in the development of five generations of Intel CPU processors, with a cumulative mass production of over 5 billion pieces.

 

Wayne Wang

Founder & CEO
Moffett AI

Wayne Wang is the Founder & CEO of Moffett AI, and is an expert in digital-analog hybrid circuits in Silicon Valley with 15 years of experience. His main experience is as a CPU high-speed link architect.

He has several years of experience in semiconductor entrepreneurship in Silicon Valley. He used to be the core architect of Intel and Qualcomm, and participated in the development of five generations of Intel CPU processors, with a cumulative mass production of over 5 billion pieces.

Wayne Wang

Founder & CEO
Moffett AI

Wayne Wang

Founder & CEO
Moffett AI

Wayne Wang is the Founder & CEO of Moffett AI, and is an expert in digital-analog hybrid circuits in Silicon Valley with 15 years of experience. His main experience is as a CPU high-speed link architect.

He has several years of experience in semiconductor entrepreneurship in Silicon Valley. He used to be the core architect of Intel and Qualcomm, and participated in the development of five generations of Intel CPU processors, with a cumulative mass production of over 5 billion pieces.

Abstract coming soon...

Author:

Jia Li

Co-Founder, Chief AI Officer & President
LiveX AI

Jia is Co-founder, Chief AI Officer and President of a Stealth Generative AI Startup. She is elected as IEEE Fellow for Leadership in Large Scale AI. She is co-teaching the inaugural course of Generative AI and Medicine at Stanford University, where she has served multiple roles including Advisory Board Committee to Nourish, Chief AI Fellow, RWE for Sleep Health and Adjunct Professor at the School of Medicine in the past. She was the Founding Head of R&D at Google Cloud AI. At Google, she oversaw the development of the full stack of AI products on Google Cloud to power solutions for diverse industries. With the passion to make more impact to our everyday life, she later became an entrepreneur, building and advising companies with award-winning platforms to solve today's greatest challenges in life. She has served as Mentor and Professor-in-Residence at StartX, advising founders/companies from Stanford/Alumni. She is the Co-founder and Chairperson of HealthUnity Corporation, a 501(c)3 nonprofit organization. She served briefly at Accenture as a part-time Chief AI Follow for the Generative AI strategy. She also serves as an advisor to the United Nations Children's Fund (UNICEF). She is a board member of the Children's Discovery Museum of San Jose. She was selected as a World Economic Forum Young Global Leader, a recognition bestowed on 100 of the world’s most promising business leaders, artists, public servants, technologists, and social entrepreneurs in 2018. Before joining Google, She was the Head of Research at Snap, leading the AI/AR innovation effort. She received her Ph.D. degree from the Computer Science Department at Stanford University.

Jia Li

Co-Founder, Chief AI Officer & President
LiveX AI

Jia is Co-founder, Chief AI Officer and President of a Stealth Generative AI Startup. She is elected as IEEE Fellow for Leadership in Large Scale AI. She is co-teaching the inaugural course of Generative AI and Medicine at Stanford University, where she has served multiple roles including Advisory Board Committee to Nourish, Chief AI Fellow, RWE for Sleep Health and Adjunct Professor at the School of Medicine in the past. She was the Founding Head of R&D at Google Cloud AI. At Google, she oversaw the development of the full stack of AI products on Google Cloud to power solutions for diverse industries. With the passion to make more impact to our everyday life, she later became an entrepreneur, building and advising companies with award-winning platforms to solve today's greatest challenges in life. She has served as Mentor and Professor-in-Residence at StartX, advising founders/companies from Stanford/Alumni. She is the Co-founder and Chairperson of HealthUnity Corporation, a 501(c)3 nonprofit organization. She served briefly at Accenture as a part-time Chief AI Follow for the Generative AI strategy. She also serves as an advisor to the United Nations Children's Fund (UNICEF). She is a board member of the Children's Discovery Museum of San Jose. She was selected as a World Economic Forum Young Global Leader, a recognition bestowed on 100 of the world’s most promising business leaders, artists, public servants, technologists, and social entrepreneurs in 2018. Before joining Google, She was the Head of Research at Snap, leading the AI/AR innovation effort. She received her Ph.D. degree from the Computer Science Department at Stanford University.

Author:

Krishna Rangasayee

Founder & CEO
SiMa.ai

Krishna is founder and CEO of SiMa.ai™, a machine learning company enabling effortless ML for the Embedded Edge.

Previously, he was the COO of Groq, a machine learning startup. He was with Xilinx for 18 years, where he was Senior Vice President and GM of Xilinx’s overall business prior to his most recent role as Executive Vice President, Global Sales. Prior to Xilinx, he held various engineering and business roles at Altera Corporation and Cypress Semiconductor. He holds 25+ international patents. He has also served on the board of directors of public and private companies.

Krishna Rangasayee

Founder & CEO
SiMa.ai

Krishna is founder and CEO of SiMa.ai™, a machine learning company enabling effortless ML for the Embedded Edge.

Previously, he was the COO of Groq, a machine learning startup. He was with Xilinx for 18 years, where he was Senior Vice President and GM of Xilinx’s overall business prior to his most recent role as Executive Vice President, Global Sales. Prior to Xilinx, he held various engineering and business roles at Altera Corporation and Cypress Semiconductor. He holds 25+ international patents. He has also served on the board of directors of public and private companies.

Abstract coming soon...

Author:

Soojung Ryu

CEO
SAPEON

As a well-known expert in AI processors, Soojung Ryu is in charge of SAPEON in order to accelerate the company’s growth in the global AI market. She brings more than 25 years of extensive experience in leading various projects related to NPU and GPU.

Before she joined SK Telecom as the head of the AI accelerator office, Ryu was a University-Industry Collaboration Professor at Seoul National University, where she conducted R&D in the NPU and PIM. When she served as the Vice President of Samsung Group's R&D hub, she undertook diverse projects related to GPU. Ryu received her Ph.D. degree in Electrical & Computer Engineering from Georgia Institute of Technology.

Soojung Ryu

CEO
SAPEON

As a well-known expert in AI processors, Soojung Ryu is in charge of SAPEON in order to accelerate the company’s growth in the global AI market. She brings more than 25 years of extensive experience in leading various projects related to NPU and GPU.

Before she joined SK Telecom as the head of the AI accelerator office, Ryu was a University-Industry Collaboration Professor at Seoul National University, where she conducted R&D in the NPU and PIM. When she served as the Vice President of Samsung Group's R&D hub, she undertook diverse projects related to GPU. Ryu received her Ph.D. degree in Electrical & Computer Engineering from Georgia Institute of Technology.

 

Dr. Michael Capps

CEO/Co-Founder
Diveplane

Dr. Michael Capps

CEO/Co-Founder
Diveplane

Dr. Michael Capps

CEO/Co-Founder
Diveplane

Abstract coming soon...

Author:

Jeremy Roberson

Director of Inference Software
FlexLogix

Director of Inference Software at Flex Logix. Jeremy earned his BSEE, MSEE, and PhD EE degrees from UC Davis specializing in Signal Processing Algorithms. Jeremy has worked on algorithms and hardware accelerator architectures for machine learning and signal processing in domains such as automatic speech recognition, object detection for biomedicine, capacitive sensing systems, and more. He has several patents and publications within these areas. He has spent the last 6 years working on inference SW for AI accelerators, first at Intel, and now at Flex Logix. 

Jeremy Roberson

Director of Inference Software
FlexLogix

Director of Inference Software at Flex Logix. Jeremy earned his BSEE, MSEE, and PhD EE degrees from UC Davis specializing in Signal Processing Algorithms. Jeremy has worked on algorithms and hardware accelerator architectures for machine learning and signal processing in domains such as automatic speech recognition, object detection for biomedicine, capacitive sensing systems, and more. He has several patents and publications within these areas. He has spent the last 6 years working on inference SW for AI accelerators, first at Intel, and now at Flex Logix. 

Abstract coming soon...

Author:

Jim Keller

CEO
Tenstorrent

Jim Keller is the CEO of Tenstorrent and a veteran hardware engineer. Prior to joining Tenstorrent, he served two years as Senior Vice President of Intel's Silicon Engineering Group. He has held roles as Tesla's Vice President of Autopilot and Low Voltage Hardware, Corporate Vice President and Chief Cores Architect at AMD, and Vice President of Engineering and Chief Architect at P.A. Semi, which was acquired by Apple Inc. Jim has led multiple successful silicon designs over the decades, from the DEC Alpha processors, to AMD K7/K8/K12, HyperTransport and the AMD Zen family, the Apple A4/A5 processors, and Tesla's self-driving car chip.

Jim Keller

CEO
Tenstorrent

Jim Keller is the CEO of Tenstorrent and a veteran hardware engineer. Prior to joining Tenstorrent, he served two years as Senior Vice President of Intel's Silicon Engineering Group. He has held roles as Tesla's Vice President of Autopilot and Low Voltage Hardware, Corporate Vice President and Chief Cores Architect at AMD, and Vice President of Engineering and Chief Architect at P.A. Semi, which was acquired by Apple Inc. Jim has led multiple successful silicon designs over the decades, from the DEC Alpha processors, to AMD K7/K8/K12, HyperTransport and the AMD Zen family, the Apple A4/A5 processors, and Tesla's self-driving car chip.