Webinar
The BigScience Project: Collaboratively training a large multilingual language model
Recent breakthroughs in Natural Language Processing (NLP) demonstrate the ability of Large Language Models (LLMs), such as GPT-3 and T5, to solve diverse problems. However, building such large models raises challenges around data, training, and deployment. Recently, the BigScience collaborative initiative - a group of over 1,000 researchers from academia and industry - developed BLOOM, a massive, open-source, 176-billion-parameter multilingual model.
In this webinar, we will present the key learnings of the BigScience initiative in developing the BLOOM language model in a transparent and collaborative way. It is an opportunity to discover what motivated the creation of the workshop, to learn about the four-month training run on the Jean Zay (IDRIS) supercomputer using 384 NVIDIA A100 GPUs, and to discuss BLOOM's achievements.
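Because BLOOM is released openly on the Hugging Face Hub, it can be tried out ahead of the session. Below is a minimal sketch, assuming the transformers library and the smaller bigscience/bloom-560m checkpoint as a stand-in for the full 176B model (which requires multiple high-memory GPUs); it is an illustration, not material from the webinar itself.

```python
# Minimal sketch: load an openly released BLOOM checkpoint and generate text.
# bigscience/bloom-560m is assumed here as a small stand-in for the full
# bigscience/bloom (176B) model, which needs far more GPU memory.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloom-560m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Encode a prompt and generate a short continuation.
inputs = tokenizer("BigScience is a collaborative effort to", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```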
Lucile Saulnier is a Machine Learning engineer at Hugging Face. She develops and supports the use of open source tools. She is also actively involved in research projects in the field of Deep Learning such as BigScience - a one-year collaborative project aiming to produce a large multilingual language model and a very large multilingual text dataset on the Jean Zay supercomputer.
Meriem is a senior Deep Learning data scientist at NVIDIA, supporting partners delivering AI/deep learning solutions. Meriem's areas of expertise are conversational AI and large-scale Natural Language Processing. Meriem holds a Ph.D. in signal and image processing from Telecom ParisTech, where she studied machine learning applied to audio-visual content.