- Teaching unit: Theory and practical applications of large language models
Number of ECTS credits: 3
APOGEE code: INF2511M
Type of teaching | Number of hours *
Lectures (CM) | 18 h
Practical sessions (TP) | 12 h
* These hours are given as an indication only.
Prerequisites:
M1 in computer science, having completed a traditional computer science curriculum (including training in symbolic AI and machine learning).
Attested competencies (transversal, specific):
Large language models (LLMs) have recently gained popularity due to their fluency and their ability to generalise to many tasks and domains using in-context learning. This course explains the theory behind transformer-based LLMs, how they are trained, and how they can be used and evaluated in practice. We cover a wide variety of state-of-the-art techniques as well as the ethical questions surrounding LLMs.
Course programme / Topics covered:
- Part 1: A closer look at the transformer architecture
RNN architectures
The transformer model for NLG
- Part 2: A short history of large language models
A quick survey of existing LLMs
Running LLMs on Consumer-Grade Computers
- Part 3: Pre-training and Post-training
Pre-training
Post-training/Fine-tuning
- Part 4: Practical techniques and tools for large language models
Prompt engineering
Retrieval Augmented Generation
Useful tools for large language models
- Part 5: Evaluation and ethical questions of large language models
Evaluating LLMs
Security, privacy, and ethical questions of LLMs
- Part 6: Exciting research on large language models (the content of this section is updated continuously)
Programmes / Specialisations / Tracks / Options using this teaching unit:
Date of last update: 24/10/2024