SINHA AMAN | Doctorat UL

12/12

9h00

Soutenance de thèse de AMAN SINHA

Évaluation des modèles de langue médicaux

Evaluation of Medical Language Models

Jury

Directeur de these_CLAUSEL_Marianne_Université de Lorraine
CoDirecteur de these_CONSTANT_Mathieu_Université de Lorraine
Examinateur_NéVéOL_Aurélie_LISN, Université Paris-Saclay
Président_AMBLARD_Maxime_ University of Lorraine
Examinateur_VILLEMONTE DE LA CLERGERIE_Eric _ALMAnaCH, INRIA
Rapporteur_FLEK_Lucie_University of Bonn, Lamarr Institute of Machine Learning and Artificial Intelligence
Rapporteur_TEODORO_Douglas _University of Geneva; Department of Radiology and Medical Informatics, Faculty of Medicine

école doctorale

IAEM - INFORMATIQUE - AUTOMATIQUE - ELECTRONIQUE - ELECTROTECHNIQUE - MATHEMATIQUES

Laboratoire

IECL - Institut Elie Cartan de Lorraine

Mention de diplôme

Informatique

ATILF ATILF, Campus Lettres, 54000, Nancy

Mots clés

Traitement automatique du langage médical,Modèles de langue,Évaluation,Interprétabilité,Incertitude,Linguistique,

Résumé de la thèse

Le langage médical est intrinsèquement complexe, car il constitue un sous-langage spécialisé caractérisé par un texte non structuré, un vocabulaire spécifique au domaine et une hétérogénéité considérable, ce qui le distingue du langage général. La compréhension du langage médical nécessite des connaissances spécialisées, qui sont généralement limitées aux cliniciens et aux professionnels de la santé. De même, les modèles d'apprentissage profond entrainés sur des ensembles de données du domaine général ne parviennent souvent pas à transférer efficacement leurs performances au domaine médical.

Keywords

Medical language Processing,Language Models,Evaluation,Interpretability,Uncertainty,Linguistics,

Abstract

Medical language is inherently complex, as it constitutes a specialized sublanguage characterized by unstructured text, domain-specific vocabulary, and considerable heterogeneity, distinguishing it from general language. Understanding medical language requires expert knowledge, which is typically limited to clinicians and medical professionals. Similarly, deep learning models trained on general-domain datasets often fail to transfer their performance effectively to the medical domain.

Liste de toutes les soutenances