THE 20 QUESTIONS GAME TO DISTINGUISH LARGE LANGUAGE MODELS - LAAS-Informatique Critique
Preprint, Working Paper. Year: 2024

Abstract

In a parallel with the 20 questions game, we present a method to determine whether two large language models (LLMs), placed in a black-box context, are the same or not. The goal is to use a small set of (benign) binary questions, typically under 20. We formalize the problem and first establish a baseline using a random selection of questions from known benchmark datasets, achieving an accuracy of nearly 100% within 20 questions. After showing optimal bounds for this problem, we introduce two effective questioning heuristics able to discriminate 22 LLMs by using half as many questions for the same task. These methods offer significant advantages in terms of stealth and are thus of interest to auditors or copyright owners facing suspicions of model leaks.
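The random-question baseline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Model` interface, the `same_model` function, the toy models, and the question pool are all hypothetical, and the sketch assumes that each model answers a given question deterministically and that a single disagreement is enough to declare two models different.

```python
import random
from typing import Callable, Sequence

# Hypothetical interface: a black-box model maps a yes/no question to a bool.
Model = Callable[[str], bool]


def same_model(model_a: Model, model_b: Model, questions: Sequence[str],
               budget: int = 20, seed: int = 0) -> bool:
    """Ask up to `budget` randomly chosen binary questions to both models.

    Returns False at the first disagreement, True if none is observed.
    Assumption: answers are deterministic, so one disagreement suffices.
    """
    rng = random.Random(seed)
    asked = rng.sample(list(questions), k=min(budget, len(questions)))
    for question in asked:
        if model_a(question) != model_b(question):
            return False
    return True


# Toy stand-ins for black-box LLMs (illustrative only).
def yes_model(question: str) -> bool:
    return True


def no_model(question: str) -> bool:
    return False


# Placeholder pool; the abstract draws questions from benchmark datasets.
pool = [f"Is {n} a prime number?" for n in range(1, 101)]

print(same_model(yes_model, yes_model, pool))
print(same_model(yes_model, no_model, pool))
```

With real models, answers may vary between calls, so a practical test would likely need a tolerance on the disagreement rate rather than stopping at the first mismatch.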
Main file
main-arxiv.pdf (314.01 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04699271, version 1 (16-09-2024)

Identifiers

  • HAL Id: hal-04699271, version 1

Cite

Gurvan Richardeau, Erwan Le Merrer, Camilla Penzo, Gilles Trédan. THE 20 QUESTIONS GAME TO DISTINGUISH LARGE LANGUAGE MODELS. 2024. ⟨hal-04699271⟩