학술논문

Stealing the Decoding Algorithms of Language Models

Document Type

Working Paper

Author

Naseh, Ali; Krishna, Kalpesh; Iyyer, Mohit; Houmansadr, Amir

Source

Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security

Subject

Computer Science - Machine Learning
Computer Science - Computation and Language
Computer Science - Cryptography and Security

Language

Abstract

A key component of generating text from modern language models (LM) is the selection and tuning of decoding algorithms. These algorithms determine how to generate text from the internal probability distribution generated by the LM. The process of choosing a decoding algorithm and tuning its hyperparameters takes significant time, manual effort, and computation, and it also requires extensive human evaluation. Therefore, the identity and hyperparameters of such decoding algorithms are considered to be extremely valuable to their owners. In this work, we show, for the first time, that an adversary with typical API access to an LM can steal the type and hyperparameters of its decoding algorithms at very low monetary costs. Our attack is effective against popular LMs used in text generation APIs, including GPT-2, GPT-3 and GPT-Neo. We demonstrate the feasibility of stealing such information with only a few dollars, e.g., $\$0.8$, $\$1$, $\$4$, and $\$40$ for the four versions of GPT-3.

Online Access

Open Access (Arxiv) Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송