composer.models.gpt2#
Modules
GPT-2 model based on Hugging Face GPT-2. |
The GPT-2 model family is set of transformer-based networks for autoregressive language modeling at various scales. This family was originally proposed by OpenAI, and is trained on the OpenWebText dataset. It is useful for downstream language generation tasks, such as summarization, translation, and dialog.
See the Model Card for more details.
Classes
Implements |
Hparams
These classes are used with yahp
for YAML
-based configuration.