CTRL Model Summary
The following table summarizes the pretrained weights of the CTRL models currently supported by PaddleNLP.
| Pretrained Weight | Language | Details of the model |
|---|---|---|
| | English | 48-layer, 1280-hidden, 16-heads, 1701M parameters. The CTRL base model. |
| | English | 2-layer, 16-hidden, 2-heads, 5M parameters. The Tiny CTRL model. |