Welcome to PaddleNLP

Welcome to PaddleNLP#

PaddleNLP is the natural language processing development library of PaddlePaddle, featuring user-friendly text domain APIs, multi-scenario application examples, and high-performance distributed training. It aims to enhance the modeling efficiency in the text domain for PaddlePaddle developers and provides a wealth of NLP application examples.

  • User-friendly Text Domain APIs

    • Offers a rich set of industry-grade preset task capabilities Taskflow and comprehensive text domain APIs: supports a wide range of Chinese dataset loading with Dataset API, enables flexible and efficient data preprocessing with Data API, provides over 60+ pretrained word vectors with Embedding API, and offers over 100+ pretrained models with Transformer API, significantly boosting the efficiency of NLP task modeling.

  • Multi-scenario Application Examples

    • Covers NLP application examples from academic to industry levels, including fundamental NLP technologies, NLP system applications, and related extended applications. Developed entirely based on the new API system of the PaddlePaddle core framework 2.0, it provides developers with best practices in the text domain of PaddlePaddle.

  • High-performance Distributed Training

    • Based on the leading automatic mixed precision optimization strategy of the PaddlePaddle core framework, combined with the distributed Fleet API, it supports 4D hybrid parallel strategies, efficiently completing large-scale pretrained model training.

paddlenlp WeChat group QR code

Evaluation Metrics

Practical Tutorials

Indices and tables#