LongWriter-glm4-9b#


README(From Huggingface)#


language:

  • en

  • zh library_name: transformers tags:

  • Long Context

  • chatglm

  • llama datasets: train:

    • AI-ModelScope/LongWriter-6k pipeline_tag: text-generation studios:

  • ZhipuAI/LongWriter-glm4-9b-demo


LongWriter-glm4-9b#

🤖 [LongWriter Dataset] • 💻 [Github Repo] • 📃 [LongWriter Paper]

LongWriter-glm4-9b is trained based on glm-4-9b, and is capable of generating 10,000+ words at once.

A simple demo for deployment of the model:

from paddlenlp.transformers import AutoTokenizer, AutoModelForCausalLM
import paddle
tokenizer = AutoTokenizer.from_pretrained("ZhipuAI/LongWriter-glm4-9b", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("ZhipuAI/LongWriter-glm4-9b", dtype=paddle.bfloat16, trust_remote_code=True, )
model = model.eval()
query = "Write a `10000`-word China travel guide"
response, history = model.chat(tokenizer, query, history=[], max_new_tokens=1024, temperature=0.5)
print(response)

Environment: transformers==4.43.0

License: glm-4-9b License

Citation#

If you find our work useful, please consider citing LongWriter:

@article{bai2024longwriter,
  title={LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs}, 
  author={Yushi Bai and Jiajie Zhang and Xin Lv and Linzhi Zheng and Siqi Zhu and Lei Hou and Yuxiao Dong and Jie Tang and Juanzi Li},
  journal={arXiv preprint arXiv:2408.07055},
  year={2024}
}

Model Files#

Back to Main