metadata
license: apache-2.0
base_model:
  - keeeeenw/MicroLlama
pipeline_tag: text-generation

yujiepan/microllama-0.3B

This is the same model as keeeeenw/MicroLlama, with the weights converted to BF16.

It is a small pretrained model for text generation, which makes it handy for algorithm development and debugging.
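For a quick smoke test, a minimal sketch along these lines should work. It assumes the repo loads through the standard `transformers` Auto classes and ships its own tokenizer files, neither of which this README states. The helper name `generate_text` and its defaults are illustrative only.

```python
MODEL_ID = "yujiepan/microllama-0.3B"


def generate_text(prompt: str, max_new_tokens: int = 32) -> str:
    """Greedy-decode a short continuation of `prompt` (hypothetical helper)."""
    # Heavy dependencies are imported lazily so that merely defining this
    # helper does not require torch/transformers to be installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the tokenizer is hosted in the same repo as the weights.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # Load in BF16 to match the dtype the weights were converted to.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding keeps debugging runs repeatable
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `print(generate_text("Once upon a time"))` downloads the weights on first use. Greedy decoding is chosen here only because deterministic output is easier to compare across debugging runs.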

Special thanks to the original author keeeeenw for the hard work and contribution.

This repo is just a backup for myself. If you find this model useful, consider using the original repo instead.

Wikitext2 PPL

  • FP32: 33.07
  • BF16: 33.09
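
This README does not describe how the perplexities were measured, so the sketch below only illustrates the standard definition (the exponential of the mean per-token negative log-likelihood) and uses it to put the FP32/BF16 gap in relative terms. The function names are illustrative.

```python
import math


def perplexity(total_nll: float, n_tokens: int) -> float:
    """Perplexity = exp(mean per-token negative log-likelihood, in nats)."""
    if n_tokens <= 0:
        raise ValueError("n_tokens must be positive")
    return math.exp(total_nll / n_tokens)


def relative_gap(reference: float, other: float) -> float:
    """Relative difference of `other` with respect to `reference`."""
    return (other - reference) / reference


# Values reported in the list above.
FP32_PPL = 33.07
BF16_PPL = 33.09

gap = relative_gap(FP32_PPL, BF16_PPL)
print(f"BF16 vs FP32 relative PPL gap: {gap:.4%}")
```

The gap comes out well under 0.1%, so by this measure the BF16 conversion costs almost nothing in quality.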