Differences between OrionForCausalLM and LlamaForCausalLM

#5
by J22 - opened

As far as I can tell, the only differences are that input_layernorm, post_attention_layernorm and final norm are changed to nn.LayerNorm from LlamaRMSNorm.
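For reference, the practical difference is that `nn.LayerNorm` subtracts the mean and has a learnable bias, while Llama's RMSNorm only rescales by the root-mean-square. A minimal sketch (the `LlamaRMSNorm` below mirrors the transformers implementation; the toy tensor shapes are just for illustration):

```python
import torch
import torch.nn as nn

class LlamaRMSNorm(nn.Module):
    """RMSNorm as in Llama: rescale by RMS only — no mean subtraction, no bias."""
    def __init__(self, hidden_size, eps=1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(hidden_size))
        self.variance_epsilon = eps

    def forward(self, hidden_states):
        variance = hidden_states.pow(2).mean(-1, keepdim=True)
        hidden_states = hidden_states * torch.rsqrt(variance + self.variance_epsilon)
        return self.weight * hidden_states

hidden_size = 8
x = torch.randn(2, 4, hidden_size)

rms_norm = LlamaRMSNorm(hidden_size)    # Llama-style: RMS scaling only
layer_norm = nn.LayerNorm(hidden_size)  # Orion-style: mean-centering + learnable bias

# LayerNorm output is mean-centered per position; RMSNorm output generally is not.
print(layer_norm(x).mean(-1))  # ~0 everywhere
print(rms_norm(x).mean(-1))    # generally nonzero
```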

The attention and embedding implementations also differ in the remote modeling code (loaded via trust_remote_code).
