---
base_model: senseable/WestLake-7B-v2
license: apache-2.0
datasets:
  - cognitivecomputations/samantha-data
library_name: transformers
model_creator: cognitivecomputations
model_name: Samantha 1.1 WestLake 7B - AWQ
model_type: mistral
pipeline_tag: text-generation
prompt_template: '<|im_start|>system

  {system_message}<|im_end|>

  <|im_start|>user

  {prompt}<|im_end|>

  <|im_start|>assistant

  '
quantized_by: Suparious
---
# Samantha 7B v1.1 laser - AWQ

- Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
- Original model: [WestLake 7B v2](https://huggingface.co/cognitivecomputations/WestLake-7B-v2)
- Fine Tuning: [cognitivecomputations](https://huggingface.co/cognitivecomputations/samantha-1.1-westlake-7b-laser)

It follows the implementation of [laserRMT](https://github.com/cognitivecomputations/laserRMT)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/DQ2iBVPM1PA4GKQBgvMEO.png)

## Model description

This repo contains AWQ model files for [cognitivecomputations's Samantha 7B v1.1](https://huggingface.co/cognitivecomputations/samantha-1.1-westlake-7b-laser).

These files were quantised using hardware kindly provided by [SolidRusT Networks](https://solidrust.net/).

### About AWQ

AWQ is an efficient, accurate and blazing-fast low-bit weight quantization method, currently supporting 4-bit quantization. Compared to GPTQ, it offers faster Transformers-based inference with equivalent or better quality compared to the most commonly used GPTQ settings.

AWQ models are currently supported on Linux and Windows, with NVidia GPUs only. macOS users: please use GGUF models instead.

It is supported by:

- [Text Generation Webui](https://github.com/oobabooga/text-generation-webui) - using Loader: AutoAWQ
- [vLLM](https://github.com/vllm-project/vllm) - version 0.2.2 or later for support for all model types.
- [HF中国镜像站 Text Generation Inference (TGI)](https://github.com/huggingface/text-generation-inference)
- [Transformers](https://huggingface.co/docs/transformers) version 4.35.0 and later, from any code or client that supports Transformers
- [AutoAWQ](https://github.com/casper-hansen/AutoAWQ) - for use from Python code

## Prompt template: ChatML

```plaintext
<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant
```

Also working with Basic Mistral format:

```plaintext
<|system|>
</s>
<|user|>
{prompt}</s>
<|assistant|>
```