Update README.md
---
license: mit
language:
- fr
metrics:
- f1
base_model:
- almanach/camembert-base
pipeline_tag: text2text-generation
library_name: transformers
tags:
- Transformer
- disambiguation
---

# wsd-camembert-base-semcor-wngt-fr: almanach/camembert-base fine-tuned on French SemCor and WNGT for Word Sense Disambiguation

*wsd-camembert-base-semcor-wngt-fr* is a Word Sense Disambiguation (WSD) model fine-tuned on the French versions of the SemCor and WNGT datasets, with *almanach/camembert-base* providing the pretrained BERT embeddings.

The fine-tuned model achieves the following performance on SemEval 2013 (fr):

| Test F1 (%) | GPUs | Epochs |
|:-----------:|:----:|:------:|
| 51.28 | 1xV100 32GB | 40 |

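The card reports a single F1 score without describing how it is computed. As a minimal sketch, assuming the usual WSD scoring convention (precision over the instances the system answered, recall over all gold instances), the score could be derived as follows. The function name `wsd_f1`, the instance ids, and the toy sense keys are illustrative and not taken from NWSD.

```python
def wsd_f1(gold, predicted):
    """Return (precision, recall, f1) in percent.

    gold: dict mapping instance id -> set of acceptable sense keys.
    predicted: dict mapping instance id -> one predicted sense key
               (instances the system skipped are simply absent).
    """
    attempted = [i for i in predicted if i in gold]
    correct = sum(1 for i in attempted if predicted[i] in gold[i])
    precision = 100.0 * correct / len(attempted) if attempted else 0.0
    recall = 100.0 * correct / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Toy data (hypothetical): four gold instances, three answered, two correct.
gold = {
    "d001.s001.t001": {"bank%1:14:00::"},
    "d001.s001.t002": {"money%1:21:00::"},
    "d001.s002.t001": {"river%1:17:00::"},
    "d001.s002.t002": {"water%1:27:00::"},
}
predicted = {
    "d001.s001.t001": "bank%1:14:00::",
    "d001.s001.t002": "money%1:21:02::",
    "d001.s002.t001": "river%1:17:00::",
}
precision, recall, f1 = wsd_f1(gold, predicted)
```

When the system answers every instance, precision, recall, and F1 coincide, which is why a single figure is often reported.
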
## 📝 Model Details

The WSD model is a **Transformer encoder-decoder** architecture with 6 layers in both the encoder and the decoder. It uses pretrained BERT embeddings for semantic representation.

## 💻 How to disambiguate a sentence

To disambiguate a sentence, please refer to the official [NWSD](https://github.com/macairececile/nwsd?tab=readme-ov-file#disambiguate-a-text) repository.

## ⚙️ Training Details

### Training and Test Data

We use [Semcor.fr](https://frsemcor.github.io/FrSemCor/) and [WNGT.fr](https://github.com/getalp/UFSAC), annotated with WordNet 3.0 sense keys, with the following split:

| | Train | Valid |
|:-------------:|:-------------:|:--------------:|
| # utterances | 143,597 | 4,000 |

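WordNet sense keys follow the pattern `lemma%ss_type:lex_filenum:lex_id:head_word:head_id`. The sketch below splits one key into its fields. The helper name `parse_sense_key` and the example key are illustrative and do not come from the NWSD code.

```python
from typing import NamedTuple

# Synset type codes used in WordNet sense keys.
SS_TYPES = {
    1: "noun",
    2: "verb",
    3: "adjective",
    4: "adverb",
    5: "adjective satellite",
}


class SenseKey(NamedTuple):
    lemma: str
    pos: str
    lex_filenum: int
    lex_id: int
    head_word: str
    head_id: str


def parse_sense_key(key: str) -> SenseKey:
    """Split a sense key such as 'bank%1:14:00::' into its fields."""
    # The lemma is everything before the last '%'.
    lemma, _, lex_sense = key.rpartition("%")
    ss_type, lex_filenum, lex_id, head_word, head_id = lex_sense.split(":")
    return SenseKey(
        lemma=lemma,
        pos=SS_TYPES[int(ss_type)],
        lex_filenum=int(lex_filenum),
        lex_id=int(lex_id),
        head_word=head_word,
        head_id=head_id,
    )


parsed = parse_sense_key("bank%1:14:00::")
```

The `head_word` and `head_id` fields are only filled for adjective satellites, so they are empty strings for most keys.
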
The test data, `semeval2013task12.fr.xml`, is the French version of the [SemEval-2013 Task 12](https://aclanthology.org/S13-2040/) test set:

| | Test |
|:-------------:|:-------------:|
| # utterances | 306 |

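Counts like the ones above can be checked with a few lines of Python. This is a sketch under stated assumptions: it supposes a UFSAC-style XML layout with `sentence` and `word` elements and a `wn30_key` attribute on annotated words, and it supposes that an utterance corresponds to one `sentence` element. Neither point is confirmed by this card, and the inline sample is invented for illustration.

```python
import xml.etree.ElementTree as ET

# Invented sample in an assumed UFSAC-style layout (not real corpus data).
SAMPLE = """<corpus>
  <document>
    <paragraph>
      <sentence>
        <word surface_form="La"/>
        <word surface_form="banque" wn30_key="bank%1:14:00::"/>
      </sentence>
      <sentence>
        <word surface_form="Il"/>
        <word surface_form="pleut"/>
      </sentence>
    </paragraph>
  </document>
</corpus>"""


def corpus_stats(root):
    """Return (number of sentences, number of sense-annotated words)."""
    sentences = root.findall(".//sentence")
    # Assumed attribute name: words carrying a sense annotation have 'wn30_key'.
    annotated = [w for w in root.iter("word") if w.get("wn30_key")]
    return len(sentences), len(annotated)


root = ET.fromstring(SAMPLE)
n_sentences, n_annotated = corpus_stats(root)
```

For a file on disk, `ET.parse(path).getroot()` would replace `ET.fromstring(SAMPLE)`.
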
### Training Procedure and Hyperparameters

We follow the training procedure provided in the [NWSD](https://github.com/macairececile/nwsd) GitHub repository.

#### Training time

Training took about 4 hours on 1xV100 32GB.

#### Libraries

[Disambiguate](https://github.com/macairececile/nwsd):
```bibtex
@inproceedings{vial-etal-2019-sense,
    title = "Sense Vocabulary Compression through the Semantic Knowledge of {W}ord{N}et for Neural Word Sense Disambiguation",
    author = {Vial, Lo{\"i}c and
      Lecouteux, Benjamin and
      Schwab, Didier},
    editor = "Vossen, Piek and
      Fellbaum, Christiane",
    booktitle = "Proceedings of the 10th Global Wordnet Conference",
    month = jul,
    year = "2019",
    address = "Wroclaw, Poland",
    publisher = "Global Wordnet Association",
    url = "https://aclanthology.org/2019.gwc-1.14/",
    pages = "108--117",
}
```

## 💡 Information

- **Developed by:** Cécile Macaire
- **Funded by:** GENCI-IDRIS (Grant 2023-AD011013625R1) and PROPICTO (ANR-20-CE93-0005)
- **Language(s) (NLP):** French
- **License:** MIT
- **Finetuned from model:** almanach/camembert-base