cecilemacaire commited on
Commit
c7324c6
·
verified ·
1 Parent(s): 9548605

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +85 -3
README.md CHANGED
@@ -1,3 +1,85 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - fr
5
+ metrics:
6
+ - f1
7
+ base_model:
8
+ - almanach/camembert-base
9
+ pipeline_tag: text2text-generation
10
+ library_name: transformers
11
+ tags:
12
+ - Transformer
13
+ - disambiguation
14
+ ---
15
+
16
+ # wsd-camembert-base-semcor-wngt-fr : almanach/camembert-base fine-tuned on Semcor+WNGT fr for Word Sense Disambiguation
17
+
18
+ <!-- Provide a quick summary of what the model is/does. -->
19
+
20
+ *wsd-camembert-base-semcor-wngt-fr* is a Word Sense Disambiguation (WSD) model fine-tuned on the French version of Semcor and WNGT datasets with *almanach/camembert-base* as the pretrained BERT embeddings.
21
+
22
+ The fine-tuned model achieves the following performance on SemEval 2013 - fr:
23
+ | Test F1 (%) | GPUs | Epochs
24
+ |:-------------:|:--------------:|:--------------:| :--------:|:--------:|
25
+ | 51.28 | 1xV100 32GB | 40 |
26
+
27
+ ## 📝 Model Details
28
+
29
+ The WSD model is a **Transformer encoder-decoder** architecture, consisting of 6 layers in both the encoder and decoder, and leveraging pretrained BERT embeddings for enhanced semantic representation.
30
+
31
+ ## 💻 How to disambiguate a sentence
32
+
33
+ To disambiguate a sentence, please refer to the official [NWSD](https://github.com/macairececile/nwsd?tab=readme-ov-file#disambiguate-a-text) repository.
34
+
35
+ ## ⚙️ Training Details
36
+
37
+ ### Training and Test Data
38
+
39
+ We use [Semcor.fr](https://frsemcor.github.io/FrSemCor/) and [WNGT.fr](https://github.com/getalp/UFSAC) annotated with WordNet 3.0 sense keys IDs with:
40
+ | | Train | Valid |
41
+ |:-------------:|:-------------:|:--------------:|
42
+ | # utterances | 143,597 | 4,000 |
43
+
44
+ The semeval2013task12.fr.xml test data is the French version of the [SemEval-2013 Task 12](https://aclanthology.org/S13-2040/) test set, with:
45
+ | | Test |
46
+ |:-------------:|:-------------:|
47
+ | # utterances | 306 |
48
+
49
+ ### Training Procedure and Hyperparameters
50
+
51
+ We follow the training procedure provided in the [NWSD](https://github.com/macairececile/nwsd) github repository.
52
+
53
+ #### Training time
54
+
55
+ With 1xV100 32GB, the training took ~ 4 hours.
56
+
57
+ #### Libraries
58
+
59
+ [Disambiguate](https://github.com/macairececile/nwsd):
60
+ ```bibtex
61
+ @inproceedings{vial-etal-2019-sense,
62
+ title = "Sense Vocabulary Compression through the Semantic Knowledge of {W}ord{N}et for Neural Word Sense Disambiguation",
63
+ author = {Vial, Lo{\"i}c and
64
+ Lecouteux, Benjamin and
65
+ Schwab, Didier},
66
+ editor = "Vossen, Piek and
67
+ Fellbaum, Christiane",
68
+ booktitle = "Proceedings of the 10th Global Wordnet Conference",
69
+ month = jul,
70
+ year = "2019",
71
+ address = "Wroclaw, Poland",
72
+ publisher = "Global Wordnet Association",
73
+ url = "https://aclanthology.org/2019.gwc-1.14/",
74
+ pages = "108--117",
75
+ }
76
+ ```
77
+
78
+ ## 💡 Information
79
+
80
+ - **Developed by:** Cécile Macaire
81
+ - **Funded by [optional]:** GENCI-IDRIS (Grant 2023-AD011013625R1)
82
+ PROPICTO ANR-20-CE93-0005
83
+ - **Language(s) (NLP):** French
84
+ - **License:** MIT
85
+ - **Finetuned from model:** almanach/camembert-base