Commit 0930397
1 Parent(s): 5daa6bb
Update README.md
README.md CHANGED
@@ -28,7 +28,7 @@ The performance of the model is the following:

 | Release | Test WER | GPUs |
 |:-------------:|:--------------:| :--------:|
-| 20-05-22 | 3.
+| 20-05-22 | 3.09 | 1xV100 32GB |

 ## Pipeline description

@@ -65,6 +65,28 @@ asr_model.transcribe_file('speechbrain/asr-crdnn-rnnlm-librispeech/example.wav')
 ### Inference on GPU
 To perform inference on the GPU, add `run_opts={"device":"cuda"}` when calling the `from_hparams` method.

+### Training
+The model was trained with SpeechBrain (Commit hash: '2abd9f01').
+To train it from scratch follow these steps:
+1. Clone SpeechBrain:
+```bash
+git clone https://github.com/speechbrain/speechbrain/
+```
+2. Install it:
+```
+cd speechbrain
+pip install -r requirements.txt
+pip install -e .
+```
+
+3. Run Training:
+```
+cd recipes/LibriSpeech/ASR/seq2seq/
+python train.py hparams/train_BPE_1000.yaml --data_folder=your_data_folder
+```
+
+You can find our training results (models, logs, etc) [here](https://drive.google.com/drive/folders/1SAndjcThdkO-YQF8kvwPOXlQ6LMT71vt?usp=sharing).
+
 ### Limitations
 The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.

@@ -77,7 +99,7 @@ The SpeechBrain team does not provide any warranty on the performance achieved b
 year = {2021},
 publisher = {GitHub},
 journal = {GitHub repository},
-howpublished = {
+howpublished = {\\url{https://github.com/speechbrain/speechbrain}},
 }
 ```

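The "Inference on GPU" note in the README can be sketched roughly as follows. This is an illustration under stated assumptions, not part of the commit: the `EncoderDecoderASR` class and its `speechbrain.pretrained` import path are assumed from the SpeechBrain release of that period, the model identifier is inferred from the path in the hunk header, and `make_run_opts`, `load_asr_model`, and the `savedir` folder are hypothetical names chosen for the example.

```python
def make_run_opts(device="cuda"):
    """Build the run_opts dict that the README says to pass to from_hparams."""
    return {"device": device}


def load_asr_model(device="cuda"):
    """Load the pretrained ASR model on the requested device.

    The import is deferred so the helper above can be used without
    SpeechBrain installed.
    """
    # Assumption: 2021-era import path; not shown in the diff itself.
    from speechbrain.pretrained import EncoderDecoderASR

    return EncoderDecoderASR.from_hparams(
        # Assumption: model id inferred from the path in the hunk header.
        source="speechbrain/asr-crdnn-rnnlm-librispeech",
        # Hypothetical local folder for the downloaded files.
        savedir="pretrained_models/asr-crdnn-rnnlm-librispeech",
        run_opts=make_run_opts(device),
    )
```

With SpeechBrain installed and a CUDA device available, `load_asr_model().transcribe_file(...)` would then run on the GPU; passing `device="cpu"` keeps the same call on the CPU.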