Automatic Speech Recognition
ESPnet
English
audio
audio_captioning
File size: 182 Bytes
680452b
 
 
 
 
e8e26b4
680452b
 
 
e8e26b4
 
680452b
 
1
2
3
4
5
6
7
8
9
10
11
12
13
---
tags:
- espnet
- audio
- automatic-speech-recognition
- audio_captioning
language: en
datasets:
- clotho_v2
- slseanwu/clotho-chatgpt-mixup-50K
- audiocaps
license: cc-by-4.0
---