Steveeeeeeen/mls_eng_10k
Updated
•
47
Hey!
You're on the right track! The output you're seeing is still tokenized with special markers. To convert this into natural language text, you need to use your tokenizer's decode function.
You can find an example here: https://huggingface.co/HKUSTAudio/Llasa-1B