Model Card for wav2vec2-large-xlsr-53-serbian-smart-home-commands

A smart home controller simulator that receives voice commands from a microphone. The model is trained to detect the Serbian words "vrata" (door), "svetlo" (light), "zvuk" (sound), "otvori" (open), "zatvori" (close), "uključi" (turn on) and "isključi" (turn off), which control the state of the door, lights and audio in a smart home system.

Model Details

Model Description

  • Developed by: Mihailo Radović
  • Model type: Audio Classification (Smart home controller)
  • Language(s) (NLP): Serbian
  • License: MIT
  • Finetuned from model: facebook/wav2vec2-large-xlsr-53

Model Sources

Uses

Direct Use

Detecting which command word is spoken in a short audio clip. The supported words are "vrata", "svetlo", "zvuk", "otvori", "zatvori", "uključi" and "isključi"; the detected word is used to control the state of the door, lights and audio in a smart home system.
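To illustrate how detected words could drive the smart home state, here is a minimal sketch. The pairing of action words with devices, the state names, and the `apply_command` helper are all assumptions made for illustration; the actual controller logic lives in the GitHub repo and may differ.

```python
# Hypothetical sketch: the device/action pairing and the state names below
# are assumptions for illustration, not taken from the model repository.
DEVICES = {"vrata": "door", "svetlo": "light", "zvuk": "audio"}

# Action word -> (devices it may apply to, resulting state).
ACTIONS = {
    "otvori": ({"door"}, "open"),
    "zatvori": ({"door"}, "closed"),
    "uključi": ({"light", "audio"}, "on"),
    "isključi": ({"light", "audio"}, "off"),
}


def apply_command(state, device_word, action_word):
    """Return a new state dict after applying one (device, action) word pair.

    Unknown words or mismatched pairs (e.g. "svetlo" + "otvori") leave the
    state unchanged.
    """
    device = DEVICES.get(device_word)
    allowed, new_value = ACTIONS.get(action_word, (set(), None))
    if device is None or device not in allowed:
        return dict(state)
    updated = dict(state)
    updated[device] = new_value
    return updated


initial = {"door": "closed", "light": "off", "audio": "off"}
after_door = apply_command(initial, "vrata", "otvori")
after_light = apply_command(after_door, "svetlo", "uključi")
print(after_light)  # {'door': 'open', 'light': 'on', 'audio': 'off'}
```

Returning a new dict instead of mutating the input keeps each step easy to inspect when replaying a sequence of recognized words.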

Out-of-Scope Use

The model works best on the words in its training dataset. For out-of-vocabulary words, a DTW (dynamic time warping) check is implemented; see the code in the GitHub repo.
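For readers unfamiliar with DTW, the idea behind such a check can be sketched as follows. This is a generic textbook DTW with a rejection threshold, not the code from the repo: the feature representation, the reference templates, the threshold value, and the `is_in_vocabulary` helper are all assumptions.

```python
import math


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two feature sequences.

    `a` and `b` are lists of equal-dimension feature vectors (lists of
    floats). The local cost is the Euclidean distance between two frames.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b frame repeats
                cost[i][j - 1],      # b advances, a frame repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def is_in_vocabulary(features, templates, threshold):
    """Hypothetical check: accept a clip only if it lies within `threshold`
    of at least one reference template."""
    best = min(dtw_distance(features, t) for t in templates)
    return best <= threshold


# Toy one-dimensional "features" standing in for real audio features.
ref = [[0.0], [1.0], [2.0]]
stretched = [[0.0], [0.0], [1.0], [2.0], [2.0]]  # same shape, spoken slower
other = [[5.0], [5.0], [5.0]]                    # unrelated signal

print(dtw_distance(ref, stretched))  # 0.0
print(dtw_distance(ref, other))      # 12.0
```

Because DTW tolerates differences in speaking rate, a slower utterance of a known word still matches its template, while an unrelated signal accumulates a large cost and can be rejected.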

How to Get Started with the Model

See the explanation in the README of my GitHub repository.
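As a rough starting point, the sketch below assumes the checkpoint loads with the standard `transformers` audio-classification pipeline and that its labels correspond to the command words; neither is confirmed here, and the README remains the authoritative guide. The helper names `load_classifier` and `top_prediction` are hypothetical.

```python
MODEL_ID = "mradovic38/wav2vec2-large-xlsr-53-serbian-smart-home-commands"


def load_classifier(model_id=MODEL_ID):
    """Build an audio-classification pipeline (downloads the checkpoint).

    Assumption: the checkpoint is compatible with the generic pipeline.
    """
    # Imported lazily so top_prediction() works without transformers installed.
    from transformers import pipeline

    return pipeline("audio-classification", model=model_id)


def top_prediction(predictions):
    """Pick the best entry from the pipeline's list of
    {"label": ..., "score": ...} dicts and return (label, score)."""
    best = max(predictions, key=lambda p: p["score"])
    return best["label"], best["score"]
```

With `transformers` installed, `top_prediction(load_classifier()("command.wav"))` would return the most likely label and its score, where `command.wav` is a placeholder path to a short recording.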

Model size: 315M params (Safetensors, tensor type F32)

Model ID: mradovic38/wav2vec2-large-xlsr-53-serbian-smart-home-commands