Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS

StepFun
Enterprise
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
Welcome to StepFun 🙋
StepFun, founded in April 2023 with the mission to “Scale-up possibilities for everyone,” unites top talent in artificial intelligence from both domestic and international backgrounds, and is dedicated to advancing toward AGI. The company has already launched the Step series of foundation models, which includes Step-2, a cutting-edge trillion-parameter Mixture of Experts (MoE) language model; Step-1.5V, a powerful multimodal large model; and Step-1V, an innovative image generation model, among others.
Collections
1
spaces
2
models
7

stepfun-ai/stepvideo-t2v
Text-to-Video
•
Updated
•
1.91k
•
416

stepfun-ai/Step-Audio-Tokenizer
Updated
•
32

stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text
•
Updated
•
1.2k
•
427

stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
•
Updated
•
2.09k
•
168

stepfun-ai/stepvideo-t2v-turbo
Updated
•
85

stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
•
76.9k
•
1.42k

stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text
•
Updated
•
189k
•
171