Hasty Briefsbeta

Qwen3-ASR Technical Report

6 days ago
  • #ASR
  • #Speech Recognition
  • #Forced Alignment
  • Introduction of Qwen3-ASR family, including two ASR models and a forced alignment model.
  • Qwen3-ASR-1.7B and Qwen3-ASR-0.6B support 52 languages and dialects, leveraging large-scale training data.
  • Qwen3-ASR-1.7B achieves SOTA performance among open-sourced ASR models.
  • Qwen3-ASR-0.6B offers the best accuracy-efficiency trade-off with low latency.
  • Qwen3-ForcedAligner-0.6B outperforms existing force alignment models in efficiency and versatility.
  • Models are released under the Apache 2.0 license to accelerate ASR and audio understanding research.