Today we are announcing our work on building speech recognition models without any labeled data! wav2vec-U rivals some of the best supervised systems from only two years ago.

Paper: https://t.co/cYzF9MGu56
Blog: https://t.co/iiGmgdnCiV
Code:

This shows how completely unsupervised speech recognition with wav2vec-U compares to the best supervised systems on the Librispeech benchmark over the past few years.
Here is how it works:
It also works in languages other than English, see the Swahili demo below. So far we tried it on Kyrgyz, Tatar, German, Dutch, French, Spanish, Portuguese, Italian.

More from All

You May Also Like