Covers: implementation of Speech Recognition with Connectionist Temporal Classification
Estimated time needed to finish: 30 minutes
Questions this item addresses:
  • How do you tune an existing connectionist temporal classification speech recognition model?
How to use this item?

Skim the article and then try out the implementation yourself.

Author(s) / creator(s) / reference(s)
Gourav Bais
0 comment

Create Your First Speech-to-text Model With Connectionist Temporal Classification

Total time needed: ~4 hours
Learn about methods of speech recognition, and create your own speech recognition model that recognizes speech on-the-fly
Potential Use Cases
People creating a speech recognition model, or people using an existing speech recognition model that want to understand and tune it better
Who is This For ?
BEGINNERData scientists new to speech recognition
Click on each of the following annotated items to see details.
ARTICLE 1. Speech Recognition — Deep Speech, CTC, Listen, Attend, and Spell
  • What are the different methods of speech recognition?
25 minutes
ARTICLE 2. Sequence Modeling with CTC
  • What is the mathematical background to connectionist temporal classification?
15 minutes
VIDEO 3. Real-time Speech to Text with DeepSpeech - Getting Started on Windows and Transcribe Microphone Free
  • How do you run the Deep Speech speech recognition model, which was trained with connectionist temporal classification?
40 minutes
ARTICLE 4. Train Your Own Speech Recognition Model in 5 Simple Steps
  • How do you tune an existing connectionist temporal classification speech recognition model?
30 minutes
ARTICLE 5. Building an End-to-End Speech Recognition Model in PyTorch
  • How do you build a speech recognition model from scratch with a connectionist temporal classification loss function?
2 hours

Concepts Covered

0 comment