Kaldi is an open source toolkit for research in speech recognition and speech signal processing. Due to its efficient and accurate implementation of many algorithms, this toolkit is leveraged by many organizations that develop speech technology. This tutorial introduces Kaldi in a gentle manner, to improve your level of comfort with the software. This tutorial is aimed toward providing hands-on experience with Kaldi, including setup and training of a basic speech recognizer. The structure of the code and the basics of Kaldi operations are explained, with a general overview of the toolkit capabilities. Kaldi input and output methods, file structure, parallelization, and piping mechanisms are explained. The focus is on getting a hands-on experience with the tools and getting a basic system working during the tutorial.
Participants are expected to have some Linux OS experience and a level of comfort with command line. They are expected to bring either a MacBook or Linux laptop, or have SSH access to a remote Linux machine for use during this workshop. Laptops will not be provided.