Microsoft Speech Corpus (Indian languages) release contains conversational and phrasal speech training and test data for Telugu, Tamil and Gujarati languages. The data package includes audio and corresponding transcripts.
Language - Tamil
Reference - https://msropendata.com/datasets/7230b4b1-912d-400e-be58-f84e0512985e
License type - Computational Use of Data Agreement v1.0