Project Anuvaad has been conceptualised to provide translation capabilities for Indic languages. Project Anuvaad is open sourced under the MIT license and is funded by EkStep foundation.
This repository contains parallel language corpus links for popular Indian languages developed as part of the Anuvaad project. These datasets are cleaned, quality assured and released under MIT-license.
License - CC BY 4.0
Authors - Project Anuvaad
Language - Tamil
Reference- https://github.com/project-anuvaad/anuvaad-ocr-corpus#tamil