Under the supervision of Dr. Uthayasanker Thayasivam, Luxshan made significant contributions to speech and language processing research for low-resource languages.
His Final Year Project focused on the Multilingual Universal Speech Emotion Recognition Model, leading to the creation of EmoTa - the first emotional speech dataset for Tamil. The dataset contains 936 utterances from 22 native Tamil speakers across 5 emotions (anger, happiness, sadness, fear, neutral), achieving F1-scores of 0.91 with XGBoost.
📄 Paper: CHiPSAL @ COLING 2025 | 💻 GitHub Repository
Developed an open-source tool for detecting abusive content in Tamil and Malayalam code-mixed text, achieving 0.78 F1 on Tamil and 0.70 F1 on Malayalam using transfer learning and multi-head attention mechanisms.
💻 DravidaKavacham GitHub | Accepted at DravidianLangTech @ NAACL 2025
Trained a universal speech emotion recognition model supporting 15+ languages using Wav2Vec2 and XLS-R architectures.