Mr. Uthayasanker Thayasivam

Dataset

Tamil and Sinhala Speech Intent Dataset

Tamil News Classification - Text Classification

Tamil Movie Review - Text Classification

English to Sinhala Neural Machine Translation

Information Extraction from printed Grocery Receipts

End to end speaker model for text-independent speaker identification

Anuvaad Indian Language corpus links

Conversational AI Speech Dialogue System for Sinhala Language

Thirukkural - Text Classification, Semantic Analysis

Speech and audio

Microsoft Speech Corpus (Indian languages)

Data from: EnTam: An English-Tamil Parallel Corpus (EnTam v2.0)

Automating web table column annotation using supervised learning

Tamil-Emotion-Analysis - Sentiment Analysis

FlowChroma - Deep Learning Based Automated Video Colorization

Sinhala Emotion Analysis - Sentiment Analysis

Tamil language Corpus of Wikipedia articles

Sinhala news corpus - Text classification

SinMin news - Text classification

Polysemy Embedding

Sinhalese multi-speaker TTS corpus

Multi-SIM User Classification using Call Detail Records

Landslide Prediction System

Large Sinhala ASR training data set

Domain Specific, Intent Classification For Sinhala Speech Data

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Translation Initiative for COVID-19 - Text Classification (Tamil)

Data Driven Instruction Strategies for Sri Lankan Schools

Translation Initiative for COVID-19 - Text Classification (Sinhala)

Dakshina Dataset - Text Classification

Tamil News Classification Dataset (Tamilmurasu) - Text classification

Troll Classification of Tamil Memes

Tamil Wikipedia Articles

Crowdsourced high-quality Tamil [ta-in] multi-speaker speech dataset

Crowdsourced Sinhala [si-lk] ASR dataset

Crowdsourced high-quality Sinhala [si-lk] multi-speaker speech dataset

AI4Bharat-IndicNLP Dataset

Ponniyin Selvan Tamil Book for NLP

IndicCorp Dataset

Tamil News dataset

EmoTa: A Tamil Emotional Speech Dataset

SiTa - Sinhala and Tamil Speaker Diarization Dataset in the Wild

Party Extraction from Legal Contracts Dataset

Contact Information

Luxshan Thavarasa

Department: Computer Science & Engineering

Email: luxshan.20[at]cse.mrt.ac.lk

LinkedIn Profile


Charangan Vasantharajan

Department: Computer Science & Engineering

Email: charangan.18[at]cse.mrt.ac.lk

LinkedIn Profile

Research Network

Part of the Aaivu Research Initiative

  • Aaivu Research Hub
  • Active Projects
  • Research Ethics

Collaboration Opportunities

Open to academic partnerships and research collaborations in AI and machine learning.

© Copyright . All Rights Reserved | Academic Research Portfolio