Mr.Uthayasanker Thayasivam

  • Home
  • Biography
  • Teaching
  • Awards & Grants
  • Publications
  • Research
    • Projects
    • Students
      • Undergraduate
      • Postgraduate
    • Conferences
  • News
  • Resources
    • Dataset
      • FYP
      • Public
    • FYP Tips

Dataset

  1. Resources
  2. Dataset

Tamil and Sinhala Speech Intent Dataset

Tamil News Classification- Text Classification

Tamil Movie Review - Text Classification

English to Sinhala Neural Machine Translation

Information Extraction from printed Grocery Receipts

End to end speaker model for text-independent speaker identification

Anuvaad Indian Language corpus links

Conversational AI Speech Dialogue System for Sinhala Language

Thirukkural - Text Classification,Semantic analysis

Speech and audio

Microsoft Speech Corpus (Indian languages)

Data from: EnTam: An English-Tamil Parallel Corpus (EnTam v2.0)

Automating web table column annotation using supervised learning

Tamil-Emotion-Analysis - Sentimental analysis

FlowChroma - Deep Learning Based Automated Video Colorization

Sinhala Emotion analysis -Sentimental analysis

Tamil language Corpus of Wikipedia articles

Sinhala news corpus- Text classification

SinMin news - Text classification

Polysemy Embedding

Sinhalese multi-speaker TTS corporate

Multi-SIM User Classification using Call Detail Records

Landslide Prediction System

Large Sinhala ASR training data set

Domain Specific, Intent Classification For Sinhala Speech Data

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Translation Initiative for COVID-19 - Text Classification(Tamil)

Data Driven Instruction Strategies for Sri Lankan Schools

Translation Initiative for COVID-19- Text Classification(Sinhala)

Dakshina Dataset - Text Classification

Tamil News Classification Dataset (Tamilmurasu)-Text classification

Troll Classification of Tamil Memes

Tamil Wikipedia Articles

Crowdsourced high-quality Tamil [te-in] multi-speaker speech dataset

Crowdsourced Sinhala [si-lk] ASR dataset

Crowdsourced high-quality Sinhala [si-lk] multi-speaker speech dataset

AI4Bharat-IndicNLP Dataset

Ponniyan selvan Tamil Book for NLP

IndicCorp Dataset

Tamil News dataset

gfdsg

EmoTa: A Tamil Emotional Speech Dataset

SiTa - Sinhala and Tamil Speaker Diarization Dataset in the Wild

Party Extraction from Legal Contracts Dataset

Search

Recent Posts

LanBix2021 "Translational Bioinformatics in Precision Medicine"

Friday 19th of March 2021 02:57:13 PM

Workshop on "Word Embedding: From Word2Vec to StartSpace"

Monday 7th of December 2020 01:11:48 AM

Webinar on Data Analytics, Cyber Security, and Disruptive Technology

Thursday 26th of November 2020 11:35:22 PM

Delivered a welcome address in "Data Drives"

Thursday 26th of November 2020 11:16:18 PM

Moratuwa Engineering Research Conference (MERCon) 2019

Thursday 26th of November 2020 10:27:07 PM

Quick Links

  • Undergraduates
  • Postgraduates
  • News
  • Dataset

Site Admins

Luxshan Thavarasa
Email: luxshan.20[at]cse.mrt.ac.lk


Charangan Vasantharajan
Email: charangan.18[at]cse.mrt.ac.lk


Our Links

  • Home
  • Biography
  • News
  • Dataset
  • Research Papers

Aaivu Links

  • Aaivu Home
  • Project Summary
  • Code of Conduct

Our Newsletter

We will send you our updates and publications through our newsletter.

Subscribed!
© Copyright . All Rights Reserved