Bangla NLP dataset. Bangla NER,POStag, text summarization, stopword, translate, sentiment analysis, wiki articles, root word, dataset etc.

Foysal87/Bangla-NLP-Dataset

Bangla-NLP-Dataset

Bangla NLP dataset. This repository contains the sbnltk datasets, which were used in the Bangla NLP toolkit sbnltk. Existing datasets from other authors are also listed here.

OUR DATASET IS STORED WITH GIT LFS, SO YOU HAVE TO CLONE THE REPOSITORY TO GET THE DATA!

WE WILL SOON UPLOAD ALL DEEP LEARNING BASED DATASETS!
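
Because the data files are stored with Git LFS, a copy obtained without LFS support may contain small pointer files instead of the actual data. The sketch below is one way to check for that. It relies only on the fact that a Git LFS pointer file begins with the line `version https://git-lfs.github.com/spec/v1`. The function names `is_lfs_pointer` and `find_lfs_pointers` are made up for this example and are not part of sbnltk or this repository.

```python
from pathlib import Path

# First line of every Git LFS pointer file, per the Git LFS pointer format.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if the file at `path` looks like a Git LFS pointer, not real data."""
    with open(path, "rb") as f:
        head = f.read(len(LFS_POINTER_PREFIX))
    return head == LFS_POINTER_PREFIX


def find_lfs_pointers(root):
    """List files under `root` that are still LFS pointers (the .git directory is skipped)."""
    root = Path(root)
    return sorted(
        p
        for p in root.rglob("*")
        if p.is_file()
        and ".git" not in p.relative_to(root).parts
        and is_lfs_pointer(p)
    )
```

If `find_lfs_pointers("Bangla-NLP-Dataset")` returns a non-empty list for your local clone, the data has not been downloaded yet. Installing Git LFS and running `git lfs pull` inside the clone should replace the pointers with the real files.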

sbnltk Dataset List (DUMP & HUMAN Evaluated) (sbnltk Dataset)

  • Bangla Number List drive
  • Bangla Root Word List drive
  • Bangla Word List (highest to lowest occurrence) drive
  • Bangla Wiki Dump Word drive
  • Bangla POS tag Static Dataset (single word) drive
  • Bangla NER Static Dataset (single word) drive
  • Bangla Stop Word List drive
  • Bangla Dump POS tag drive
  • Bangla Dump Question Classification Dataset drive
  • Bangla Dump Sentiment Analysis drive
  • Google Translation Dataset drive
  • NER Existing Dataset (modified + added Date entity) drive
  • News Article Dataset drive
  • POS tag Converted Data drive
  • POS tag Human Evaluated Data drive
  • DUMP NER Data (both active and passive) drive
  • DUMP NER Data (active only) drive
  • Extractive Text Summarization github
  • Abstractive Text Summarization (newspaper) drive kaggle
  • News Article Classification (text classification) drive kaggle
  • Topic Keywords Classification (keywords generator) drive kaggle

Paper

  • Text Summarization paper cite

EXISTING DATASET

I am not the owner of the following datasets. This is just a collection to help you find amazing people and their work. Please give them your support! It will encourage them to do more amazing things.

AWESOME DATASET SOURCES

NEWS ARTICLES AND DOCUMENTS

SPEECH TO TEXT / TEXT TO SPEECH

SENTIMENT ANALYSIS / SENTENCE CLASSIFICATION

BANGLA MACHINE TRANSLATION DATASET

BANGLA POSTAG DATASET

BANGLA NER DATASET

QUESTION ANSWERING DATASET

BANGLA TEXT SUMMARIZATION

BANGLA FAKE NEWS DETECTION

MISC

Motivation

Usage and Contribute
