Testing Typical Machine Learning Algorithms for Text Classification

annatelegina/Text_Classification

Text_Classification

Testing typical machine learning algorithms on a dataset of Russian texts for sentiment (tonality) analysis.

To run:

  1. Clone this repo.
  2. Unzip the dataset (data.zip) into the root folder.
  3. Run from the command line: python3 classifier.py --model [CHOOSE MODEL] --max_features [0..27000] --stop_words [None, Russian] --vertorization [TYPE]
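
For orientation, here is a minimal sketch of how a script could parse the flags listed in step 3. It is not taken from classifier.py, which may handle its arguments differently. The flag names and the 0..27000 range come from the command above; the helper `bounded_int`, the defaults, and the sample values `example_model` and `TF-IDF` are assumptions made only for illustration.

```python
import argparse


def bounded_int(text):
    """Parse an integer and check it against the 0..27000 range from the README."""
    value = int(text)
    if not 0 <= value <= 27000:
        raise argparse.ArgumentTypeError("max_features must be in 0..27000")
    return value


def build_parser():
    """Build a parser for the flags shown in the run command (defaults are assumed)."""
    parser = argparse.ArgumentParser(description="Text classification experiments")
    parser.add_argument("--model", required=True, help="name of the model to test")
    parser.add_argument("--max_features", type=bounded_int, default=27000,
                        help="vocabulary size limit, 0..27000")
    parser.add_argument("--stop_words", choices=["None", "Russian"], default="None",
                        help="stop-word list to apply")
    # The flag is spelled exactly as in the README command.
    parser.add_argument("--vertorization", help="type of word vectorization")
    return parser


# Parse an explicit argument list so the sketch runs without a real command line.
# "example_model" and "TF-IDF" are placeholder values, not confirmed options.
args = build_parser().parse_args([
    "--model", "example_model",
    "--max_features", "5000",
    "--stop_words", "Russian",
    "--vertorization", "TF-IDF",
])
print(args.model, args.max_features, args.stop_words, args.vertorization)
```
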

Types of word vectorization:

  1. Word frequency
  2. TF-IDF
  3. Boolean vector
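
The three vectorization types can be illustrated with a small pure-Python sketch. It is not the repository's implementation, which may rely on a library and different tokenization. The function names, the whitespace tokenizer, the toy English documents, and the smoothed IDF formula `ln((1 + n) / (1 + df)) + 1` (one of several common definitions) are all assumptions chosen for the example.

```python
import math
from collections import Counter


def tokenize(doc):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return doc.lower().split()


def build_vocabulary(docs, max_features=None):
    """Keep the most frequent tokens, then return them in alphabetical order."""
    counts = Counter(token for doc in docs for token in tokenize(doc))
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    if max_features:
        ranked = ranked[:max_features]
    return sorted(token for token, _ in ranked)


def count_vector(doc, vocab):
    """Word frequency: how many times each vocabulary token occurs in the document."""
    counts = Counter(tokenize(doc))
    return [counts[token] for token in vocab]


def bool_vector(doc, vocab):
    """Boolean vector: 1 if the token occurs in the document, else 0."""
    present = set(tokenize(doc))
    return [1 if token in present else 0 for token in vocab]


def tfidf_vectors(docs, vocab):
    """TF-IDF: raw term count times a smoothed inverse document frequency."""
    n_docs = len(docs)
    doc_sets = [set(tokenize(doc)) for doc in docs]
    idf = {}
    for token in vocab:
        doc_freq = sum(token in tokens for tokens in doc_sets)
        idf[token] = math.log((1 + n_docs) / (1 + doc_freq)) + 1
    return [
        [count * idf[token] for count, token in zip(count_vector(doc, vocab), vocab)]
        for doc in docs
    ]


# Toy corpus, invented for the example.
docs = ["good film", "bad film", "good good actor"]
vocab = build_vocabulary(docs)
counts = count_vector(docs[2], vocab)
flags = bool_vector(docs[2], vocab)
tfidf = tfidf_vectors(docs, vocab)
print(vocab)
print(counts)
print(flags)
print(tfidf[2])
```

The word-frequency and Boolean vectors differ only when a token repeats within a document, while TF-IDF additionally down-weights tokens that appear in many documents.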
