Skip to content

Commit

Permalink
add novel model
Browse files Browse the repository at this point in the history
  • Loading branch information
lianrzh committed Aug 10, 2017
1 parent 40b25f0 commit 63d4715
Showing 1 changed file with 12 additions and 0 deletions.
12 changes: 12 additions & 0 deletions model/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,3 +25,15 @@
* webpage_slda.model # Sentence-LDA模型参数, 存储格式为libSVM格式
* webpage_twe_lda.model # TWE模型参数, 二进制存储, 包括词向量与主题向量
* vocab_info.txt # 词表文件, 中文编码采用UTF-8

### 小说主题模型

使用百度万级别小说数据训练,词表规模243617,LDA模型主题数为500,SentenceLDA(SLDA)模型主题数为500。
模型包括LDA、SentenceLDA(SLDA), Topical Word Embedding (TWE), 其中模型均存放于model/novel目录中

* lda.conf # LDA模型inference所需配置文件
* slda.conf # Sentence-LDA模型inference所需配置文件
* novel_lda.model # LDA模型参数, 存储格式为libSVM格式
* novel_slda.model # Sentence-LDA模型参数, 存储格式为libSVM格式
* novel_twe_lda.model # TWE模型参数, 二进制存储, 包括词向量与主题向量
* vocab_info.txt # 词表文件, 中文编码采用UTF-8

0 comments on commit 63d4715

Please sign in to comment.