text-classify

Simple tool to predict text classes with various models.

Project description

# TextClassify

## Model

* fastText char
* fastText word
* CNN char embedding
* CNN word embedding
* CNN char & word embedding
* CNN + BiGRU + char & word embedding

## Segment Model

* pyltp
* jieba

## Embedding

* fastText (CBOW / skip-gram)
* gensim

char or word embedding

## Usage

```python
from text_classify import TextClassify

# default params
t = TextClassify(
model='fasttext',
cut=False,
cut_model='pyltp',
fasttext_char_model = '/data_hdd/embedding/fasttext/zhihu_char_model.bin', # default path
...
)

text = ''
logtis = t.predict(text)

# get index2label
t.index2label

# get top label
t.get_top_label(text, k=5)
```

* model: 'fasttext' (default), 'cnn', 'mcnn', 'mgcnn'
* cut: True, False (default)
* cut_model: 'pyltp' (default), 'jieba'
* everything in config

Algorithm	Hash digest
SHA256	`7d0f7677e9f3edb04954e26d5c7c6c2b749a21b7fd9bbeabd56729389a46e4ec`
MD5	`1c0a3f3800e95f2e52767be478e7d703`
BLAKE2b-256	`49f8fb2b47e753dba344f071c7f4c39a56067946c2fc66876985cd18043f9076`

text-classify 0.0.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes