Library for CJK (chinese, japanese, korean) language data.

These details have not been verified by PyPI

Project links

Project description

cihai ·

Python library for CJK (chinese, japanese, korean) data.

This project is under active development. Follow our progress and check back for updates!

Quickstart

API / Library (this repository)

$ pip install --user cihai

from cihai.core import Cihai

c = Cihai()

if not c.unihan.is_bootstrapped:  # download and install Unihan to db
    c.unihan.bootstrap(unihan_options)

query = c.unihan.lookup_char('好')
glyph = query.first()
print("lookup for 好: %s" % glyph.kDefinition)
# lookup for 好: good, excellent, fine; well

query = c.unihan.reverse_char('good')
print('matches for "good": %s ' % ', '.join([glph.char for glph in query]))
# matches for "good": 㑘, 㑤, 㓛, 㘬, 㙉, 㚃, 㚒, 㚥, 㛦, 㜴, 㜺, 㝖, 㤛, 㦝, ...

See API documentation and /examples.

CLI (cihai-cli)

$ pip install --user cihai-cli

Character lookup:

$ cihai info 好

char: 好
kCantonese: hou2 hou3
kDefinition: good, excellent, fine; well
kHangul: 호
kJapaneseOn: KOU
kKorean: HO
kMandarin: hǎo
kTang: "*xɑ̀u *xɑ̌u"
kTotalStrokes: "6"
kVietnamese: háo
ucn: U+597D

Reverse lookup:

$ cihai reverse library

char: 圕
kCangjie: WLGA
kCantonese: syu1
kCihaiT: '308.302'
kDefinition: library
kMandarin: tú
kTotalStrokes: '13'
ucn: U+5715
--------

UNIHAN data

All datasets that cihai uses have stand-alone tools to export their data. No library required.

unihan-etl - UNIHAN data exports for csv, yaml and json.

Developing

$ git clone https://github.com/cihai/cihai.git`

$ cd cihai/

Bootstrap your environment and learn more about contributing. We use the same conventions / tools across all cihai projects: pytest, sphinx, flake8, mypy, black, isort, tmuxp, and file watcher helpers (e.g. entr(1)).

Algorithm	Hash digest
SHA256	`2c572d94ce05d7dddc0329abad58771df7fb6518717692df1ba5000708591a07`
MD5	`1584ffc4c222e9a28dc517b6cf716535`
BLAKE2b-256	`b1cab356f00c4b562e905ae1d0358aa0e3f11e34b2d0b39872381e800738b7f7`

Algorithm	Hash digest
SHA256	`c74edae7e6e5e8b095a8de85e591952d30e400afb48e0caeeb2001ff37886b22`
MD5	`d42795c81f723b07bf524fb2b57c4c0b`
BLAKE2b-256	`0ff778114cf516939a95c1fe2944f58ededf7c1903480fedbfe394d41b66f71b`

cihai 0.17.1a0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

cihai ·

Quickstart

API / Library (this repository)

CLI (cihai-cli)

UNIHAN data

Developing

Quick links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes