
The Classical Language Toolkit

Project description


The Classical Language Toolkit (CLTK) is a Python library offering natural language processing (NLP) for pre-modern languages.

Installation

For the latest version of the CLTK:

$ pip install cltk

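For more information, see the installation docs. To install from source, see the development docs.

Pre-1.0 software is available on the branch v0.1.x, with docs at https://legacy.cltk.org. Install it with pip install "cltk<1.0".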
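
Because the 1.x and pre-1.0 lines are installed under the same package name, it can help to confirm which one an environment actually has. The following is a minimal sketch using only the standard library's importlib.metadata; the helper names major_version and describe_cltk_install are hypothetical and not part of the CLTK, and the parsing assumes a plain version string such as 1.3.0 (no epoch prefix).

```python
import re
from importlib import metadata


def major_version(version: str) -> int:
    """Return the leading numeric component of a version string like '1.3.0'.

    Assumption: the string starts with digits (no PEP 440 epoch such as '1!2.0').
    """
    match = re.match(r"\d+", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group())


def describe_cltk_install(dist_name: str = "cltk") -> str:
    """Report whether the installed distribution is on the legacy or current line."""
    try:
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return f"{dist_name} is not installed"
    if major_version(version) < 1:
        line = "legacy (pre-1.0)"
    else:
        line = "current (1.0 or later)"
    return f"{dist_name} {version}: {line}"


if __name__ == "__main__":
    print(describe_cltk_install())
```

Reading the version from installed package metadata avoids importing the library itself, so the check works even when models or optional dependencies are missing.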

Documentation

Documentation is available at https://docs.cltk.org.

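Citation

When using the CLTK, please cite the following publication, including the DOI:

Johnson, Kyle P., Patrick J. Burns, John Stewart, Todd Cook, Clément Besnier, and William J. B. Mattingly. "The Classical Language Toolkit: An NLP Framework for Pre-Modern Languages." In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, pp. 20-29. 2021. 10.18653/v1/2021.acl-demo.3

The complete BibTeX entry: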

@inproceedings{johnson-etal-2021-classical,
    title = "The {C}lassical {L}anguage {T}oolkit: {A}n {NLP} Framework for Pre-Modern Languages",
    author = "Johnson, Kyle P.  and
      Burns, Patrick J.  and
      Stewart, John  and
      Cook, Todd  and
      Besnier, Cl{\'e}ment  and
      Mattingly, William J. B.",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.acl-demo.3",
    doi = "10.18653/v1/2021.acl-demo.3",
    pages = "20--29",
    abstract = "This paper announces version 1.0 of the Classical Language Toolkit (CLTK), an NLP framework for pre-modern languages. The vast majority of NLP, its algorithms and software, is created with assumptions particular to living languages, thus neglecting certain important characteristics of largely non-spoken historical languages. Further, scholars of pre-modern languages often have different goals than those of living-language researchers. To fill this void, the CLTK adapts ideas from several leading NLP frameworks to create a novel software architecture that satisfies the unique needs of pre-modern languages and their researchers. Its centerpiece is a modular processing pipeline that balances the competing demands of algorithmic diversity with pre-configured defaults. The CLTK currently provides pipelines, including models, for almost 20 languages.",
}

License

Copyright (c) 2014-2024 Kyle P. Johnson, under the MIT License.


Download files

Source distribution: cltk-1.3.0.tar.gz (625.3 kB)

Built distribution: cltk-1.3.0-py3-none-any.whl (695.7 kB, Python 3)
