Dask + Mongo integration

Project description

Dask-Mongo

Read and write data to MongoDB with Dask

Installation

dask-mongo can be installed with pip:

pip install dask-mongo

or with conda:

conda install -c conda-forge dask-mongo
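
To confirm that the installation succeeded, the installed version can be queried from Python. The following is a minimal sketch using only the standard library; the helper name `installed_version` is hypothetical and not part of dask-mongo.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper for illustration; it only consults package metadata
    and does not import the package itself.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("dask-mongo")
if version is None:
    print("dask-mongo is not installed in this environment")
else:
    print(f"dask-mongo {version} is installed")
```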

Example

import dask.bag as db
import dask_mongo

# Create Dask Bag
records = [
    {"name": "Alice", "fruit": "apricots"},
    {"name": "Bob", "fruit": ["apricots", "cherries"]},
    {"name": "John", "age": 17, "sports": "cycling"},
]

b = db.from_sequence(records)

# Write to a Mongo database
dask_mongo.to_mongo(
    b,
    database="your_database",
    collection="your_collection",
    connection_kwargs={"host": "localhost", "port": 27017},
)

# Read Dask Bag from Mongo database
b = dask_mongo.read_mongo(
    database="your_database",
    collection="your_collection",
    connection_kwargs={"host": "localhost", "port": 27017},
    chunksize=2,
)

# Perform normal operations with Dask
names = b.pluck("name").compute()
assert names == ["Alice", "Bob", "John"]
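
The `chunksize=2` argument in the example above controls how many documents go into each partition of the resulting Dask Bag. The sketch below illustrates the idea in plain Python, with no MongoDB or Dask required: three records with a chunk size of two yield two partitions. The function `split_into_chunks` is a hypothetical illustration, and it assumes simple sequential slicing; the actual reader may choose partition boundaries differently.

```python
from math import ceil
from typing import Any, Dict, List


def split_into_chunks(
    docs: List[Dict[str, Any]], chunksize: int
) -> List[List[Dict[str, Any]]]:
    """Split a list of documents into consecutive chunks of at most `chunksize`.

    Illustration only: this is an assumed, simplified model of partitioning,
    not dask-mongo's implementation.
    """
    if chunksize < 1:
        raise ValueError("chunksize must be a positive integer")
    n_chunks = ceil(len(docs) / chunksize)
    return [docs[i * chunksize:(i + 1) * chunksize] for i in range(n_chunks)]


records = [
    {"name": "Alice", "fruit": "apricots"},
    {"name": "Bob", "fruit": ["apricots", "cherries"]},
    {"name": "John", "age": 17, "sports": "cycling"},
]

partitions = split_into_chunks(records, chunksize=2)

# Flattening the partitions in order recovers every record exactly once.
names = [doc["name"] for part in partitions for doc in part]
print(len(partitions), names)
```

A smaller chunk size gives more, smaller partitions (more parallelism, more per-task overhead); a larger one gives fewer, bigger partitions.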

License

BSD-3

Download files

Download the file for your platform.

Source distribution

dask-mongo-2022.5.0.tar.gz (5.7 kB)

Built distribution

dask_mongo-2022.5.0-py3-none-any.whl (6.1 kB, Python 3)
