Skip to main content

Provides a set of APIs to consume Azure Open Datasets.

Project description

This package provides a set of APIs to consume Azure Open Datasets.

In general, it allows users to turn the open datasets into both SPARK and Pandas dataframe, with filters that are commonly applied to each specific dataset.

For some of the open datasets, it provides enricher capability to join with other data. For example, you can join your data with weather data by lat/long/zipcode + time quite easily.


This package also contains open datasets with the following third-party notices.

[1]: ftp://ftp.ncdc.noaa.gov/pub/data/noaa/readme.txt [2]: https://www.ncei.noaa.gov/thredds/catalog/gfs-004-files/catalog.html?dataset=gfs-g4-files [3]: http://opendefinition.org/licenses/odc-pddl/ [4]: https://data.seattle.gov/stories/s/Data-Policy/6ukr-wvup/ [5]: https://datasf.org/opendata/terms-of-use/ [6]: https://www.chicago.gov/city/en/narr/foia/data_disclaimer.html [7]: https://www1.nyc.gov/home/terms-of-use.page

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page