跳转到主要内容
Avatar for fgregg from gravatar.com
用户名    fgregg

44 个项目

usaddress

Last released

Parse US addresses using conditional random fields

python-crfsuite

Last released

Python binding for CRFsuite

parserator

Last released

Create parsers

census-area

Last released

Census data for arbitrary geographies

dedupe

Last released

A python library for accurate and scaleable data deduplication and entity-resolution

django-councilmatic

Last released

Core functions for councilmatic.org family

dedupe-variable-address

Last released

Address variable type for dedupe

dedupe-variable-datetime

Last released

DateTime variable type for dedupe

dedupe-variable-name

Last released

Name variable type for dedupe

parseratorvariable

Last released

Structured variable type for dedupe

pyhacrf-datamade

Last released

Hidden alignment conditional random field, a discriminative string edit distance

PyLBFGS

Last released

LBFGS and OWL-QN optimization algorithms

census

Last released

A wrapper for the US Census Bureau's API

datasette-datatable

Last released

Export Datasette records as a DataTable

opencivicdata

Last released

python opencivicdata library

probablepeople

Last released

Parse romanized names & companies using advanced NLP methods

kubra

Last released

command line tool for downloading utility outage data

govqa

Last released

Interact with GovQA, a public records request management platform owned by Granicus

pupa

Last released

scraping framework for muncipal data

chicagorequests

Last released

command line tool for downloading Chicago Open311 data

dedupe-Levenshtein-search

Last released

Search through documents for approximately matching strings. A fork of Matt Anderson's library for MIT licensing

scraper-legistar

Last released

Mixin classes for legistar scrapers

affinegap

Last released

A Cython implementation of the affine gap string distance

rlr

Last released

Case weighted L2 regularized logistic regression

DoubleMetaphone

Last released

Python wrapper for C++ Double Metaphone

dedupe-hcluster

Last released

Hierarchical Clustering Algorithms (Information Theory)

django-proxy-overrides

Last released

Overridable foreign key fields for Proxy models

nwss

Last released

A marshmallow schema for the National Wastewater Surveillance System

dedupe-variable-ilcs

Last released

Dedupe variable for Illinois Compiled Statute (ILCS) codes

ilcs-parser

Last released

Probabilistic parser for tagging data that references the Illinois Compiled Statutes (ILCS).

csvdedupe

Last released

Command line tools for deduplicating and merging csv files

dedupe-variable-number

Last released

Employer variable type for dedupe

django-councilmatic-notifications

Last released

Core functions for councilmatic.org family

datetime-distance

Last released

Compare string distances between dates, timestamps, or datetime objects.

simplecosine

Last released

Simple cosine distance

highered

Last released

Learnable Edit Distance Using PyHacrf

categorical-distance

Last released

Compare two categorical variables

dedupe-variable-person

Last released

Variable type for American Person Names

companyparser

Last released

UNKNOWN

probableparsing

Last released

Common methods for propbable parsers

dedupe-variable-employer

Last released

Employer variable type for dedupe

dedupe-variable-fuzzycategory

Last released

Fuzzy Categoy variable type for dedupe

fuzzycategory

Last released

A context comparison

canonicalize

Last released

canonicalize a cluster of records

由以下支持