Skip to main content
2025 Python Packaging Survey is now live!  Take the survey now

Polars plugin for pair-wise distance functions

Project description

Hellooo :)

This plugin is a work-in progress, main goal is to get pairwise distance metrics on numerical vectors (list, arrays) and string distance metrics.

Examples

import polars
import polars_distance as pld

df = pl.DataFrame({
    "foo":"hello",
    "bar":"hella world"
})

df.select(
    pld.col("foo").pdist_str.hamming('bar').alias('dist')
)
┌──────┐
 dist 
 ---  
 u32  
╞══════╡
 1    
└──────┘


df.select(
    pld.col('foo').pdist_str.levenshtein('bar').alias('dist')
)
┌──────┐
 dist 
 ---  
 u32  
╞══════╡
 6    
└──────┘

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page