Last updated:
0 purchases
polarsdistance 0.4.3
Hellooo :)
This plugin is a work-in progress, main goal is to provide distance metrics on list, arrays and string datatypes.
The docs can be found here: https://ion-elgreco.github.io/polars-distance/
Examples
import polars
import polars_distance as pld
df = pl.DataFrame({
"foo":"hello",
"bar":"hella world"
})
df.select(
pld.col("foo").dist_str.hamming('bar').alias('dist')
)
┌──────┐
│ dist │
│ --- │
│ u32 │
╞══════╡
│ 7 │
└──────┘
df.select(
pld.col('foo').dist_str.levenshtein('bar').alias('dist')
)
┌──────┐
│ dist │
│ --- │
│ u32 │
╞══════╡
│ 6 │
└──────┘
df = pl.DataFrame(
{
"arr": [[1, 2, 10]],
"arr2": [[2, 5, 9]],
},
schema={
"arr": pl.Array(inner=pl.Float64, width=3),
"arr2": pl.Array(inner=pl.Float64, width=3),
},
)
df.select(pld.col('arr').dist_arr.euclidean('arr2').alias('dist'))
shape: (1, 1)
┌──────────┐
│ dist │
│ --- │
│ f64 │
╞══════════╡
│ 3.316625 │
└──────────┘
For personal and professional use. You cannot resell or redistribute these repositories in their original state.
There are no reviews.