问题描述
我正在尝试使用以下代码块将gower_matrix
包中的gower
函数应用于字典的值:
import gower
import pandas as pd
from itertools import chain,combinations
from pydataset import data
from toolz.dicttoolz import valmap
cars = data('mtcars')
vnames=cars.columns
def powerset(iterable):
"powerset([1,2,3]) --> () (1,) (2,) (3,) (1,2) (1,3) (2,3) (1,3)"
s = list(iterable)
return chain.from_iterable(combinations(s,r) for r in range(1,len(s)+1))
combos=list(powerset(vnames))
combos=list(map(list,list(powerset(vnames))))
combo_dicts = {}
keys = range(len(combos))
for i in keys:
combo_dicts[i] = cars[combos[i]]
gower_dicts = valmap(gower.gower_matrix,combo_dicts)
但是我收到以下错误
TypeError: ufunc 'true_divide' output (typecode 'd')
Could not be coerced to provided output parameter (typecode 'q')
according to the casting rule ''same_kind''
将其应用于特定的词典项目有效
gower.gower_matrix(combo_dicts[100])
array([[0.,0.02173357,0.19395797,...,0.12646227,0.35655078,0.11454861],[0.02173357,0.,0.21569154,0.12262693,0.3348172,0.10900868],[0.19395797,0.32042024,0.55050874,0.10668287],[0.12646227,0.23008852,0.21544196],[0.35655078,0.44382587],[0.11454861,0.10900868,0.10668287,0.21544196,0.44382587,0. ]],dtype=float32)
对这个问题有任何想法吗?
解决方法
基于对ufunc 'true_divide' output
的网络搜索,appears尝试将整数值数组除以by时发生错误(不是Numpy错误,但行为在几年前发生了更改)。浮点值。传递浮点值似乎是gower
包的未指定要求。因此,请首先转换cars
数据。我的猜测是,您有一些包含浮点值的列,而另一些包含整数。 combo_dicts
的测试元素可以正常工作,因为它恰好是仅由浮点列生成的。