在 python 中使用 Fuzzymatcher 时如何确定截止或阈值

问题描述

请在照片上提供帮助也是我的输出代码的屏幕截图,我如何使用 best_match_score 我需要按返回的“精确分数”过滤该列仅在合并后出现(即只需返回 'best_match_score' 低于 -1.06 的所有内容

import fuzzymatcher
import pandas as pd
import os

# pd.set_option('display.max_rows',None)
pd.set_option('display.max_columns',None)
pd.set_option('display.width',None)

REDCAP = pd.read_csv(r"C:\Users\Selamola\Desktop\PythonThings\FuzzyMatching\REDCAP Form A v1 and v2 23 Feb 211.csv")
covidSheet = pd.read_csv(r"C:\Users\Selamola\Desktop\PythonThings\FuzzyMatching\Cases missing REC ID 23 Feb 211.csv")

Data_merge = fuzzymatcher.fuzzy_left_join(covidSheet,REDCAP,left_on=['Participant Name','Particfipant Surname','Screening Date','Hospital Number','Alternative Hospital Number'],right_on=['Patient Name','Patient Surname','Date Of Admission','Date Of Sample Collection','Hospital Number'])

# Merged_data = pd.merge(REDCAP,covidSheet,how='left',#                        left_on=['Patient Name','Patient Surname'],#                        right_on=['Participant Name','Particfipant Surname'])

# Data_merge.to_csv(r'C:\Users\Selamola\Desktop\PythonThings\FuzzyMatching\DataMacth.csv')

print(Data_merge)

Image of WorkSpace

解决方法

这看起来很简单,除非我遗漏了什么。请务必尝试阅读有关 slicing data in pandas 的文档。

Property validation failure: [Encountered unsupported properties in {/ContainerProperties/Volumes/0}: [EfsVolumeConfiguration]]