合并熊猫数据框即：订单和库存，然后重新调整库存

问题描述

我有一个带有单独库存系统（Google表格）的订单列表。我正在尝试使用Pandas合并两者，以获得有效的“选择列表”，并且取得了一定的成功。但是，在测试中（向一个订单添加多个数量，具有相同物料/ SKU类型的多个订单），它开始崩溃。

orders = "orderNumber,SKU,Quantity\r\n11111,Greensneakers,2\r\n11111,brown_Handbag,1\r\n22222,1\r\n33333,Blue_Handbag,1"
str_orders = StringIO(orders,newline='\n')
df_orders = pd.read_csv(str_orders,sep=",")

inventory = "SKU,Location\r\nGreensneakers,DA13A\r\nGreensneakers,DA13A\r\nRed_Handbag,DA12A\r\nGreensneakers,DB34C\r\nGreensneakers,DB33C\r\n"
str_inventory = StringIO(inventory,newline='\n')
df_inventory = pd.read_csv(str_inventory,")
df_inventory = df_inventory.sort_values(by='Location',ascending=False)

df_pList = df_orders.merge(df_inventory.drop_duplicates(subset=['SKU']),on='SKU',how='left')
print(df_pList)

伪想要的输出：

'
orderNumber,Quantity,Location
11111,1,DB34C
11111,DB33C
11111,NA
22222,DA13A
33333,NA

'

合并是否可以解决此类问题？尽可能避免循环播放。

解决方法

下面制作了三个数据框。

df_pickList是你要做的

copy_inventory包含了如果您选择所有内容（如果您只想写出DataFrame来覆盖库存文件）将是什么库存，您可以选择不进行复制并使用df_inventory直接，但特别是在beta版本中，它便于复制以进行操作。

df_outOfStock是一个方便的存储桶，用于收集库存中没有的东西。对当前订单进行交叉检查，以了解需要获得的订单

from io import StringIO
import pandas as pd
import copy

orders = """orderNumber,SKU,Quantity
11111,GreenSneakers,2
11111,Brown_Handbag,1
22222,1
33333,Blue_Handbag,1
"""
str_orders = StringIO(orders,newline='\n')
df_orders = pd.read_csv(str_orders,sep=",")

inventory = """SKU,Location
GreenSneakers,DA13A
GreenSneakers,DA13A
Red_Handbag,DA12A
GreenSneakers,DB34C
GreenSneakers,DB33C
"""
str_inventory = StringIO(inventory,newline='\n')
df_inventory = pd.read_csv(str_inventory,")
df_inventory = df_inventory.sort_values(by='Location',ascending=False)

df_outOfStock = pd.DataFrame() #placeholder to report a lack of stock
df_pickList = pd.DataFrame() #placeholder to make pick list
copy_inventory = copy.deepcopy(df_inventory) #make a copy of inventory to decimate

for orderIndex,orderLineItem in df_orders.iterrows():
    for repeat in range(orderLineItem["Quantity"]): #since inventory location is 1 row per item,we need to do that many picks per order line item
        availableInventory = copy_inventory.loc[copy_inventory.loc[:,"SKU"] == orderLineItem["SKU"],:]
        if len(availableInventory) == 0:
            #Failed to find any SKU to pull
            df_outOfStock = df_outOfStock.append(orderLineItem,ignore_index=True)
        else:
            pickRow = {"orderNumber": orderLineItem["orderNumber"],"SKU": orderLineItem["SKU"],"Quantity": 1,"Location": availableInventory.iloc[0]["Location"]}
            df_pickList = df_pickList.append(pickRow,ignore_index=True)
            copy_inventory.drop(index = availableInventory.index[0],inplace=True)

感谢，与处理非整数数量（即角铁的英尺）相比，这是一个有趣的小练习

（下面的原始错误答案）

我建议将行连接到一个表中（不要合并和/或覆盖值），然后使用group by允许值的聚合。

作为入门，我将从将您的数据放在一起的这两个链接开始：

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.concat.html#pandas.concat

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.groupby.html?highlight=groupby#pandas.DataFrame.groupby

pandas pandas python-3.x stringio stringio

合并熊猫数据框即：订单和库存，然后重新调整库存

问题描述

解决方法

相关问答