问题描述
我尝试执行以下命令来分析在文件夹中找到的数据,但是,它将它们保存在我执行该行所在的文件夹中,如何将结果保存在该文件夹中文件位于?
sed -n 14,71p *_fastqc/*fastqc.txt | awk '{print $1" "$2}'| awk '{ sum += $2; n++ } END { if (n > 0) print sum / n; }' > QC-report.txt
解决方法
循环是要走的路
import numpy as np
import pandas as pd
from pandas import DataFrame
import re
data = [['abc123','abc23','123'],[np.nan,'123',np.nan],['bc123','qwe','abc'],['ABC@*&123',np.nan,'abc']]
df=pd.DataFrame(data,columns=['a','b','c'])
print(df)
lst1 =[['a','b'],['c']]
lst2 =[['dropnulls','regex']]
functions = {
'dropnulls': lambda x: df[x].dropna(),'regex': lambda x: df[df[x].str.contains(r'^\d+$',na=True)]
}
for index,internal_list in enumerate(lst1):
for col in internal_list:
operation = lst2[0][index]
function = functions[operation]
print(function(col))