问题描述
我正在尝试获取仅包含合并症字符串和无类别的新列。这是在R中完成的,带有tidyverse偏好。您会看到,其中两行具有我不感兴趣的奇数字符串。 这是我拥有的数据类型。
structure(list(id = c("1","2","3","4","5","6","7","9","8","10","11","12","13","14","15","16","17","18","19","20"),health_care_worker = c("No","No","Yes","No"),how_unwell = c(1,6,1,1),health_cnd = c("None",NA,"Diabetes Type 2,No,Yes,4,Showing Symptoms But Not Tested,Mild,Spanish,No 3bad24c8-0ac9-4269-aa53-5e8d41b03142,35,Female,Rio de Janeiro","High Blood Pressure (hypertension),Self-Isolating With No Symptoms,None,Portuguese,Yes 2656b3f2-d916-43e1-96b2-1d371d8c7b12,58,Belém/ Pará",15,Moderate,No 41cf840a-cfcc-441f-a995-f6b75ecee967,22,Male,Agb,India,2020-08-04 05:25:00,N",NA),health_1 = c("None","None",Asthma,Obesity,Lung-condition,"None")),row.names = c(NA,-20L),class = c("tbl_df","tbl","data.frame"))
这就是我想要新专栏的方式。
structure(list(id = c("1","None"),copy_health_column = c("None",Asthma",Obesity",Lung-condition","data.frame"))
现在,我的原始数据有10万个数据点。因此,我希望我能找到适用于更大数据集的解决方案。
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)