问题描述
我正在查看文本中的大字符串。在文中很多次都有。或者特别是? ?用作分隔符。
那么我想从字符串“送货单”中得到什么?患者在家。送货上门'
是在家送货但不是送货单?病人在家,这是我目前收到的。 否则下面的代码运行良好。
ADDED:鉴于下面的评论,这里使用正则表达式是因为实际数据源中有许多排列,例如在家交付、在救护车中交付、在车辆中交付或在救护车中出生、出生在家里等等。
任何建议表示赞赏-
with txt as (select 1 as ID,'Delivery Note ?Patient was at home. Delivered at home' as note_text from dual)
select txt.*,regexp_substr(NOTE_TEXT,'(born|birth|sv(b|d)|deliver(ed|y|ing)|vbac)[^.|?|,|(|)|;|:|-].{1,24}(vehicle|bathroom|ambulance|home|[^a-z]car([^a-z|]|$))',1,'i')
as results
from txt;
解决方法
看起来这就是你需要的:
with txt as (select 1 as ID,'Delivery Note ?Patient was at home. Delivered at home' as note_text from dual)
select txt.*,regexp_substr(NOTE_TEXT,'(born|birth|sv(b|d)|deliver(ed|y|ing)|vbac)[^.?,)(;:-]{1,24}(vehicle|bathroom|ambulance|home|[^a-z]car([^a-z|]|$))',1,'i')
as results
from txt;
如您所见,我删除了 [^...] 中的 |
,因为您只需要指定不带 or
的排除字符,并删除其后的 .
。
结果:
ID NOTE_TEXT RESULTS
---------- ----------------------------------------------------- --------------------
1 Delivery Note ?Patient was at home. Delivered at home Delivered at home