c# – 解析邮件地址的正则表达式

我有一个地址类,使用正则表达式从地址的第一行解析房屋号码,街道名称和街道类型.这段代码通常运行良好,但我发布在这里与社区分享,看看有没有人有改进的建议.

注意:STREETTYPES和QUADRANT常量分别包含所有相关的街道类型和象限.

在这里添加一个子集:

private const string STREETTYPES = @"ALLEY|ALY|ANNEX|AX|ARCADE|ARC|AVENUE|AV|AVE|BAYOU|BYU|BEACH|...";

private const string QUADRANTS = "N|norTH|S|SOUTH|E|EAST|W|WEST|NE|norTHEAST|NW|norTHWEST|SE|SOUTHEAST|SW|SOUTHWEST";

HouseNumber,Quadrant,StreetName和StreetType都是该类的属性.

private void Parse(string line1)
    {
        HouseNumber = string.Empty;
        Quadrant = string.Empty;
        StreetName = string.Empty;
        StreetType = string.Empty;

        if (!String.IsNullOrEmpty(line1))
        {
            string noperiodsLine1 = String.copy(line1);
            noperiodsLine1 = noperiodsLine1.Replace(".","");

            string addressparseRegEx =
                @"(?ix)
            ^
            \s*
            (?:
               (?<housenumber>\d+)
               (?:(?:\s+|-)(?<quadrant>" +
                QUADRANTS +
                @"))?
               (?:(?:\s+|-)(?<streetname>\S+(?:\s+\S+)*?))??
               (?:(?:\s+|-)(?<quadrant>" +
                QUADRANTS + @"))?
               (?:(?:\s+|-)(?<streettype>" + STREETTYPES +
                @"))?
               (?:(?:\s+|-)(?<streettypequalifier>(?!(?:" +
                QUADRANTS +
                @"))(?:\d+|\S+)))?
               (?:(?:\s+|-)(?<streettypequadrant>(" +
                QUADRANTS + @")))??
               (?:(?:\s+|-)(?<suffix>(?:ste|suite|po\sBox|apt)\s*\S*))?
            |
               (?:(?:po|postoffice|post\s+office)\s+Box\s+(?<postofficeBox>\S+))
            )
            \s*
            $
            ";
            Match match = Regex.Match(noperiodsLine1,addressparseRegEx);
            if (match.Success)
            {
                HouseNumber = match.Groups["housenumber"].Value;
                Quadrant = (string.IsNullOrEmpty(match.Groups["quadrant"].Value)) ? match.Groups["streettypequadrant"].Value : match.Groups["quadrant"].Value;
                if (match.Groups["streetname"].Captures.Count > 1)
                {
                    foreach (Capture capture in match.Groups["streetname"].Captures)
                    {
                        StreetName += capture.Value + " ";
                    }
                    StreetName = StreetName.Trim();
                }
                else
                {
                    StreetName = (string.IsNullOrEmpty(match.Groups["streetname"].Value)) ? match.Groups["streettypequalifier"].Value : match.Groups["streetname"].Value;
                }
                StreetType = match.Groups["streettype"].Value;

                //if the matched street type is found
                //use the abbreviated version...especially for credit bureau calls
                string streetTypeAbbreviation;
                if (StreetTypes.TryGetValue(StreetType.toupper(),out streetTypeAbbreviation))
                {
                    StreetType = streetTypeAbbreviation;
                }
            }
        }

    }

解决方法

我不知道你在哪个国家,但是如果你在美国,想要花费一些钱来进行地址验证,你可以购买相关的usps产品 here.而 here是找到免费的单词列表的好地方usps用于预期的单词和缩写.我相信其他国家可以使用类似的网页.

相关文章

在要实现单例模式的类当中添加如下代码:实例化的时候:frmC...
1、如果制作圆角窗体,窗体先继承DOTNETBAR的:public parti...
根据网上资料,自己很粗略的实现了一个winform搜索提示,但是...
近期在做DSOFramer这个控件,打算自己弄一个自定义控件来封装...
今天玩了一把WMI,查询了一下电脑的硬件信息,感觉很多代码都...
最近在研究WinWordControl这个控件,因为上级要求在系统里,...