正则表达式 – 使用Perl验证年份范围

这些是源文件标题中的有效年份,版权行:

2016
2007,2016
2007-2010,2016
2010,2012-2016

本年度(2016年)应该是其中的一部分.

无效:

2016,2007 # not in order
2010-2007,2016 #range not in order.
2010-2015 # current year missing.

我制作了如下脚本,仍然在努力,是否有更好的方法来检查相同的?

use strict;
use warnings;
use List::Util 'first';
use Time::Piece;

my $t = Time::Piece->new();
my $currentYear=$t->year;

my $filename="sample.txt"; # this file name will be passed-in to the script.
my $fileContent=`cat $filename`;
my $copyrightFound = first { /copyright .* Shakespeare/i } $fileContent;
if (!$copyrightFound) {
     print "\nERROR: copyright missing\n";
     exit;
}
#copyright Found
print $fileContent;
if ($fileContent =~ /copyright \(c\) (.*) by Bill Shakespeare\s+.*All rights reserved./) {
    print "\ncopyright is good: $1\n";
    my $years = $1;
    print "\n$years,$currentYear\n";
    #e.g: 2016
    if ($years !~ ',' && $years !~ '-') {
       print "\nerror\n" if ($years !~ /$currentYear/);
    }
    #e.g: 2010-2016
    elsif ($years !~ ',') {
       print "\nError\n" if ($years !~ /\-$currentYear$/);
    }
    #eg: 2010,2016
    elsif ($years !~ '-') {
       print "\nError\n" if ($years !~ /\,$currentYear$/);
    }
    #e.g: 2008,2010,2011-2016
    elsif ($years =~ '-' && $years =~ ',') {
       print "\nError 5\n" if ($years !~ /\d\d\d\d,\d\d\d\d\-$currentYear$/);
    }
    else {
       print "invalid format"
    }

} else {
      print "\ncopyright needs to be fixed\n";
}

sample.txt有:

copyright (c) 2008,2011-2016 by Bill Shakespeare
All rights reserved.

解决方法

您可以使用 Set::IntSpan来验证日期:

use warnings;
use strict;
use Set::IntSpan qw();

my $currentYear = 2016;

while (<DATA>) {
    s/\s+//g;
    my $ok = 0;
    if (Set::IntSpan->valid($_)) { # Checks order
        my $set = Set::IntSpan->new($_);
        my @years = $set->elements();
        $ok = 1 if grep { $_ == $currentYear } @years;
    }
    print "$ok : $_\n";
}

__DATA__
2007
2007,2012-2016
2016,2007
2010-2007,2016
2010-2015

这打印:

0 : 2007
1 : 2007,2016
1 : 2007-2010,2016
1 : 2010,2012-2016
0 : 2016,2007
0 : 2010-2007,2016
0 : 2010-2015

正如pilcrow在评论中所建议的那样,简化使用成员而不是grep:

my $ok = 0;
if (Set::IntSpan->valid($_)) { 
    my $set = Set::IntSpan->new($_);
    $ok = 1 if $set->member($currentYear);
}
print "$ok : $_\n";

相关文章

正则替换html代码中img标签的src值在开发富文本信息在移动端...
正则表达式
AWK是一种处理文本文件的语言,是一个强大的文件分析工具。它...
正则表达式是特殊的字符序列,利用事先定义好的特定字符以及...
Python界一名小学生,热心分享编程学习。
收集整理每周优质开发者内容,包括、、等方面。每周五定期发...