目录

perl中正则表达式

运用实例

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
=pod
下列是一条蛋白质序列,请统计该序列长度、丙氨酸(A)的个数及所占的比例;
MNAPERQPQPDGGDAPGHEPGGSPQDELDFSILFDYEYLNPNEEEPNAHKVASPPSGPAYPDDVLDYGLKPYSPLASLSGEPPGRFGEPDRVGPQKFLSAAKPAGASGLSPRIEITPSHELIQAVGPLRMRDAGLLVEQPPLAGVAASPRFTLPVPGFEGYREPLCLSPASSGSSASFISDTFSPYTSPCVSPNNGGPDDLCPQFQNIPAHYSPRTSPIMSPRTSLAEDSCLGRHSPVPRPASRSSSPGAKRRHSCAEALVALPPGASPQRSRSPSPQPSSHVAPQDHGSPAGYPPVAGSAVIMDALNSLATDSPCGIPPKMWKTSP
=cut

$protein_sequence = "MNAPERQPQPDGGDAPGHEPGGSPQDELDFSILFDYEYLNPNEEEPNAHKVASPPSGPAYPDDVLDYGLKPYSPLASLSGEPPGRFGEPDRVGPQKFLSAAKPAGASGLSPRIEITPSHELIQAVGPLRMRDAGLLVEQPPLAGVAASPRFTLPVPGFEGYREPLCLSPASSGSSASFISDTFSPYTSPCVSPNNGGPDDLCPQFQNIPAHYSPRTSPIMSPRTSLAEDSCLGRHSPVPRPASRSSSPGAKRRHSCAEALVALPPGASPQRSRSPSPQPSSHVAPQDHGSPAGYPPVAGSAVIMDALNSLATDSPCGIPPKMWKTSP";
$len = length($protein_sequence);
$A_count =  $protein_sequence =~ s/A/A/g;
$A_per = ($A_count/$len)*100;
print "length: $len\n";
print "the count of A: $A_count\n";
print "the percentage of A: $A_per%\n";

参考

Perl 正则表达式 | 菜鸟教程 (runoob.com)

perl 统计字符串中特定字符出现的次数_ok_我的心的博客-CSDN博客