正则表达式问题:匹配字符串<sup>与</sup>之外的逗号
源HTML字符串:
abc<sup>1,2</sup>,defdfe<sup>3,4</sup>,ghijdfd,fdfdk
abc<sup>1,2</sup>;defdfe<sup>3,4</sup>;ghijdfd;fdfdk
Regex reg = new Regex(@"(?i)(?<=^|</sup>).+?(?=<sup>|$)");string result = reg.Replace(yourStr, delegate(Match m) { return m.Value.Replace(",", ";"); });
[解决办法]
var testSource = "abc,<sup>1,2,</sup>,defdfe<sup>3,4</sup>,ghijdfd,fdfdk<sup><span>,ss</span></sup>456464,"; var pattern = @",(?!(?:(?!</?sup\b).)*</sup\s*>)"; var resultStr = Regex.Replace(testSource,pattern,";",RegexOptions.IgnoreCase); Console.WriteLine(resultStr);