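Why does downloading two nearly identical web pages give garbled text for one and correct output for the other?
http://www.bengou.com/all/index.html?contentfolder=&areafolder=&letterfolder=&state=2&order=update&p=1
and
http://www.bengou.com/all/index.html?contentfolder=&areafolder=&letterfolder=&state=2&order=update&p=2
are the two pages. Their source code is almost exactly the same,
and both declare charset utf-8.
But when I download them with the code below, one of them comes back garbled: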
WebClient client = new WebClient();
client.Encoding = Encoding.GetEncoding("utf-8");
string htmlStr = client.DownloadString(url);
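// Requires: using System; using System.IO; using System.IO.Compression; using System.Net; using System.Text;
// Request the page directly so the response headers can be inspected.
HttpWebRequest request = (HttpWebRequest)WebRequest.Create("http://www.bengou.com/all/index.html?contentfolder=&areafolder=&letterfolder=&state=2&order=update&p=2");
using (HttpWebResponse response = (HttpWebResponse)request.GetResponse())
{
    // Prints the Content-Encoding header. If it reports gzip, the body is compressed
    // and must be decompressed before it is decoded as UTF-8.
    Console.WriteLine(response.ContentEncoding);
    // This assumes the body really is gzip; GZipStream throws on an uncompressed body.
    using (GZipStream zipStream = new GZipStream(response.GetResponseStream(), CompressionMode.Decompress, false))
    using (StreamReader reader = new StreamReader(zipStream, Encoding.UTF8))
    {
        Console.WriteLine(reader.ReadToEnd());
    }
}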
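
The snippet above always decompresses, so it would fail on a page that is not gzip-compressed. A minimal sketch of decoding either kind of body is shown below. It assumes the raw response bytes and the Content-Encoding header value are already in hand; `BodyDecoder`, `Decode`, and `Gzip` are hypothetical names introduced only for this illustration, and the sample HTML string is made up.

```csharp
using System;
using System.IO;
using System.IO.Compression;
using System.Text;

static class BodyDecoder
{
    // Hypothetical helper: decodes a raw HTTP body as UTF-8, decompressing first
    // only when the Content-Encoding header value is "gzip".
    public static string Decode(byte[] body, string contentEncoding)
    {
        Stream stream = new MemoryStream(body);
        if (string.Equals(contentEncoding, "gzip", StringComparison.OrdinalIgnoreCase))
        {
            stream = new GZipStream(stream, CompressionMode.Decompress);
        }
        using (StreamReader reader = new StreamReader(stream, Encoding.UTF8))
        {
            return reader.ReadToEnd();
        }
    }

    // Demo-only helper: gzip-compresses a string's UTF-8 bytes,
    // standing in for what a server would send.
    public static byte[] Gzip(string text)
    {
        using (MemoryStream output = new MemoryStream())
        {
            using (GZipStream zip = new GZipStream(output, CompressionMode.Compress))
            {
                byte[] raw = Encoding.UTF8.GetBytes(text);
                zip.Write(raw, 0, raw.Length);
            }
            // ToArray is still valid after the stream has been closed.
            return output.ToArray();
        }
    }
}

static class Program
{
    static void Main()
    {
        string html = "<p>你好</p>"; // made-up sample content
        byte[] compressed = BodyDecoder.Gzip(html);
        byte[] plain = Encoding.UTF8.GetBytes(html);

        // Both bodies should round-trip to the original text.
        Console.WriteLine(BodyDecoder.Decode(compressed, "gzip") == html);
        Console.WriteLine(BodyDecoder.Decode(plain, "") == html);
    }
}
```

When the request is made through HttpWebRequest, setting `request.AutomaticDecompression = DecompressionMethods.GZip | DecompressionMethods.Deflate` before calling GetResponse lets the framework do the same job, which may be simpler than wrapping the stream by hand.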