为什么得到的网页源代码会如此不同?谢谢!
我用下面的代码去得到新浪首页的网页源代码。
Public Class Form1
Private Sub Button1_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles Button1.Click
Dim Html As String = " "
Dim url As New Uri(http://www.sina.com.cn/)
Dim Req As Net.HttpWebRequest
Req = Net.HttpWebRequest.Create(url)
Dim Resp As Net.HttpWebResponse
Try
Dim netStream As IO.StreamReader
Resp = Req.GetResponse
netStream = New IO.StreamReader(Resp.GetResponseStream, True)
Html = netStream.ReadToEnd
Catch exc As Exception
MsgBox(exc.Message)
End Try
TextBox1.Text = Html
End Sub
End Class
得到的代码主要是分行非常混乱,有些地方该分行的却没有分。和浏览器(IE&Opera)的得到的很大区别。
谢谢指教!
[解决办法]
你取得的字符串里有换行标记吗?
浏览器肯定会自己做一些事情,要不然要它干嘛呢!?