Code Simplified – Viral Sarvaiya

Code Simplified – Viral Sarvaiya, Web Developer Friends, dot net Developer, Sql Server Developer

Get HTML code from URL in ASP.NET

Posted by Viral Sarvaiya on November 10, 2009

To get HTML of web page you need only few lines of code.

To start, place two TextBox controls named txtURL and txtPageHTML, and one button control on web form, like in image bellow:

Web form for getting page HTML at design time

Now, on button’s click event function, place this code:

[ C# ]


// We  need these namespaces
using  System;
using  System.Text;
using  System.Net;

public partial class DefaultCS : System.Web.UI.Page
{

protected void  btnGetHTML_Click(object sender, EventArgs e)
{
// We'll use WebClient class for reading HTML of web  page
WebClient MyWebClient = new WebClient();

// Read web page HTML to byte array
Byte[] PageHTMLBytes;
if (txtURL.Text !=  "")
{
PageHTMLBytes =  MyWebClient.DownloadData(txtURL.Text);

// Convert result from byte array to string
// and display it in TextBox txtPageHTML
UTF8Encoding oUTF8 =  new UTF8Encoding();
txtPageHTML.Text  = oUTF8.GetString(PageHTMLBytes);
}
}
}

[ VB.NET ]

‘ We need these namespaces


Imports  System
Imports  System.Text
Imports  System.Net

Partial Class _Default
Inherits System.Web.UI.Page

Protected Sub btnGetHTML_Click(ByVal  sender As Object, ByVal e As  System.EventArgs) Handles btnGetHTML.Click
' We'll use WebClient class for reading HTML of web  page
Dim MyWebClient As  WebClient = New WebClient()

' Read web page HTML to byte array
Dim PageHTMLBytes() As Byte
If txtURL.Text <> "" Then
PageHTMLBytes =  MyWebClient.DownloadData(txtURL.Text)

' Convert result from byte array to string
' and display it in TextBox txtPageHTML
Dim oUTF8 As  UTF8Encoding = New UTF8Encoding()
txtPageHTML.Text =  oUTF8.GetString(PageHTMLBytes)
End If
End Sub

End Class

Now you can start sample project, type some valid URL in first TextBox control and click to “btnGetHTML” button. Code listed above will return HTML code of requested URL and display it in second text box, like in image bellow:

HTML code is read and shown in text box

As you see, loading of HTML code of web page is relatively easy. Analyzing of this data is much harder and depends of page structure.

Reference :

http://www.beansoftware.com/ASP.NET-FAQ/Read-Page-HTML.aspx

5 Responses to “Get HTML code from URL in ASP.NET”

  1. AV said

    Hi,

    How to get html code displayed in a text area for a particular web part, say image(which has to display html code for image path,ie img src= .. in text area)

  2. viralsarvaiya said

    Thanks for the appreciation

  3. Bravo, your idea it is brilliant

  4. Nilesh said

    thanks for the post.. its really helpful.

Leave a comment