5
Contributors
11
Replies
26
Views
6 Years
Discussion Span
Last Post by fiaworkz
Featured Replies
  • :) Try this modification. [CODE] '//--- THIS FILE WILL BE REMOVED FROM MY WEBSITE ONCE THREAD IS SOLVED ---\\ Dim linkUrl As String = "http://www.codeorder.net/files/tempTestFile.html" '//-- used for testing only. '//--- THE FILE CONTENT IN THE LINK ABOVE IS EXACTLY THE CONTENT PREVIOUSLY POSTED BY "killerbeat" ---\\ Try Dim myResponse … Read More

0
  • How are you viewing and getting the innerHtml content of the website, from a vb.net WebBrowser?
  • What do you mean by dynamic, the text is not in the original Html file, but added from something like JavaScript once the page is loaded?
0

Hey, codeorder

1. I'm using an webbrowser, but a webrequest will work to
2. Yes, the text changes but is in the page source

0

Is it possible to post the Html source code from the web-page?

If too much code to add in this thread, send me a private message with the Html source code and bold out or highlight the area of content you want to retrieve.

0
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> 
<html> 
<head> 
    <title>Networktools: reverseip google.nl - Reverse IP Lookup, Whois, Ping, RBL Check, DNS Records, Traceroute, Host information</title> 
    <meta name="description" content="Network information tools; reverseip google.nl query whois records, ping hosts, query dns records, trace hosts, display host information, domains on ip, reverse ip, check spam blacklist, rbl check"> 
    <meta name="keywords" content="networktools reverseip google.nl networktools network network tool network tools whois domain domains dns domain name ping traceroute host dns zone dns records reverse ip rbl check spam blacklist check"> 
    <meta name="google-site-verification" content="0iZo5eSZbvygFjlVJLJJwekwfqNWtiSMyeAdnXZq6JE" /> 
 
    <script type="text/javascript" src="/js/jquery-1.3.2.min.js"></script> 
    <script type="text/javascript" src="/js/common.js"></script> 
 
    <link rel="stylesheet" type="text/css" href="/css/main.css" media="screen" /> 
    <link rel="alternate" type="application/atom+xml" title="Networktools - Atom" href="http://networktoolsblog.blogspot.com/feeds/posts/default" /> 
    <link rel="alternate" type="application/rss+xml" title="Networktools - RSS" href="http://networktoolsblog.blogspot.com/feeds/posts/default?alt=rss" /> 
 
 
    <link type="text/css" href="/css/custom-theme/jquery-ui-1.7.2.custom.css" rel="stylesheet" /> 
    <script type="text/javascript" src="/js/jquery-ui-1.7.2.custom.min.js"></script> 
 
    <script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js?pub=networktools"></script> 
 
    <!--[if lt IE 7]>
    <script defer type="text/javascript" src="/js/pngfix.js"></script>
    <![endif]--> 
 
    <!--[if lte IE 8]><link rel="stylesheet" type="text/css" href="/css/ie8-main.css" /><![endif]--> 
 
    <link rel="Shortcut Icon" href="/favicon.ico"> 
    
</head> 
<body bgcolor="#333333" link="#ff5500" vlink="#ff5500" alink="#ff0000"> 
<center> 
 
 
<div id="contentWrapper"> 
 
<div id="contentHeader"> 
    <table border="0" cellpadding="0" cellspacing="0" style="border-collapse: collapse" width="100%"> 
        <tr> 
            <td width="80%"> 
                <a href="/"> 
                    <img src="/img/network.png"/> 
                    <img src="/img/networktools.png"></img> 
                </a> 
            </td> 
        </tr> 
    </table> 
&nbsp;
</div> 
 
<div id="toolContent"> 
<noscript> 
<div style='font-size: 15px;'> 
    <b>Hi, you have JavaScript turned off. This website needs JavaScript to function properly.</b> 
</div> 
</noscript> 
<div style='float:right;' onclick="return addthis_open(this, '', '[URL]', '[TITLE]')" onmouseout="addthis_close()" onclick="return addthis_sendto()"> 
<a href="javascript:;" title='Bookmark & Share'><img src='/img/addthis.gif' /></a> 
</div> 
 
<table border="0" cellpadding="5" cellspacing="0" width="100%"> 
<tr> 
<td> 
    <table border='0'> 
        <tr> 
            <td id="toolWhois" class="tool" title='Query whois records'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>Whois</b></font> 
            </td> 
            <td id="toolPing" class="tool" title='Ping hosts'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>Ping</b></font> 
            </td> 
            <td id="toolDNS" class="tool" title='Query dns records'> 
                    <input type='radio' name='toolType' /> 
                    <font><b>DNS Zone</b></font> 
            </td> 
            <td id="toolTraceroute" class="tool" title='Trace hosts'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>Traceroute</b></font> 
            </td> 
            <td id="toolHost" class="tool" title='Display host information'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>Host</b></font> 
            </td> 
        </tr> 
        <tr> 
        <td colspan="5">&nbsp;</td> 
        </tr> 
        <tr> 
             <td id="toolReverseIP" class="tool" colspan="1" title='Show domains on an IP address'> 
                    <input type='radio' name='toolType' checked /> 
                    <font><b>Reverse IP</b></font> 
            </td> 
             <td id="toolRBLCheck" class="tool" colspan="1" title='Checks whether an IP address is listed in SPAM blacklists'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>RBL Check</b></font> 
            </td> 
             <td id="toolASInfo" class="tool" colspan="1" title='Shows information on Autonomous Systems'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>AS Info</b></font> 
            </td> 
             <td id="toolServices" class="tool" colspan="1" title='Shows active services on a host and information related to the services (comparable to a portscan)'> 
                    <input type='radio' name='toolType'  /> 
                    <font><b>Services</b></font> 
            </td> 
        </tr> 
    </table> 
</form> 
</td> 
</tr> 
 
<tr> 
 
 
    <td align="left" valign="top" width="750" colspan="3"> 
 
        <table id="submitTable"> 
            <tr> 
                <td align="center"> 
                    <form id="toolSubmitInput" action="/actionhandler" method="POST"> 
                      <input id="toolAction" type="hidden" name="toolAction"> 
                        <tr> 
                            <td><input id="toolInput" name="toolInput" type="text" class="textInput" name="ip" size="40" value="google.nl"></td> 
                            <td colspan="3" align="right"> 
                                <input type="submit" class="button" value="Go!" style='width: 50px;height: 40px;'> 
                                <!--<input type="image" src='/img/querybutton.png' style='margin-top: 2px;' title='Search!' />--> 
                            </td> 
                        </tr> 
			<tr> 
				<td> 
				<script type="text/javascript"> 
google_ad_client = "pub-0305511562035867";
google_ad_slot = "8813106053";
google_ad_width = 468;
google_ad_height = 60;
</script><script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script> 
				</td> 
			</tr> 
 
                    </form> 
                </td> 
            </tr> 
 
        </table> 
 
        <pre> 
Domains on <b>74.125.79.147</b>: 
 
74.125.79.147
www.karmats.net
alt1.toolbarqueries.google.es
alt1.toolbarqueries.google.co.uk
picasa.google.se
 
        </pre> 
 
    </td> 
</tr> 
</table> 
 
</div> 
 
<div id='footerContent'> 
 
 
&copy; 2010 FY. All rights reserved.
&nbsp;|&nbsp;<a href='' title='Random search'>Random</a> 
&nbsp;|&nbsp;<a id='newToolLink' href='javascript:;'>Request a new tool</a> 
<!--&nbsp;|&nbsp;<a id='feedbackLink' href='javascript:;'>Feedback</a>--> 
&nbsp;|&nbsp;<a id='blogLink' href='http://networktoolsblog.blogspot.com/' target="_blank">Blog</a> 
:&nbsp;<a href='http://networktoolsblog.blogspot.com/2010/08/top-searches-08aug2010-part4.html' target='_blank' title='Latest blog entry'>Top searches (08/Aug/2010) Part4</a> 
 
<br />Also available at &nbsp;
 
<a href='http://mx-records.net' title='mx-records.net'>mx-records.net</a> &nbsp;
 
<a href='http://ip-reverse-lookup.com' title='ip-reverse-lookup.com'>ip-reverse-lookup.com</a> &nbsp;
 
<a href='http://dns-records.info' title='dns-records.info'>dns-records.info</a> &nbsp;
 
<a href='http://rbl-check.info' title='rbl-check.info'>rbl-check.info</a> &nbsp;
 
<a href='http://dnsbl.nl' title='dnsbl.nl'>dnsbl.nl</a> &nbsp;
 
<br /> 
Your IP address is 85.144.248.123
<!--<a href="http://coverproxy.com" target="_blank" style="float: right;margin-top: -30px;margin-right: 40px;" title="Try Cover Proxy! Cover Proxy is a web proxy for keeping yourself anonymous and bypassing network restrictions.">
                  <img src="/img/coverproxy_new.png" border="0" />
 
                </a>--> 
<br /> 
<u>Linkpartners:</u> <a href="http://coverproxy.com" target="_blank">coverproxy.com</a> | <a href="http://warpproxy.com" target="_blank">warpproxy.com</a> | 
<a href="http://awesomeproxy.com" target="_blank">awesomeproxy.com</a> | <a href="http://stegoproxy.com" target="_blank">stegoproxy.com</a> | 
<a href="http://prxy.eu" target="_blank">prxy.eu</a> | <a href="http://swapip.com" target="_blank">swapip.com</a> 
| <a href="http://rapidprivacy.com" target="_blank">rapidprivacy.com</a> 
| <a href="http://rockhopped.com" target="_blank">rockhopped.com</a> 
<br /> 
<!--<u>Advertisement:</u> 
 | <a href="http://www.vakantiekrant.nl/reisaanbod/?tt=2794_98804_57048_&amp;r=" target="_blank">Last-minutes</a>
 | <a href="http://www.vakantiekrant.nl/reisaanbod/?tt=2794_98802_57048_&amp;r=" target="_blank">Hotels</a>
 | <a href="http://www.vdx.nl/tt/?tt=1659_42676_57048_&amp;r=" target="_blank">Linux PHP webhosting</a>
<br />--> 
<script type="text/javascript"> 
google_ad_client = "pub-0305511562035867";
google_ad_slot = "8813106053";
google_ad_width = 468;
google_ad_height = 60;
</script><script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script> 
</div> 
 
		<div id="newToolDialog" title="Request a new tool" style='display: none;'> 
			<table> 
             <tr> 
             <td valign="top">Tool name: </td> 
             <td><input id="newToolName" type="text" size="25" class="textInput" /></td> 
            </tr> 
            <tr> 
            <td valign="top">E-mail: </td> 
            <td><input id="newToolEmail" type="text" size="25" class="textInput" /></td> 
            </tr> 
            <tr> 
            <td valign="top">Description: </td> 
            <td><textarea id="newToolDescription" rows="5" cols="23" class="textAreaInput"></textarea></td> 
            </tr> 
            </table> 
		</div> 
 
		<div id="feedbackDialog" title="Feedback" style='display: none;'> 
			<table> 
            <tr> 
            <td valign="top">E-mail: </td> 
            <td><input id="feedbackEmail" type="text" size="25" class="textInput" /></td> 
            </tr> 
            <tr> 
            <td valign="top">Comments: </td> 
            <td><textarea id="feedbackComments" rows="5" cols="26" class="textAreaInput"></textarea></td> 
            </tr> 
            </table> 
		</div> 
 
 
<script type="text/javascript"> 
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl" : "http://www");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
</script> 
<script type="text/javascript"> 
try {
var pageTracker = _gat._getTracker("UA-12756464-1");
pageTracker._trackPageview();
} catch(err) {}</script> 
 
<script type="text/javascript"> 
  var uservoiceOptions = {
    key: 'networktools',
    host: 'networktools.uservoice.com', 
    forum: '51911',
    alignment: 'right',
    background_color:'#f00', 
    text_color: 'white',
    hover_color: '#06c',
    lang: 'en',
    showTab: true
  };
  function _loadUserVoice() {
    var s = document.createElement('script');
    s.src = ("https:" == document.location.protocol ? "https://" : "http://") + "uservoice.com/javascripts/widgets/tab.js";
    document.getElementsByTagName('head')[0].appendChild(s);
  }
  _loadSuper = window.onload;
  window.onload = (typeof window.onload != 'function') ? _loadUserVoice : function() { _loadSuper(); _loadUserVoice(); };
</script> 
 
</body> 
</html>

The piece of code:

<pre> 
Domains on <b>74.125.79.147</b>: 
 
74.125.79.147
[url]www.karmats.net[/url]
alt1.toolbarqueries.google.es
alt1.toolbarqueries.google.co.uk
picasa.google.se
 
        </pre>
0

Let me know if this helps.

No WebBrowser needed.

'//--- THIS FILE WILL BE REMOVED FROM MY WEBSITE ONCE THREAD IS SOLVED ---\\
        Dim linkUrl As String = "http://www.codeorder.net/files/tempTestFile.html" '//-- used for testing only.
        '//--- THE FILE CONTENT IN THE LINK ABOVE IS EXACTLY THE CONTENT PREVIOUSLY POSTED BY "killerbeat" ---\\
        Try
            Dim myResponse As Net.HttpWebResponse = Net.HttpWebRequest.Create(linkUrl).GetResponse '// connect.
            Dim myStream As IO.Stream = myResponse.GetResponseStream() '// get.
            Dim myReader As New IO.StreamReader(myStream) '// read.

            Dim myWebFileInfo As String = myReader.ReadToEnd '// get innerHtml of Page.

            '// close reader, information stream, and connection.
            myReader.Close() : myStream.Close() : myResponse.Close()

            '// get location of <pre> by index.
            Dim myStartIndex As Integer = myWebFileInfo.IndexOf("<pre>") + 5 '// add +5 to not add "<pre>" to final result.
            '// get location of </pre> by index.
            Dim myEndIndex As Integer = myWebFileInfo.IndexOf("</pre>")
            '// subtract to get a total number for the substring length result.
            Dim myStringLengthToExtract As Integer = myEndIndex - myStartIndex
            '// display result.
            MsgBox(myWebFileInfo.Substring(myStartIndex, myStringLengthToExtract), MsgBoxStyle.Information, "...codeorder...")
        Catch ex As Exception
            MsgBox("Connection Error.", MsgBoxStyle.Critical, "myCool Error Message")
        End Try
0

A Really big thanks, codeorder
Theres only one problem left:
I want to display the text into an textbox called "ipinfo"
when i do it like this:

ipinfo.text = myWebFileInfo.Substring(myStartIndex, myStringLengthToExtract)

it is one big mess

1

:)
Try this modification.

'//--- THIS FILE WILL BE REMOVED FROM MY WEBSITE ONCE THREAD IS SOLVED ---\\
        Dim linkUrl As String = "http://www.codeorder.net/files/tempTestFile.html" '//-- used for testing only.
        '//--- THE FILE CONTENT IN THE LINK ABOVE IS EXACTLY THE CONTENT PREVIOUSLY POSTED BY "killerbeat" ---\\
        Try
            Dim myResponse As Net.HttpWebResponse = Net.HttpWebRequest.Create(linkUrl).GetResponse '// connect.
            Dim myStream As IO.Stream = myResponse.GetResponseStream() '// get.
            Dim myReader As New IO.StreamReader(myStream) '// read.

            Dim myWebFileInfo As String = myReader.ReadToEnd '// get innerHtml of Page.
            
            '// close reader, information stream, and connection.
            myReader.Close() : myStream.Close() : myResponse.Close()

            '// get location of <pre> by index.
            Dim myStartIndex As Integer = myWebFileInfo.IndexOf("<pre>") + 5 '// add +5 to not add "<pre>" to final result.
            '// get location of </pre> by index.
            Dim myEndIndex As Integer = myWebFileInfo.IndexOf("</pre>")
            '// subtract to get a total number for the substring length result.
            Dim myStringLengthToExtract As Integer = myEndIndex - myStartIndex

            '// extract string.
            myWebFileInfo = myWebFileInfo.Substring(myStartIndex, myStringLengthToExtract)
            '// split into string arrays.
            Dim myHtmlTextLines() As String = myWebFileInfo.Split(vbNewLine.ToCharArray)
            '// display result.
            For Each line As String In myHtmlTextLines '// loop thru each line.
                If Not line.StartsWith(" ") Then '// do not add empty lines.
                    If TextBox1.Lines.Length = 0 Then
                        TextBox1.Text &= line & vbNewLine & vbNewLine '// add an empty line after line 1.
                    Else
                        TextBox1.Text &= line & vbNewLine
                    End If
                End If
            Next
            '// remove HTML formatting.
            With TextBox1 : .Text = .Text.Replace("<b>", "") : .Text = .Text.Replace("</b>", "") : End With

        Catch ex As Exception
            MsgBox(ex.Message, MsgBoxStyle.Critical, "myCool Error Message")
        End Try
0

Hi, Just wanted to say thanks for the code. I modified it to grab any text between <td> and </td> tags.

0

This is very cool. Any way to modify this so that it can batch process a list of URLs? i.e. copy/paste a list of urls into a textbox and have your code loop for each url and spit out results per line into another text box or file?

0

>>This is very cool...
Then start your own thread And Not spam someone else's thread. Sucks not to be able to edit past 30min., right?:D

-1

I got the same problem! but i need to use regex. does anyone know how I can fix my codes using regex?

This question has already been answered. Start a new discussion instead.
Have something to contribute to this discussion? Please be thoughtful, detailed and courteous, and be sure to adhere to our posting rules.