How to read and write a line in HTML code

Question

bo0ga 2 Newbie Poster

13 Years Ago

Hey guys. I am new to programming and would appreciate any help I can get. I want to make a program that reads the HTML code of a web page and writes a specific line into a document. For example, I want my code to read the source of the www.daniweb.com homepage and write what's in between the <title></title> tags into a Notepad file, which should be "DaniWeb - Technology Publication Meets Social Media". The code below runs, but it writes out the entire HTML source of the page.

import java.io.*;
import java.net.MalformedURLException;
import java.net.URL;

public class UrlReadPageDemo {
    public static void main(String[] args) {
        try {
            URL url = new URL("http://www.daniweb.com");

            BufferedReader reader = new BufferedReader(new InputStreamReader(url.openStream()));
            BufferedWriter writer = new BufferedWriter(new FileWriter("data1.txt"));

            String line;
            while ((line = reader.readLine()) != "<title>") {
                System.out.println(line);
                writer.write(line);
                writer.newLine();
            }
            reader.close();
            writer.close();
        } catch (MalformedURLException e) {
            e.printStackTrace();
        }  catch (IOException e) {
            e.printStackTrace();
        }
    }
}

html-css java social-media

All 5 Replies

NormR1 563 Posting Sage

13 Years Ago

A simple way would be to read the input stream until the starting tag is found and then save what is read until the ending tag is found.
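A minimal sketch of that approach, not a definitive implementation. The class name TitleScanner and the method extractTitle are made up for illustration, and the sketch assumes the tags appear exactly as lowercase <title> and </title> with no attributes. It skips lines until the starting tag is found, then saves text until the ending tag is found, so it also copes with tags that sit on different lines.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;

// Hypothetical helper class, not part of the original post.
class TitleScanner {
    // Returns the text between <title> and </title>,
    // or null if the stream ends before a complete title is read.
    static String extractTitle(Reader source) throws IOException {
        BufferedReader reader = new BufferedReader(source);
        StringBuilder saved = new StringBuilder();
        boolean inTitle = false;
        String line;
        while ((line = reader.readLine()) != null) {
            if (!inTitle) {
                int start = line.indexOf("<title>");
                if (start < 0) {
                    continue; // starting tag not seen yet, skip this line
                }
                inTitle = true;
                // Keep only what follows the starting tag.
                line = line.substring(start + "<title>".length());
            } else {
                saved.append(' '); // a line break inside the title becomes a space
            }
            int end = line.indexOf("</title>");
            if (end >= 0) {
                saved.append(line, 0, end); // text before the ending tag
                return saved.toString().trim();
            }
            saved.append(line);
        }
        return null;
    }

    public static void main(String[] args) throws IOException {
        // Sample input with the tags on different lines (no network access needed).
        String page = "<html>\n<head>\n<title>\nSample Page\n</title>\n</head>\n</html>";
        System.out.println(extractTitle(new StringReader(page)));
    }
}
```

To use it with the code from the question, pass new InputStreamReader(url.openStream()) as the source and write the returned string with the BufferedWriter.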

NormR1 563 Posting Sage

13 Years Ago

What if the tags are on different lines?

anuj_sharma -3 Junior Poster · Answer 1 · 2012-05-11T04:38:00+00:00

Try using this:

 while ((line = reader.readLine()) != null && !line.equals("<title>"))

That said, I'm not sure the logic you are using will give the desired result: the loop copies the lines that come before the title tag rather than the title itself. Try thinking it over again.
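A small sketch of why the comparison matters. In Java, != and == on strings compare object references, not characters, so a line built at run time (as readLine() does) is not the same object as the literal "<title>" even when the text matches. The class name StringCompareDemo and the helper matchesTag are invented for this example.

```java
// Hypothetical demo class, not part of the original post.
class StringCompareDemo {
    // Compares by content; calling equals on the literal also makes it null-safe.
    static boolean matchesTag(String line) {
        return "<title>".equals(line);
    }

    public static void main(String[] args) {
        // Build the string at run time instead of reusing the literal.
        String line = new String("<title>");

        System.out.println(line == "<title>");  // false: two different objects
        System.out.println(matchesTag(line));   // true: same characters
        System.out.println(matchesTag(null));   // false: no NullPointerException
    }
}
```

Note that equals only matches when the whole line is exactly "<title>", so a line such as "<title>DaniWeb</title>" would still not stop the loop.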

bo0ga 2 Newbie Poster · Answer 2 · 2012-05-13T23:28:33+00:00

anyone?

ColmSmith 0 Newbie Poster · Answer 3 · 2012-05-14T20:56:55+00:00

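String line;
String outLine;

// Read until the end of the stream; readLine() returns null there.
while ((line = reader.readLine()) != null) {
    int start = line.indexOf("<title>");
    int end = line.indexOf("</title>");

    // Only handles a title whose opening and closing tags are on the same line.
    if (start >= 0 && end > start) {
        // 7 is the length of "<title>".
        outLine = line.substring(start + 7, end);
        writer.write(outLine);
        writer.newLine();
        System.out.println(outLine);
    }
}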