Hello,

I'm currently having some problems with the StringTokenizer from
java util.

I've declared a StringTokenizer like:

StringTokenizer token = new StringTokenizer(line,"|");

to token out a string read from a txt file such as:

Diablo|RPG|PG|PS2|20

and it works fine(detects all the "|" as delimeters) :)


The problem now is, if I have lines with spaces like:

C&C Generals|Real Time Strategy|PG|PS2|20

it detects the spaces as well and the tokenizing is out. :sad:

So, I'm wondering if anyone out there can help me solve this problem. I just want the tokenizer to recognize "|" as a delimiter and not anything else.

Thanks in advance.

Recommended Answers

All 9 Replies

Member Avatar for iamthwee

How bout this?

Tokens.java

import java.io.*;
import java.util.*;
class Tokens
{
public static void main(String[] args)
{
int count=0;
String array[]= new String[100];
StringTokenizer st = new StringTokenizer("Diablo & generals|RPG|PG|PS2|20","|");
while (st.hasMoreTokens())
{
//System.out.println(st.nextToken());
array[count] =st.nextToken();
count++;
}
System.out.print(array[0]); //change here to see different token
}
}


[/B]
[B]String line = "C&C Generals|Real Time Strategy|PG|PS2|20";[/B]
[B]String[] tokens = line.split("|");[/B]
[B]for (int i=0;i<tokens.length;i++)[/B]
[B]     System.out.println(tokens[i]);[/B]
[B]

Yup.... I'm using the

line.split("|",5)

Found that in the Sun Java forum. I can forget bout StringTokenizer now,
cause the split method is definitely much more cooler...:mrgreen:

And the StringTokenizer is not recommended as of 5.0 I think...:-|

Well...forced to do my project in Java... if it were in Delphi I can finish it with 1/3 of the time I spent using Java.:twisted:

Thanks for the replies guys...really helped:)

It depends on the situation I guess.

For mine it works just fine. No performance drop compared to StringTokenizer, plus it works better... since there's no more need to create an array or arraylist to store the stuff and it ignores the spaces in between data...plus you get to specify how many tokens you want from each line as a second argument. Also you get to save quite a number of lines.

Was having problems with StringTokenizer cause it detects the spaces as delimeters as well though I specified that the delimeter is only "|".

As for the link, I think it's full of crap. Some people just can't switch to something new where they're already comfortable with the old one. Getting memory leaks in Java is quite impossible, cause that's what a garbage collector is for. You should see how memory leaks are like in Delphi...your com doesn't hang but it moves very very very slow till you reboot...if not it'll hang indefinitely. Looping a 100,000,000 times oh my... :cheesy:

That link makes some valid points about initialization costs. I find splitString easier to use though, and it does more than just search for delimiters, its parameter is actually a regular expression. The regex search could possibly slow the parsing down a bit, but I'm not sure by how much or how much it would even take to be noticable.

Split is simple.
It follows the regular expression.
Anyways for the above code there is slight change

String line = "C&C Generals|Real Time Strategy|PG|PS2|20";
String[] tokens = line.split("\\|");
for (int i=0;i<tokens.length;i++)
System.out.println(tokens);

janarjan

give me the sample output that you want to be output

give me the sample output that you want to be output

It has been 2 years since the last post. You think that the OP was waiting for you?
If you really want to help someone then why don't you try the most recent posts?

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.