Hi, I'm a Java beginner. I want to read multiple text files, say 100 files, remove the stop words from each file, and save the contents after stop word removal into new files. For instance, a file name.txt contains: my name is sash. After removing the stop words (my, is), the result should be saved into another file named name1.txt containing: name sash. I have written code for removing stop words from a string. Now I want to do the same for a number of text files. Here is the code for a string:

/*
 * This is the code for removing stop words from a text.
 * Stop words are saved in a String array SWList[] and the text is passed
 * as a String parameter to the method cleanDoc() from main().
 */
public class testing {

String SWList[] = {"a","able","about","above","according","accordingly","across","actually","after","afterwards","again","against","all","allow","allows","almost","alone","along","already","also","although","always","am","among","amongst","an","and","another","any","anybody","anyhow","anyone","anything","anyway","anyways","anywhere","apart","appear","appreciate","appropriate","are","around","as","aside","ask","asking","associated","at","available","away","awfully",
           "b","be","became","because","become","becomes","becoming","been","before","beforehand","behind","being","believe","below","beside","besides","best","better","between","beyond","both","brief","but","by",
           "c","came","can","cannot","cant","cause","causes","certain","certainly","changes","clearly","co","com","come","comes","concerning","consequently","consider","considering","contain","containing","contains","corresponding","could","course","currently",
           "d","definitely","described","despite","did","different","do","does","doing","done","down","downwards","during",
           "e","each","edu","eg","eight","either","else","elsewhere","enough","entirely","especially","et","etc","even","ever","every","everybody","everyone","everything","everywhere","ex","exactly","example","except",
           "f","far","few","fifth","first","five","followed","following","follows","for","former","formerly","forth","four","from","further","furthermore",
           "g","get","gets","getting","given","gives","go","goes","going","gone","got","gotten","greetings",
           "h","had","happens","hardly","has","have","having","he","hello","help","hence","her","here","hereafter","hereby","herein","hereupon","hers","herself","hi","him","himself","his","hither","hopefully","how","howbeit","however",
           "i","ie","if","ignored","immediate","in","inasmuch","inc","indeed","indicate","indicated","indicates","inner","insofar","instead","into","inward","is","it","its","itself",
           "j","just","k","keep","keeps","kept","know","knows","known",
           "l","last","lately","later","latter","latterly","least","less","lest","let","like","liked","likely","little","ll","look","looking","looks","ltd",
           "m","mainly","many","may","maybe","me","mean","meanwhile","merely","might","more","moreover","most","mostly","much","must","my","myself",
           "n","name","namely","nd","near","nearly","necessary","need","needs","neither","never","nevertheless","new","next","nine","no","nobody","non","none","noone","nor","normally","not","nothing","novel","now","nowhere",
           "o","obviously","of","off","often","oh","ok","okay","old","on","once","one","ones","only","onto","or","other","others","otherwise","ought","our","ours","ourselves","out","outside","over","overall","own",
           "p","particular","particularly","per","perhaps","placed","please","plus","possible","presumably","probably","provides",
           "q","que","quite","qv",
           "r","rather","rd","re","really","reasonably","regarding","regardless","regards","relatively","respectively","right",
           "s","said","same","saw","say","saying","says","second","secondly","see","seeing","seem","seemed","seeming","seems","seen","self","selves","sensible","sent","serious","seriously","seven","several","shall","she","should","since","six","so","some","somebody","somehow","someone","something","sometime","sometimes","somewhat","somewhere","soon","sorry","specified","specify","specifying","still","sub","such","sup","sure",
           "t","take","taken","tell","tends","th","than","thank","thanks","thanx","that","thats","the","their","theirs","them","themselves","then","thence","there","thereafter","thereby","therefore","therein","theres","thereupon","these","they","think","third","this","thorough","thoroughly","those","though","three","through","throughout","thru","thus","to","together","too","took","toward","towards","tried","tries","truly","try","trying","twice","two",
           "u","un","under","unfortunately","unless","unlikely","until","unto","up","upon","us","use","used","useful","uses","using","usually","uucp",
           "v","value","various","ve","very","via","viz","vs",
           "w","want","wants","was","way","we","welcome","well","went","were","what","whatever","when","whence","whenever","where","whereafter","whereas","whereby","wherein","whereupon","wherever","whether","which","while","whither","who","whoever","whole","whom","whose","why","will","willing","wish","with","within","without","wonder","would",
           "x",
           "y","yes","yet","you","your","yours","yourself","yourselves",
           "z","zero"};


public void cleanDoc(String str)
{
    // Pad with spaces so stop words at the very start or end are matched too
    str = " " + str.trim() + " ";
    for(int i = 0; i < SWList.length; i++)
    {
        String target = " " + SWList[i] + " ";   // match whole words only
        int pos = str.indexOf(target);           // position of the stop word in the text
        while(pos != -1)                         // repeat so every occurrence is removed
        {
            System.out.println("Stopword \t\"" + SWList[i] + "\"\t found at position:\t" + pos);
            // keep the leading space, drop the stop word and its trailing space
            str = str.substring(0, pos + 1).concat(str.substring(pos + target.length()));
            pos = str.indexOf(target, pos);
        }
    }
    System.out.println("The final String is: " + str.trim()); // display the text after stop word removal
}



public static void main(String[] args)
{
    testing cd = new testing();
    cd.cleanDoc(" my name is sash and i like to do good things" );
}

} // end of class testing

All 4 Replies

Use another loop.

If I use a loop, how do I handle the file names, which I hard-coded in the program? Actually, I want to pick up all the files from a directory one by one, apply the processing, and finally store the contents back to the same files. I have done it for one file, but how should I do it for multiple files?

Well, open the API docs and look at the File class. There is a method for retrieving a list of all items in a directory, you know?
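For anyone landing on this thread later, here is a minimal sketch of that idea, not a definitive implementation. It assumes the method meant is `File.listFiles`, that only `.txt` files directly inside the directory should be processed, and that the files are UTF-8. The class name `StopWordFiles`, the helpers `removeStopWords` and `cleanDirectory`, and the shortened stop word set are made up for illustration; the full SWList from the question would go in its place. Each file is cleaned line by line and written back to the same file, as asked above.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class StopWordFiles {

    // Shortened stop word list for illustration only (assumption);
    // the full SWList from the question would be used here.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("my", "is", "and", "i", "to", "do"));

    // Returns the line with every stop word dropped; kept words are joined by single spaces.
    public static String removeStopWords(String line) {
        List<String> kept = new ArrayList<>();
        for (String word : line.trim().split("\\s+")) {
            if (!word.isEmpty() && !STOP_WORDS.contains(word.toLowerCase())) {
                kept.add(word);
            }
        }
        return String.join(" ", kept);
    }

    // Cleans every .txt file directly inside dir and overwrites each file in place.
    public static void cleanDirectory(File dir) throws IOException {
        File[] files = dir.listFiles((d, name) -> name.endsWith(".txt"));
        if (files == null) {
            throw new IOException("Not a readable directory: " + dir);
        }
        for (File file : files) {
            if (!file.isFile()) {
                continue; // skip directories that happen to end in .txt
            }
            List<String> cleaned = new ArrayList<>();
            for (String line : Files.readAllLines(file.toPath(), StandardCharsets.UTF_8)) {
                cleaned.add(removeStopWords(line));
            }
            Files.write(file.toPath(), cleaned, StandardCharsets.UTF_8);
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("Usage: java StopWordFiles <directory>");
            return;
        }
        cleanDirectory(new File(args[0]));
    }
}
```

Looking words up in a `HashSet` avoids scanning the whole stop word array for every position in the text. To write to new files instead (name.txt to name1.txt, as in the first post), the `Files.write` call could target a different path built from the original file name.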

Yup, I got that. Thanks.
